PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022769607.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family WRKY
Protein Properties Length: 1840 aa    MW: 210438 Da    pI: 6.9806
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022769607.1 genome NCBI View CDS
Signature Domain
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 WRKY 79.3 4.3e-25 1282 1341 1 58
                      ---SS-EEEEEEE..--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS CS
            WRKY    1 ldDgynWrKYGqK..evkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnh 58  
                      ldDgy+WrKYG+K  +v+g+++pr YY+C++ gCp+kk+ er+ +d++++++tYeg Hnh
  XP_022769607.1 1282 LDDGYRWRKYGKKkkSVQGNPHPRCYYKCSTMGCPAKKRFERDYQDTSFLITTYEGVHNH 1341
                      59*********85227******************************************** PP

2 WRKY 86.3 2.7e-27 1484 1544 2 59
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKe+ gs++pr+YYrCt++   +C+++k+v+rs++dp+++eitY+g+H+++
  XP_022769607.1 1484 DDGFSWRKCGQKEILGSKYPRAYYRCTHRnvqDCMATKQVQRSDDDPTIFEITYHGRHTCT 1544
                      8***************************98999**************************96 PP

3 WRKY 81.2 1.1e-25 1651 1711 2 59
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKev g+++pr+YYrCt++   +C+++k+v+rs++dp+++eitY g+H+++
  XP_022769607.1 1651 DDGFSWRKCGQKEVLGTKYPRAYYRCTHRnvqNCWATKQVKRSDDDPTIFEITYCGRHTCT 1711
                      8***************************99999**************************96 PP

4 WRKY 25.7 2.3e-08 1818 1840 2 24
                      --SS-EEEEEEE--TT-SS-EEE CS
            WRKY    2 dDgynWrKYGqKevkgsefprsY 24  
                      dDg++WrKY qKe+ gs+++rsY
  XP_022769607.1 1818 DDGFSWRKYEQKEILGSKYTRSY 1840
                      8*********************9 PP

Sequence
Protein Sequence    Length: 1840 aa
MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60
AVPPLFFRRM ALLQVLDLSH TSIKSLPKSL PKLVSLKKLS LRGCELFMEL SPQVGKLKNL  120
EELDLDETQI IDLPSEIGRL VKLSHLRVSF YHICGKKKSK SNFVIHPEAI SVLSQLAELS  180
IDVNPADKRW DDSVEAVVKE ACNSKTLKTL SLYLPKFQLL DNISLIYPSL PHFKFTVGHH  240
KRRIISRVPH EVEAEFRNWD KCLKFVNGES IPIEIEAVLK YSSSFFLDNH ATAMNLSEFG  300
IENMKRLKFC LLAECNKMET LIDGQMHDER NEDDQSESDT GSAEHVLESL EYLSIYYMEN  360
LWSLWRGPNR CGCMSRLKFL ALHTCPQLRH IFSRTLFENF VNLEEIIVED CPQVTSLVSR  420
ASVKPMMSNK FFPSLKRLLL LYLPRLVSIS NGLLITPKLE SIGFYNCPKL KSISKVELSS  480
KTLKIIKGER QWWEDLNWNE TEWGNRPDYL MHMFSPISNE KDVMTQLTED RDLLEATIQN  540
VGQQQEYFVR QDLLDLTESV HHNSAKMQQS VPMASGSGLP KLRKEEEWAA KEIHLTDDKH  600
NVFELPKSPY CSSLIALYLQ GNYKLTAIPP QFFRRMALLQ VLDLSHTSIK SLPKSLPKLV  660
SLKKLWLRGC ELFMELSPQV GKLKNLEELY LDETQIMDLP SEIGKLVKLS HLRVSFYHSC  720
GKKKSKSNFV IHPETISVLS QLAELSIEVN PADKRWDDLV EAVVKEVCNS KTLKTLSLHL  780
PKFQLLDNLS LIYPSLSHFS FTVGHRKNRI VSRVPYEVEA EFRNWDKCLK FVNGENIPIE  840
IEAVLKYSSS FFLDNHATAM NLSEFGIKNM KGLKFCLLAE CNKMETLIDG EINDERNEDD  900
QSKSDLGSAE HLLESLEYLS IYYMENLWSI WRGRNRYGCM SKLKFLALHT CPQLRNIFSH  960
TLLENFVNLE EIIVEDCPQV TSLVSHASVK PMMSNKFLPS LKRLLLLYLP GLISISNGLL  1020
IAPKLESIGF YNCPKLKSLS KMELSSKTLK IIKGECQWWE DLNWNETERG TRPDYLMRIF  1080
TPIRNEKDVM TQLTEDRDLL DATIQNEGQQ QDDEKLLEVS TEDHKHQCSG NCGSLLLDYK  1140
EERIPGTDVT KSPSSCILPS NPLTGTNVTK CPSACILPSN SWTGTDLTNS SSSCILPFNP  1200
LRTFDAPKQA LSFFSSEKNK RLEDCYFDQA AEICEVDVDE DEPKAKRSNC TENENKGVIG  1260
PVSKTTRGHR VAVRTRSNSV VLDDGYRWRK YGKKKKSVQG NPHPRCYYKC STMGCPAKKR  1320
FERDYQDTSF LITTYEGVHN HGCYNMRLYN LHTRLCNDHR ANYMEDAADS AKTISPTKGY  1380
EDVFEAPIQD IGVQPEYEGI PEALIRDESQ QSADPQPSEL AIRMSESPPS PIGSTPWSEV  1440
YDCDFKEQEL KDDSDKRLLC CRRNLSRWKE LIRVPSTGLE VPPDDGFSWR KCGQKEILGS  1500
KYPRAYYRCT HRNVQDCMAT KQVQRSDDDP TIFEITYHGR HTCTLASHVV PSPGPLENQD  1560
HGTSSVRSTY WKVFSMLNCN TSLVAEPQPS EQAIRMSDSQ PPRIGSTPRS EVHDCDFEEQ  1620
ELKGDSKKRK TPSRWTELIR VPSTGLEVPP DDGFSWRKCG QKEVLGTKYP RAYYRCTHRN  1680
VQNCWATKQV KRSDDDPTIF EITYCGRHTC TLASHVVPSP GPLKNQDQGT CSVLSTYCKA  1740
FSMLNCNTSL VAEAQPSEQA IRMSESQPPR IGSTPQSEVH DCDFEEQELE DDSKKRKTLS  1800
RWTELIRVPS TGLEVPPDDG FSWRKYEQKE ILGSKYTRSY
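The listing above groups residues in blocks of ten and ends most lines with a running position counter. A minimal sketch of how such a listing could be turned back into a continuous string is given here; the function name `parse_grouped_sequence` is hypothetical and not part of PlantTFDB, and only the first and last lines of the listing are used as input.

```python
import re


def parse_grouped_sequence(text: str) -> str:
    """Join a grouped protein listing into one continuous residue string.

    Assumes each line holds space-separated residue groups, optionally
    followed by a running position counter (as in the listing above).
    """
    residues = []
    for line in text.splitlines():
        # Drop the running position counter at the end of the line, if any.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between the 10-residue groups.
        residues.append("".join(line.split()))
    return "".join(residues)


# First and last lines of the listing, copied verbatim from the record.
first_line = "MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60"
last_line = "RWTELIRVPS TGLEVPPDDG FSWRKYEQKE ILGSKYTRSY"

head = parse_grouped_sequence(first_line)
tail = parse_grouped_sequence(last_line)
print(len(head), len(tail))
```

The last line follows the counter value 1800 and carries 40 residues, which agrees with the stated length of 1840 aa.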
Nuclear Localization Signal
NLS
No. Start End Sequence
1 1622 1630 LKGDSKKRK
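The Start and End columns appear to be 1-based and inclusive, which is an assumption drawn from the record itself (the last domain ends at 1840, the stated protein length). A small sketch of checking a coordinate pair against the protein sequence is given here; the helper `slice_region` is hypothetical, and the fragment is the sequence line covering residues 1621 to 1680 with its spaces removed.

```python
def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start in the full protein."""
    if end < start or start < fragment_start or end - fragment_start >= len(fragment):
        raise ValueError("region lies outside the fragment")
    return fragment[start - fragment_start : end - fragment_start + 1]


# Residues 1621-1680, from the sequence line that ends with counter 1680.
fragment = "ELKGDSKKRKTPSRWTELIRVPSTGLEVPPDDGFSWRKCGQKEVLGTKYPRAYYRCTHRN"

# NLS row: Start 1622, End 1630.
nls = slice_region(fragment, 1621, 1622, 1630)
# First ten residues of the third WRKY domain hit, which starts at 1651.
wrky3_start = slice_region(fragment, 1621, 1651, 1660)
print(nls, wrky3_start)
```

Under that assumption, the extracted NLS matches the Sequence column, and the domain slice matches the start of the third alignment block.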
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT4G11070.1 4e-38 WRKY family protein