PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022769612.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family WRKY
Protein Properties Length: 1839 aa    MW: 210367 Da    pI: 6.9806
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022769612.1 genome NCBI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 WRKY 79.3 4.3e-25 1282 1341 1 58
                      ---SS-EEEEEEE..--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS CS
            WRKY    1 ldDgynWrKYGqK..evkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnh 58  
                      ldDgy+WrKYG+K  +v+g+++pr YY+C++ gCp+kk+ er+ +d++++++tYeg Hnh
  XP_022769612.1 1282 LDDGYRWRKYGKKkkSVQGNPHPRCYYKCSTMGCPAKKRFERDYQDTSFLITTYEGVHNH 1341
                      59*********85227******************************************** PP

2 WRKY 86.3 2.7e-27 1483 1543 2 59
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKe+ gs++pr+YYrCt++   +C+++k+v+rs++dp+++eitY+g+H+++
  XP_022769612.1 1483 DDGFSWRKCGQKEILGSKYPRAYYRCTHRnvqDCMATKQVQRSDDDPTIFEITYHGRHTCT 1543
                      8***************************98999**************************96 PP

3 WRKY 81.2 1.1e-25 1650 1710 2 59
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKev g+++pr+YYrCt++   +C+++k+v+rs++dp+++eitY g+H+++
  XP_022769612.1 1650 DDGFSWRKCGQKEVLGTKYPRAYYRCTHRnvqNCWATKQVKRSDDDPTIFEITYCGRHTCT 1710
                      8***************************99999**************************96 PP

4 WRKY 25.7 2.3e-08 1817 1839 2 24
                      --SS-EEEEEEE--TT-SS-EEE CS
            WRKY    2 dDgynWrKYGqKevkgsefprsY 24  
                      dDg++WrKY qKe+ gs+++rsY
  XP_022769612.1 1817 DDGFSWRKYEQKEILGSKYTRSY 1839
                      8*********************9 PP
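The four hits above can be compared at their core heptapeptide. The sketch below is illustrative only: it assumes the heptapeptide starts at the first "WRK" in each hit, and it compares against WRKYGQK, the commonly cited WRKY consensus, which this record does not state explicitly (the HMM line shows WrKYGqK). The names `CANONICAL`, `core_heptapeptide`, and `mismatches` are hypothetical helpers, and each input is only the opening residues of a hit as printed in the alignments.

```python
from typing import Optional

# Assumption: commonly cited WRKY core motif; not stated in the record itself.
CANONICAL = "WRKYGQK"

def core_heptapeptide(segment: str) -> Optional[str]:
    """Return the 7 residues starting at the first 'WRK', or None if absent."""
    seg = segment.upper()  # alignment insert columns are printed in lowercase
    start = seg.find("WRK")
    if start == -1 or start + 7 > len(seg):
        return None
    return seg[start:start + 7]

def mismatches(a: str, b: str) -> int:
    """Count positions where two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))

# Opening residues of each hit, copied from the alignments above.
hits = {
    1: "LDDGYRWRKYGKKkkSVQG",
    2: "DDGFSWRKCGQKEILGSK",
    3: "DDGFSWRKCGQKEVLGTK",
    4: "DDGFSWRKYEQKEILGSK",
}

report = {}
for number, segment in hits.items():
    motif = core_heptapeptide(segment)
    report[number] = (motif, mismatches(motif, CANONICAL))

for number, (motif, diff) in report.items():
    print(number, motif, diff)
```

Under these assumptions each hit differs from WRKYGQK at exactly one position (WRKYGKK, WRKCGQK, WRKCGQK, WRKYEQK).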

Sequence
Protein Sequence    Length: 1839 aa
MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60
AVPPLFFRRM ALLQVLDLSH TSIKSLPKSL PKLVSLKKLS LRGCELFMEL SPQVGKLKNL  120
EELDLDETQI IDLPSEIGRL VKLSHLRVSF YHICGKKKSK SNFVIHPEAI SVLSQLAELS  180
IDVNPADKRW DDSVEAVVKE ACNSKTLKTL SLYLPKFQLL DNISLIYPSL PHFKFTVGHH  240
KRRIISRVPH EVEAEFRNWD KCLKFVNGES IPIEIEAVLK YSSSFFLDNH ATAMNLSEFG  300
IENMKRLKFC LLAECNKMET LIDGQMHDER NEDDQSESDT GSAEHVLESL EYLSIYYMEN  360
LWSLWRGPNR CGCMSRLKFL ALHTCPQLRH IFSRTLFENF VNLEEIIVED CPQVTSLVSR  420
ASVKPMMSNK FFPSLKRLLL LYLPRLVSIS NGLLITPKLE SIGFYNCPKL KSISKVELSS  480
KTLKIIKGER QWWEDLNWNE TEWGNRPDYL MHMFSPISNE KDVMTQLTED RDLLEATIQN  540
VGQQQEYFVR QDLLDLTESV HHNSAKMQQS VPMASGSGLP KLRKEEEWAA KEIHLTDDKH  600
NVFELPKSPY CSSLIALYLQ GNYKLTAIPP QFFRRMALLQ VLDLSHTSIK SLPKSLPKLV  660
SLKKLWLRGC ELFMELSPQV GKLKNLEELY LDETQIMDLP SEIGKLVKLS HLRVSFYHSC  720
GKKKSKSNFV IHPETISVLS QLAELSIEVN PADKRWDDLV EAVVKEVCNS KTLKTLSLHL  780
PKFQLLDNLS LIYPSLSHFS FTVGHRKNRI VSRVPYEVEA EFRNWDKCLK FVNGENIPIE  840
IEAVLKYSSS FFLDNHATAM NLSEFGIKNM KGLKFCLLAE CNKMETLIDG EINDERNEDD  900
QSKSDLGSAE HLLESLEYLS IYYMENLWSI WRGRNRYGCM SKLKFLALHT CPQLRNIFSH  960
TLLENFVNLE EIIVEDCPQV TSLVSHASVK PMMSNKFLPS LKRLLLLYLP GLISISNGLL  1020
IAPKLESIGF YNCPKLKSLS KMELSSKTLK IIKGECQWWE DLNWNETERG TRPDYLMRIF  1080
TPIRNEKDVM TQLTEDRDLL DATIQNEGQQ QDDEKLLEVS TEDHKHQCSG NCGSLLLDYK  1140
EERIPGTDVT KSPSSCILPS NPLTGTNVTK CPSACILPSN SWTGTDLTNS SSSCILPFNP  1200
LRTFDAPKQA LSFFSSEKNK RLEDCYFDQA AEICEVDVDE DEPKAKRSNC TENENKGVIG  1260
PVSKTTRGHR VAVRTRSNSV VLDDGYRWRK YGKKKKSVQG NPHPRCYYKC STMGCPAKKR  1320
FERDYQDTSF LITTYEGVHN HGCYNMRLYN LHTRLCNDHR ANYMEDAADS AKTISPTKGY  1380
EDVFEAPIQD IGVQPEYEGI PEALIRDESQ QSDPQPSELA IRMSESPPSP IGSTPWSEVY  1440
DCDFKEQELK DDSDKRLLCC RRNLSRWKEL IRVPSTGLEV PPDDGFSWRK CGQKEILGSK  1500
YPRAYYRCTH RNVQDCMATK QVQRSDDDPT IFEITYHGRH TCTLASHVVP SPGPLENQDH  1560
GTSSVRSTYW KVFSMLNCNT SLVAEPQPSE QAIRMSDSQP PRIGSTPRSE VHDCDFEEQE  1620
LKGDSKKRKT PSRWTELIRV PSTGLEVPPD DGFSWRKCGQ KEVLGTKYPR AYYRCTHRNV  1680
QNCWATKQVK RSDDDPTIFE ITYCGRHTCT LASHVVPSPG PLKNQDQGTC SVLSTYCKAF  1740
SMLNCNTSLV AEAQPSEQAI RMSESQPPRI GSTPQSEVHD CDFEEQELED DSKKRKTLSR  1800
WTELIRVPST GLEVPPDDGF SWRKYEQKEI LGSKYTRSY
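The listing above is printed in blocks of ten residues with a running position counter at the end of each line. A minimal sketch of turning such a listing into one contiguous string follows; `parse_blocked_sequence` is a hypothetical helper name, and the sample is only the first two lines of the listing, not the full protein.

```python
import re

def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one contiguous string.

    Tokens made only of uppercase letters are kept as residues; the
    trailing position counters (60, 120, ...) are dropped.
    """
    residues = []
    for line in text.splitlines():
        residues.extend(
            tok for tok in line.split() if re.fullmatch(r"[A-Z]+", tok)
        )
    return "".join(residues)

# First two lines of the listing above, copied verbatim.
sample = """\
MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60
AVPPLFFRRM ALLQVLDLSH TSIKSLPKSL PKLVSLKKLS LRGCELFMEL SPQVGKLKNL  120
"""

seq = parse_blocked_sequence(sample)
print(len(seq))   # 120
print(seq[:20])   # MQQSFPKASGSRLPELRKKE
```

Applied to the whole listing, the resulting length can be checked against the stated 1839 aa.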
Nuclear Localization Signal
NLS
No. Start End Sequence
1 1621 1629 LKGDSKKRK
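The NLS and domain coordinates in this record are 1-based and inclusive (the NLS spans 1621 to 1629, nine residues). A small sketch of slicing by such coordinates is shown below; `region` and its `first_pos` parameter are hypothetical names, and the fragment is the single listing line covering residues 1621 to 1680 rather than the whole protein.

```python
def region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from seq.

    first_pos is the full-protein position of seq[0], so a fragment can
    be sliced with full-length coordinates.
    """
    last_pos = first_pos + len(seq) - 1
    if not (first_pos <= start <= end <= last_pos):
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - first_pos : end - first_pos + 1]

# Residues 1621-1680, copied from the sequence listing in this record.
fragment = (
    "LKGDSKKRKT" "PSRWTELIRV" "PSTGLEVPPD"
    "DGFSWRKCGQ" "KEVLGTKYPR" "AYYRCTHRNV"
)

nls = region(fragment, 1621, 1629, first_pos=1621)
domain3_start = region(fragment, 1650, 1659, first_pos=1621)
print(nls)            # LKGDSKKRK
print(domain3_start)  # DDGFSWRKCG
```

The second slice reproduces the opening residues of the third WRKY hit, whose reported start is 1650.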
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT4G11070.1 4e-38 WRKY family protein