PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022769623.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family WRKY
Protein Properties Length: 1834aa    MW: 209693 Da    PI: 6.9073
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022769623.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY79.34.3e-2512821341158
                      ---SS-EEEEEEE..--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS CS
            WRKY    1 ldDgynWrKYGqK..evkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnh 58  
                      ldDgy+WrKYG+K  +v+g+++pr YY+C++ gCp+kk+ er+ +d++++++tYeg Hnh
  XP_022769623.1 1282 LDDGYRWRKYGKKkkSVQGNPHPRCYYKCSTMGCPAKKRFERDYQDTSFLITTYEGVHNH 1341
                      59*********85227******************************************** PP

2WRKY86.32.7e-2714781538259
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKe+ gs++pr+YYrCt++   +C+++k+v+rs++dp+++eitY+g+H+++
  XP_022769623.1 1478 DDGFSWRKCGQKEILGSKYPRAYYRCTHRnvqDCMATKQVQRSDDDPTIFEITYHGRHTCT 1538
                      8***************************98999**************************96 PP

3WRKY81.21.1e-2516451705259
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKev g+++pr+YYrCt++   +C+++k+v+rs++dp+++eitY g+H+++
  XP_022769623.1 1645 DDGFSWRKCGQKEVLGTKYPRAYYRCTHRnvqNCWATKQVKRSDDDPTIFEITYCGRHTCT 1705
                      8***************************99999**************************96 PP

4WRKY25.72.3e-0818121834224
                      --SS-EEEEEEE--TT-SS-EEE CS
            WRKY    2 dDgynWrKYGqKevkgsefprsY 24  
                      dDg++WrKY qKe+ gs+++rsY
  XP_022769623.1 1812 DDGFSWRKYEQKEILGSKYTRSY 1834
                      8*********************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1834 aa     Download sequence    
MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60
AVPPLFFRRM ALLQVLDLSH TSIKSLPKSL PKLVSLKKLS LRGCELFMEL SPQVGKLKNL  120
EELDLDETQI IDLPSEIGRL VKLSHLRVSF YHICGKKKSK SNFVIHPEAI SVLSQLAELS  180
IDVNPADKRW DDSVEAVVKE ACNSKTLKTL SLYLPKFQLL DNISLIYPSL PHFKFTVGHH  240
KRRIISRVPH EVEAEFRNWD KCLKFVNGES IPIEIEAVLK YSSSFFLDNH ATAMNLSEFG  300
IENMKRLKFC LLAECNKMET LIDGQMHDER NEDDQSESDT GSAEHVLESL EYLSIYYMEN  360
LWSLWRGPNR CGCMSRLKFL ALHTCPQLRH IFSRTLFENF VNLEEIIVED CPQVTSLVSR  420
ASVKPMMSNK FFPSLKRLLL LYLPRLVSIS NGLLITPKLE SIGFYNCPKL KSISKVELSS  480
KTLKIIKGER QWWEDLNWNE TEWGNRPDYL MHMFSPISNE KDVMTQLTED RDLLEATIQN  540
VGQQQEYFVR QDLLDLTESV HHNSAKMQQS VPMASGSGLP KLRKEEEWAA KEIHLTDDKH  600
NVFELPKSPY CSSLIALYLQ GNYKLTAIPP QFFRRMALLQ VLDLSHTSIK SLPKSLPKLV  660
SLKKLWLRGC ELFMELSPQV GKLKNLEELY LDETQIMDLP SEIGKLVKLS HLRVSFYHSC  720
GKKKSKSNFV IHPETISVLS QLAELSIEVN PADKRWDDLV EAVVKEVCNS KTLKTLSLHL  780
PKFQLLDNLS LIYPSLSHFS FTVGHRKNRI VSRVPYEVEA EFRNWDKCLK FVNGENIPIE  840
IEAVLKYSSS FFLDNHATAM NLSEFGIKNM KGLKFCLLAE CNKMETLIDG EINDERNEDD  900
QSKSDLGSAE HLLESLEYLS IYYMENLWSI WRGRNRYGCM SKLKFLALHT CPQLRNIFSH  960
TLLENFVNLE EIIVEDCPQV TSLVSHASVK PMMSNKFLPS LKRLLLLYLP GLISISNGLL  1020
IAPKLESIGF YNCPKLKSLS KMELSSKTLK IIKGECQWWE DLNWNETERG TRPDYLMRIF  1080
TPIRNEKDVM TQLTEDRDLL DATIQNEGQQ QDDEKLLEVS TEDHKHQCSG NCGSLLLDYK  1140
EERIPGTDVT KSPSSCILPS NPLTGTNVTK CPSACILPSN SWTGTDLTNS SSSCILPFNP  1200
LRTFDAPKQA LSFFSSEKNK RLEDCYFDQA AEICEVDVDE DEPKAKRSNC TENENKGVIG  1260
PVSKTTRGHR VAVRTRSNSV VLDDGYRWRK YGKKKKSVQG NPHPRCYYKC STMGCPAKKR  1320
FERDYQDTSF LITTYEGVHN HGCYNMRLYN LHTRLCNDHR ANYMEDAADS AKTISPTKGY  1380
EDVFEAPIQD IGVQPEYEGI PEALIRDESQ QSADPQPSEL AIRMSESPPS PIGSTPWSEV  1440
YDCDFKEQEL KDDSDKRNLS RWKELIRVPS TGLEVPPDDG FSWRKCGQKE ILGSKYPRAY  1500
YRCTHRNVQD CMATKQVQRS DDDPTIFEIT YHGRHTCTLA SHVVPSPGPL ENQDHGTSSV  1560
RSTYWKVFSM LNCNTSLVAE PQPSEQAIRM SDSQPPRIGS TPRSEVHDCD FEEQELKGDS  1620
KKRKTPSRWT ELIRVPSTGL EVPPDDGFSW RKCGQKEVLG TKYPRAYYRC THRNVQNCWA  1680
TKQVKRSDDD PTIFEITYCG RHTCTLASHV VPSPGPLKNQ DQGTCSVLST YCKAFSMLNC  1740
NTSLVAEAQP SEQAIRMSES QPPRIGSTPQ SEVHDCDFEE QELEDDSKKR KTLSRWTELI  1800
RVPSTGLEVP PDDGFSWRKY EQKEILGSKY TRSY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
116161624LKGDSKKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G11070.13e-38WRKY family protein