PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.008G200800.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family WRKY
Protein Properties Length: 1341aa    MW: 151574 Da    PI: 6.9894
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.008G200800.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY86.13.2e-27668726159
                         ---SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS- CS
                WRKY   1 ldDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhe 59 
                         ldDgy+WrK+GqK+vkg+++pr YYrC s+ C +kk ver+++d++++++tY+g Hnh+
  Gorai.008G200800.1 668 LDDGYRWRKWGQKMVKGNPYPRLYYRCLSTCCLAKKYVERDSQDTSFFVTTYHGLHNHD 726
                         59********************************************************7 PP

2WRKY84.88e-27785845259
                         --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
                WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                         dDg++WrKYGqK++ g+++pr YYrCt++   gC ++k v+rs++dp+++eitY+g+H+++
  Gorai.008G200800.1 785 DDGFCWRKYGQKDILGAKYPRRYYRCTYKhnqGCLATKTVKRSSDDPTIFEITYRGKHTCN 845
                         8**************************998899**************************96 PP

3WRKY89.42.9e-2812731331360
                          -SS-EEEEEEE--TT-SS-EEEEEE-ST.T---EEEEEE-SSSTTEEEEEEES--SS-- CS
                WRKY    3 DgynWrKYGqKevkgsefprsYYrCtsa.gCpvkkkversaedpkvveitYegeHnhek 60  
                          D y+WrKYGqK++kgs+++r+YY+C+s  gCp++k+ er+ e+p+++++tYegeHnh+k
  Gorai.008G200800.1 1273 DDYSWRKYGQKPIKGSPYSRGYYKCSSMrGCPARKHSERCLEEPSMLIVTYEGEHNHPK 1331
                          99***********************9988****************************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF520581.09E-2734216IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.107.1E-1935175IPR032675Leucine-rich repeat domain, L domain-like
PfamPF138551.3E-946105IPR001611Leucine-rich repeat
SMARTSM00369306992IPR003591Leucine-rich repeat, typical subtype
SMARTSM003694.9116139IPR003591Leucine-rich repeat, typical subtype
Gene3DG3DSA:3.80.10.104.3E-7176209IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.104.3E-7294478IPR032675Leucine-rich repeat domain, L domain-like
SuperFamilySSF520581.09E-27337481IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:2.20.25.807.5E-26661726IPR003657WRKY domain
PROSITE profilePS5081128.151663728IPR003657WRKY domain
SuperFamilySSF1182907.06E-24664726IPR003657WRKY domain
SMARTSM007741.9E-30668727IPR003657WRKY domain
PfamPF031061.1E-20669726IPR003657WRKY domain
Gene3DG3DSA:2.20.25.801.4E-25783845IPR003657WRKY domain
SuperFamilySSF1182901.83E-23783846IPR003657WRKY domain
SMARTSM007742.5E-37784846IPR003657WRKY domain
PfamPF031066.6E-24785844IPR003657WRKY domain
PROSITE profilePS5081122.114785847IPR003657WRKY domain
PfamPF078873.2E-489631104IPR012416CALMODULIN-BINDING PROTEIN60
PfamPF105331.2E-512481269IPR018872Zn-cluster domain
Gene3DG3DSA:2.20.25.808.0E-3012651331IPR003657WRKY domain
PROSITE profilePS5081128.44912661332IPR003657WRKY domain
SuperFamilySSF1182902.62E-2412691331IPR003657WRKY domain
SMARTSM007742.4E-3412711331IPR003657WRKY domain
PfamPF031067.4E-2412731330IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0006950Biological Processresponse to stress
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0005516Molecular Functioncalmodulin binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1341 aa     Download sequence    Send to blast
MLSSVTKASG PGLAEVQKVE WGAKEIHLAD DKHIVSELPR CPNCSSLIAL YLQGNYELTA  60
IPPLFFQRMQ LLQILDLSRT SIKSLPKSLR KLFALKKLLL QGCDLFMELS PQVGKLKNLE  120
ELHLDETQIM GLPKETGKLL KLQLLKVSFY HLCGKKTLKS DILIHPETIS NLSQLAELSI  180
DVNPADKRWD DSVEAVLKEV CNSKTLRTLS LYLPTFQLLD YVSLIYPSLS RFRFTVGHHK  240
RRIISRVPHE VEAEFRKWDK CLRFVNGETI PTQIKGVLKY STSFFLDHHT TAMNLSEFGI  300
ENMKGLKFCL LAECNKMETL IDGEMHYERN EDDQSESDPG SVQQMLESLE YLSIYYMEDL  360
QCIWRGADRF VCMSKLKFLA LHACPQLSEI FSLTLLENFI NLEEIILEDC PRVTSLVSHA  420
SVKPIMSDKI FLPSLKRLLV LYLPELVSIS NGLLIAPKLE SIGCYNCPKL KSISKMELSS  480
KTLKIIKGEC EWWEGMNWNE TEWGDGPGYL MHIFSPIDNQ KDVMTQMVEG RDPHEATIQN  540
EDQQLGDQKP LEVSTQDHRG QCLDYTEERM MGTDVKEPPS GCVFPSNPLC MTSHAPEQAR  600
SFTSGNNRSL EDDECFLVPN IVEVDVDEDE PKAKRWNHTE NENKGVIGSA SKTTRGNRAA  660
NQIRSKVLDD GYRWRKWGQK MVKGNPYPRL YYRCLSTCCL AKKYVERDSQ DTSFFVTTYH  720
GLHNHDEWNS RGLSKLHSQP CIDHRANNVK DAAEAAKSIC PTKEYGDVFQ ASIQDEGAHP  780
DLAPDDGFCW RKYGQKDILG AKYPRRYYRC TYKHNQGCLA TKTVKRSSDD PTIFEITYRG  840
KHTCNLASNV MPPTAPSEYQ EQGTRIEPQQ QHNQLTEENQ KQQSQDLLVL PSTPGQCVEQ  900
SFNQKSNSGN DHLTITHEDN NSTIVCQESS PSPSDSSSMG SQLSAAALST EQLQAQRFDE  960
PAKQNQLYKK HYPPMLGDEV WRLDRIDKNG IIHKRLASEG INTVQDFLKM WVVNPGELRR  1020
ILGPIMSERK LDHAINHART CVMGNKYYVF RGSNYRILLN PICQLMGAEV NGSIYPTHSL  1080
SNIDTVYLEK LVRQAYVNWS SLEEIEGISN EIIGPLTQDI MAQRTGANVI NTIPSNLRAM  1140
PPSGPWLPEL PDHPVLMDNS NVLSSPTTGE CAVQYLNQKN NSGNDQQTIS QEDNNSTIVC  1200
QVPPPSQSNS SAMTSELSAT ALPTGSDLIQ SRAPADAHSW HCYGTKGKKR KHRVKRSIKV  1260
PSIGNKLVDI PPDDYSWRKY GQKPIKGSPY SRGYYKCSSM RGCPARKHSE RCLEEPSMLI  1320
VTYEGEHNHP KLPSRATTSS *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1wj2_A9e-2166713341478Probable WRKY transcription factor 4
2lex_A9e-2166713341478Probable WRKY transcription factor 4
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2431140.0AC243114.1 Gossypium raimondii clone GR__Ba0041F12-jfm, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016736437.10.0PREDICTED: uncharacterized protein LOC107946563 isoform X1
TrEMBLA0A1U8NBP10.0A0A1U8NBP1_GOSHI; uncharacterized protein LOC107946563 isoform X1
STRINGGorai.008G200800.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1975946
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G30590.12e-50WRKY DNA-binding protein 21