PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.008G201000.1
Common NameB456_008G201000
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family WRKY
Protein Properties Length: 853aa    MW: 95272.7 Da    PI: 7.8829
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.008G201000.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY42.51.3e-137452159
                        -EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS- CS
                WRKY 21 prsYYrCtsagCpvkkkversaedpkvveitYegeHnhe 59
                         r YYrC s++C +kk ver+++d++++++tY+g Hnh+
  Gorai.008G201000.1  7 YRLYYRCLSTSCLAKKYVERDSQDTSFFVTTYHGLHNHD 45
                        699***********************************7 PP

2WRKY60.53.2e-19163223259
                         --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
                WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                         dD ++Wr   qK+v g ++prs++rCt++   gC ++k v+r ++dp+++eitY+ +H+++
  Gorai.008G201000.1 163 DDSFCWRISEQKNVIGEKYPRSFFRCTHRhnqGCLATKTVQRLDDDPTFFEITYHRKHTCN 223
                         8***************************99999**************************96 PP

3WRKY46.29.2e-15643699260
                         --SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS-- CS
                WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhek 60 
                          D y+W+  G K   g++  +s+YrC+++gC++kk vers  d+k ++++ + +Hnh+k
  Gorai.008G201000.1 643 ADAYTWKCHGTKGLIGNR-RKSFYRCAHPGCQAKKSVERSL-DGKSFIVHSRASHNHPK 699
                         69**************97.58********************.***************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007743.2E-7146IPR003657WRKY domain
SuperFamilySSF1182901.57E-11545IPR003657WRKY domain
Gene3DG3DSA:2.20.25.801.7E-11645IPR003657WRKY domain
PfamPF031065.5E-9745IPR003657WRKY domain
PROSITE profilePS5081115.066847IPR003657WRKY domain
Gene3DG3DSA:2.20.25.801.0E-18158223IPR003657WRKY domain
SuperFamilySSF1182904.05E-18160224IPR003657WRKY domain
SMARTSM007744.1E-24162224IPR003657WRKY domain
PfamPF031063.7E-17163222IPR003657WRKY domain
PROSITE profilePS5081116.398163225IPR003657WRKY domain
PfamPF078871.2E-50341482IPR012416CALMODULIN-BINDING PROTEIN60
PROSITE profilePS5081113.207637700IPR003657WRKY domain
Gene3DG3DSA:2.20.25.803.9E-12638699IPR003657WRKY domain
SuperFamilySSF1182902.35E-11640698IPR003657WRKY domain
SMARTSM007744.8E-8642699IPR003657WRKY domain
PfamPF031062.2E-9644698IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0006950Biological Processresponse to stress
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0005516Molecular Functioncalmodulin binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 853 aa     Download sequence    Send to blast
MSLSTLYRLY YRCLSTSCLA KKYVERDSQD TSFFVTTYHG LHNHDEWNSR GLSKLHSQPC  60
IDHRANNVKD AAEAAKSICP TKGIPEASIQ DESQRSGVEP QPSALEIIVS ESTESTPLPL  120
TGSPLSGDFD RDFKEQDQFN LDASKKSETQ QVQVRPGSAL TPDDSFCWRI SEQKNVIGEK  180
YPRSFFRCTH RHNQGCLATK TVQRLDDDPT FFEITYHRKH TCNLASNVMP PTAPSRNQEQ  240
GTRIEPQQQY NQLPEENQKQ QSQDLLVLPS TPGQCVEQSL NQKSNSGNDQ QTISQEDNNS  300
TIVCQASSPS PSDSSSMRSQ LSAAALSTAQ LQAQRFEEPS KQNQLYKKHY PPMLGDEVWR  360
LDMIGKNGII HKRLASEGIN TVQDFLKMSV VSPGELRRIL GPRMSDRMWD NATKHARTCA  420
MGNKYYVFRG SNYRILLNPI CQLMGAEVNG SIYPTHSLSN IDTVYLEKLV RQAYVNWSSL  480
EEIEGISNEI IGPLTQDIMA QRMGANVINT IPPNLPAMPP SGPWLPELPD HPVLMDNSNV  540
LSSPTTGECV VQSLNQKNNS GNDQQTISQE DDNSIIVYHA PPPSHSNSSA MIFELSATTL  600
PIAQVQAQSF EELAERNQSK GNLQFQACCY KQDSDLTKSG KAADAYTWKC HGTKGLIGNR  660
RKSFYRCAHP GCQAKKSVER SLDGKSFIVH SRASHNHPKS LPTRTSSLSA FSHIRASNHL  720
TIKIPDKSSV TYEGGQMDMD GLVFFVRSVE DETGLQNLMD KKSDQPSDRH KAFVGLGVRK  780
AKTSFRKRAK TTVFKLSSDT NFREMMQSFT GKHTNEVQEE KVTRGIPRKK AWDANLKEIS  840
GNDIHCVREN VG*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
6ir8_A3e-13163222968OsWRKY45
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2431141e-131AC243114.1 Gossypium raimondii clone GR__Ba0041F12-jfm, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016736441.10.0PREDICTED: uncharacterized protein LOC107946565 isoform X2
TrEMBLA0A0D2RIT20.0A0A0D2RIT2_GOSRA; Uncharacterized protein
STRINGGorai.008G201000.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1975946
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G23810.14e-23WRKY family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]