PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG035069t1
Common NameTCM_035069
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 488aa    MW: 53980.7 Da    PI: 5.2134
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG035069t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS3442.4e-1051014811374
              GRAS   1 lvelLlecAeavssgdle..laqalLarlselaspdg.dpmqRlaayfteALaarlarsvselykal....ppsetseknsseelaalklfse 86 
                       lv+lL+++Aea++  +++  la+ +L rl+el+sp++ ++m+RlaayfteAL+  l +s+ +  k +    p  + +e++++++laa++l+++
  Thecc1EG035069t1 101 LVHLLIAAAEALTGANKSreLARVILVRLKELVSPNDgTNMERLAAYFTEALQGLLEGSGGGHGKHFitngPHYHRDEHHQTDVLAAFQLLQD 193
                       68**********986665559***********8887669******************987766666522113444444559************ PP

              GRAS  87 vsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegpp..slRiTgvgspesg..skeeleetgerLakfAeelgvp 175
                       +sP++kf+h+taNqaIleav++++r+Hi+D+di++G+QW++L+qaL sR++gpp  +lRiT+++++ sg  s  +++etg+rL  fA+++g p
  Thecc1EG035069t1 194 MSPYVKFGHFTANQAILEAVAHDRRIHIVDYDIMEGIQWASLMQALVSRKDGPPapHLRITALSRSGSGrrSIGTIQETGRRLVAFAASIGQP 286
                       **************************************************776555*********888899999******************* PP

              GRAS 176 fefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadh.nsesFlerflealeyysal 267
                       f+f+    ++ e++++++ ++ +gEal++n++l+l ++  +  ++     ++L   k+l P++v++ve+e+   +++ F+ rf+++l++ysa+
  Thecc1EG035069t1 287 FSFHQYRLDSDETFRPSAVKLVRGEALVINCMLHLPHFSYRAPDSV---ASFLTGAKTLDPRLVTLVEEEVGPiGDGGFVGRFMDSLHHYSAV 376
                       *****989999**************************985555555...59*******************9988******************* PP

              GRAS 268 fdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee.sgslv 359
                       +dslea++p + ++r+ vEr++lg++i   ++   + r e  e+   W+e+l ++GFkpv++s  + +qaklll  ++ dgy+vee  ++ lv
  Thecc1EG035069t1 377 YDSLEAGFPMQGRARALVERVFLGPRIAGSLTRIYRTRGE--EESCPWSEWLAAVGFKPVNISFANHCQAKLLLGLFN-DGYSVEELaNNRLV 466
                       *********************************9998665..5569********************************.******97588999 PP

              GRAS 360 lgWkdrpLvsvSaWr 374
                       lgWk+r+L+s+S W+
  Thecc1EG035069t1 467 LGWKSRRLLSASIWT 481
                       **************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098553.07875461IPR005202Transcription factor GRAS
PfamPF035148.4E-103101481IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 488 aa     Download sequence    Send to blast
MAMAIDNAFE VDFSSYSTTT TTTTTDDDLA CTWNDWGSPV VDWDSLSSER DEFQDLIESM  60
MDDGTGIELA RVDHETSNSV SIDTMVADEE SNREDFKGLR LVHLLIAAAE ALTGANKSRE  120
LARVILVRLK ELVSPNDGTN MERLAAYFTE ALQGLLEGSG GGHGKHFITN GPHYHRDEHH  180
QTDVLAAFQL LQDMSPYVKF GHFTANQAIL EAVAHDRRIH IVDYDIMEGI QWASLMQALV  240
SRKDGPPAPH LRITALSRSG SGRRSIGTIQ ETGRRLVAFA ASIGQPFSFH QYRLDSDETF  300
RPSAVKLVRG EALVINCMLH LPHFSYRAPD SVASFLTGAK TLDPRLVTLV EEEVGPIGDG  360
GFVGRFMDSL HHYSAVYDSL EAGFPMQGRA RALVERVFLG PRIAGSLTRI YRTRGEEESC  420
PWSEWLAAVG FKPVNISFAN HCQAKLLLGL FNDGYSVEEL ANNRLVLGWK SRRLLSASIW  480
TSPDSDF*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A3e-589248110379Protein SCARECROW
5b3h_A2e-58924819378Protein SCARECROW
5b3h_D2e-58924819378Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtTranscriptional regulator essential for Nod-factor-induced gene expression. Acts downstream of calcium spiking and DMI3, a calcium/calmodulin-dependent protein kinase (CCaMK). {ECO:0000269|PubMed:15961668}.
Regulation -- Description ? help Back to Top
Source Description
UniProtINDUCTION: Induced 24 hours after treatment with Nod factor or Sinorhizobium meliloti. {ECO:0000269|PubMed:15961668}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007019002.20.0PREDICTED: nodulation-signaling pathway 2 protein
SwissprotQ5NE240.0NSP2_MEDTR; Nodulation-signaling pathway 2 protein
TrEMBLA0A061FHT30.0A0A061FHT3_THECC; GRAS family transcription factor
STRINGEOY162270.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM99142736
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G08250.11e-170GRAS family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  2. Tang H, et al.
    An improved genome release (version Mt4.0) for the model legume Medicago truncatula.
    BMC Genomics, 2014. 15: p. 312
    [PMID:24767513]