PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG000518t1
Common NameTCM_000518
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 521aa    MW: 59935.7 Da    PI: 9.3286
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG000518t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B359.74.9e-1928117298
                       EEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EE CS
                B3   2 fkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvv 94 
                       fk+   +d+ +++ l++p+kf++++g +   s  + l+ +sg +W v+l   k++g ++l++GWkeF++   Lk g f+vFk++g+ +f   v
  Thecc1EG000518t1  28 FKII-LEDTIRHSKLRFPRKFVTKYGDS--LSSPVLLMVPSGSTWHVEL--IKSDGDVWLQNGWKEFAEHYSLKHGHFLVFKYQGDCNF--QV 113
                       4444.489999999***********877..5557***************..**********************************9999..88 PP

                       EEE- CS
                B3  95 kvfr 98 
                        +f+
  Thecc1EG000518t1 114 LIFD 117
                       8886 PP

2B355.11.4e-17235331196
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE.....EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy...rkksgryvltkGWkeFvkangLkegDfvvFkldgrsef 90 
                       f+ ++ ps+ +  + l+lp +f++++ ++k +  ++tl +++g++W  k++    ++k  +  l  GW  F ++n+L+ gD++vF+l+++++ 
  Thecc1EG000518t1 235 FVVEMQPSYINPGRKLCLPSSFITKCLKEKVG--DVTLCTLDGKTWPAKYCCyltNNKYTKAALHCGWTAFMQDNKLELGDVCVFELIEQTKI 325
                       5566777777888999**********766666..79999***********6669888888899***********************9986555 PP

                       ..EEEE CS
                B3  91 elvvkv 96 
                        l+v +
  Thecc1EG000518t1 326 LLKVII 331
                       455555 PP

3B364.12.2e-20423520197
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE....EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliy..rkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsef 90 
                       f+ v+ p++ ++++ + +p kfa++h  +ke    ++l+ ++gr+W vk+++  +++++r   + + W+ F+++n+L++gD++vF+l++r+e+
  Thecc1EG000518t1 423 FKVVMQPAYLGARCSVNIPYKFAKRHVDEKED--RVILQVSNGRRWIVKFSVkvTNSGQRkARFYDTWRAFAQDNNLEVGDVCVFELINRDET 513
                       556778899999**************666555..6***************8887555555699999**********************99888 PP

                       ..EEEEE CS
                B3  91 elvvkvf 97 
                       +++v++f
  Thecc1EG000518t1 514 SFKVSIF 520
                       8899886 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.102.2E-2719119IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.16E-2520119IPR015300DNA-binding pseudobarrel domain
CDDcd100178.76E-2125117No hitNo description
PROSITE profilePS5086314.29726119IPR003340B3 DNA binding domain
SMARTSM010194.7E-1827119IPR003340B3 DNA binding domain
PfamPF023621.4E-1629117IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.4E-21226333IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.69E-21230335IPR015300DNA-binding pseudobarrel domain
CDDcd100176.09E-19233333No hitNo description
SMARTSM010196.5E-12235335IPR003340B3 DNA binding domain
PfamPF023626.3E-16235333IPR003340B3 DNA binding domain
PROSITE profilePS5086312.689235335IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.108.7E-24414520IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019366.67E-21416520IPR015300DNA-binding pseudobarrel domain
CDDcd100171.07E-21421520No hitNo description
SMARTSM010191.5E-11423518IPR003340B3 DNA binding domain
PfamPF023621.2E-17423520IPR003340B3 DNA binding domain
PROSITE profilePS5086314.861423520IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 521 aa     Download sequence    Send to blast
MASPSNKASC SQREINPSMF TPKTPHFFKI ILEDTIRHSK LRFPRKFVTK YGDSLSSPVL  60
LMVPSGSTWH VELIKSDGDV WLQNGWKEFA EHYSLKHGHF LVFKYQGDCN FQVLIFDMSA  120
SEIEYPHISP NMDRDEVCQE PNKEEDAKDD SVEVLYETPR VRKTRQNSQT PCLRPRKILR  180
RTTLSDKYKR DCEDVSSGEG YLKTKVPRGR HAFGDNENAT ALQRASAFKS ENFFFVVEMQ  240
PSYINPGRKL CLPSSFITKC LKEKVGDVTL CTLDGKTWPA KYCCYLTNNK YTKAALHCGW  300
TAFMQDNKLE LGDVCVFELI EQTKILLKVI IYRVSQDSSL VELYMFFPNV YWISIKLMYP  360
RLYNIMRTNG SGGLNSLANC DNGRLSTQGS TKSRHFLMPP LSSHEKARAM LRASSFRSEN  420
PFFKVVMQPA YLGARCSVNI PYKFAKRHVD EKEDRVILQV SNGRRWIVKF SVKVTNSGQR  480
KARFYDTWRA FAQDNNLEVG DVCVFELINR DETSFKVSIF *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A7e-1521752027144B3 domain-containing transcription factor VRN1
4i1k_B7e-1521752027144B3 domain-containing transcription factor VRN1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017976630.10.0PREDICTED: B3 domain-containing transcription factor VRN1
TrEMBLA0A061DGP30.0A0A061DGP3_THECC; AP2/B3-like transcriptional factor family protein, putative
STRINGEOX912720.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16925512
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.13e-41B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]