PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG034409t1
Common NameTCM_034409
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family WOX
Protein Properties Length: 229aa    MW: 25880.8 Da    PI: 4.8551
Description WOX family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG034409t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox62.94.6e-2088148257
                       T--SS--HHHHHHHHHHHHH.SSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
          Homeobox   2 rkRttftkeqleeLeelFek.nrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57 
                       r+R+t+t+ ql++Le+ +   n +ps+++++e++ +l    +++e++V++WFqNrRa+ k+
  Thecc1EG034409t1  88 RQRWTPTSLQLQILESIYDLgNGTPSKQKIKEITVELaqhgQISETNVYNWFQNRRARSKR 148
                       89*****************88**************************************97 PP

2Wus_type_Homeobox99.62.3e-3287150265
  Wus_type_Homeobox   2 artRWtPtpeQikiLeelyksGlrtPnkeeiqritaeLeeyGkiedkNVfyWFQNrkaRerqkq 65 
                        ar+RWtPt+ Q++iLe++y+ G  tP+k++i++it eL+++G+i+++NV++WFQNr+aR+++kq
   Thecc1EG034409t1  87 ARQRWTPTSLQLQILESIYDLGNGTPSKQKIKEITVELAQHGQISETNVYNWFQNRRARSKRKQ 150
                        789***********************************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007114.07484149IPR001356Homeobox domain
SMARTSM003892.2E-1286153IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.601.1E-1587153IPR009057Homeodomain-like
SuperFamilySSF466893.94E-1787151IPR009057Homeodomain-like
PfamPF000466.2E-1888148IPR001356Homeobox domain
CDDcd000868.36E-1388150No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 229 aa     Download sequence    Send to blast
MEWQNQEMQQ QEEGYLQYQF QNGVCGKVMT DEQMEELRKQ IVAYAVISEQ LAEMHKAMSA  60
HQDFTGIRLG NFYCDPIVAS IGHNITARQR WTPTSLQLQI LESIYDLGNG TPSKQKIKEI  120
TVELAQHGQI SETNVYNWFQ NRRARSKRKQ QSSGSINAEP EADAEGLGTK EKRTKPESLK  180
FIDIPAQGVE SFYFQNPDTG IDQFTGKVES SGGYDPYNNL VEQFGLLG*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1141149RRARSKRKQ
Functional Description ? help Back to Top
Source Description
UniProtTranscription factor which may be involved in developmental processes. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007018072.11e-171PREDICTED: WUSCHEL-related homeobox 8
SwissprotQ5QMM36e-65WOX8_ORYSJ; WUSCHEL-related homeobox 8
TrEMBLA0A061FF181e-170A0A061FF18_THECC; WUSCHEL related homeobox 13, putative
STRINGEOY152971e-171(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM17223611
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G35550.12e-54WUSCHEL related homeobox 13
Publications ? help Back to Top
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]