![]() |
PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
| Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
| Basic Information? help Back to Top | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| TF ID | Thecc1EG043101t1 | ||||||||
| Common Name | TCM_043101 | ||||||||
| Organism | |||||||||
| Taxonomic ID | |||||||||
| Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
|
||||||||
| Family | MYB | ||||||||
| Protein Properties | Length: 1207aa MW: 131964 Da PI: 5.1926 | ||||||||
| Description | MYB family protein | ||||||||
| Gene Model |
|
||||||||
| Signature Domain? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
| 1 | Myb_DNA-binding | 27.7 | 6.3e-09 | 816 | 857 | 3 | 46 |
SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
Myb_DNA-binding 3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46
+WT eE e++ d + +G++ +++Ia+ + ++t +c+++++k
Thecc1EG043101t1 816 PWTSEEKEIFMDKLAAFGKD-FRKIASFLD-HKTTADCVEFYYK 857
8*****************99.*********.***********98 PP
| |||||||
| 2 | Myb_DNA-binding | 34.1 | 6.5e-11 | 1036 | 1075 | 4 | 45 |
S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
Myb_DNA-binding 4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45
WT eE +++av ++G++ ++ I+r++g +R++ qck ++
Thecc1EG043101t1 1036 WTDEEKSVFIQAVSLYGKD-FAMISRCVG-TRSRDQCKVFFS 1075
*****************99.*********.********8776 PP
| |||||||
| Protein Features ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Database | Entry ID | E-value | Start | End | InterPro ID | Description |
| SuperFamily | SSF46689 | 4.11E-14 | 800 | 861 | IPR009057 | Homeodomain-like |
| PROSITE profile | PS51293 | 15.02 | 812 | 863 | IPR017884 | SANT domain |
| SMART | SM00717 | 6.5E-8 | 813 | 861 | IPR001005 | SANT/Myb domain |
| Gene3D | G3DSA:1.10.10.60 | 1.1E-5 | 813 | 858 | IPR009057 | Homeodomain-like |
| Pfam | PF00249 | 4.7E-6 | 815 | 857 | IPR001005 | SANT/Myb domain |
| PROSITE profile | PS51293 | 11.876 | 1031 | 1082 | IPR017884 | SANT domain |
| SMART | SM00717 | 1.0E-8 | 1032 | 1080 | IPR001005 | SANT/Myb domain |
| SuperFamily | SSF46689 | 5.93E-11 | 1035 | 1082 | IPR009057 | Homeodomain-like |
| Gene3D | G3DSA:1.10.10.60 | 1.6E-6 | 1035 | 1076 | IPR009057 | Homeodomain-like |
| Pfam | PF00249 | 2.4E-8 | 1036 | 1075 | IPR001005 | SANT/Myb domain |
| CDD | cd00167 | 1.09E-7 | 1036 | 1074 | No hit | No description |
| Gene Ontology ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| GO Term | GO Category | GO Description | ||||
| GO:0005634 | Cellular Component | nucleus | ||||
| GO:0003677 | Molecular Function | DNA binding | ||||
| Sequence ? help Back to Top |
|---|
| Protein Sequence Length: 1207 aa Download sequence Send to blast |
MPPEPLPWDR KDFYKERKHE RTESQPQQPS TARWRDSSSM SSYQHGSFRE FTRWGSADLR 60 RPPGHGKQGS WHLFAEENGG HGYVPSRSGD KMLDDESCRQ SVSRGDGKYS RNSSRENNRA 120 SYSQRDWRAH SWEMSNGSPN TPGRPHDVNN EQRSVDDMLT YPSHAHSDFV STWDQLHKDQ 180 HDNKTSGVNG LGTGQRCERE NSVGSMDWKP LKWSRSGSLS SRGSGFSHSS SSKSLGGVDS 240 GEGKLELQQK NLTPVQSPSG DAAACVTSAA PSDETMSRKK PRLGWGEGLA KYEKKKVEGP 300 DTSMNRGVAT ISVGNTEPNN SLGSNLAEKS PRVLGFSDCA SPATPSSVAC SSSPGVEEKS 360 FGKAANIDND ISNLCGSPSL GSQNHLEGPS FNLEKLDMNS IINMGSSLVD LLQSDDPSTV 420 DSSFVRSTAM NKLLLWKGDV LKALETTESE IDSLENELKT LKANSGSRYP CPATSSSLPM 480 EENGRACEEL EAISNMIPRP APLKIDPCGD ALEEKVPLCN GDLEEVNADA KDGDIDSPGT 540 ATSKFVEPSS LEKAVSPSDV KLHECSGDLG TVQLTTMGEV NLAPGSSNEG TSVPFSGEGS 600 ALEKIDNDVH GPEPSNSVAD IENIMYDVII ATNKELANSA SKVFNNLLPK DWCSVISEIA 660 NGACWQTDSL IREKIVKRKQ CIRFKERVLM LKFKAFQHAW KEDMRSPLIR KYRAKSQKKY 720 ELSLRSTLGG YQKHRSSIRS RLTSPAGNLS LESNVEMINF VSKLLSDSHV RLYRNALKMP 780 ALFLDEKEKQ VSRFISSNGL VEDPCAVEKE RALINPWTSE EKEIFMDKLA AFGKDFRKIA 840 SFLDHKTTAD CVEFYYKNHK SECFEKTKKK LDLSKQGKST ANTYLLTSGK KWSRELNAAS 900 LDVLGEASVI AAHAESGMRN RQTSAGRIFL GGRFDSKTSR VDDSIVERSS SFDVIGNDRE 960 TVAADVLAGI CGSLSSEAMS SCITSSADPG ESYQREWKCQ KVDSVVKRPS TSDVTQNIDD 1020 DTCSDESCGE MDPADWTDEE KSVFIQAVSL YGKDFAMISR CVGTRSRDQC KVFFSKARKC 1080 LGLDLIHPRT RNLGTPMSDD ANGGGSDIED ACVLESSVVC SDKLGSKVEE DLPSTIVSMN 1140 VDESDPTGEV SLQTDLNVSE ENNGRLVDHR DSEAVETMVS DVGQPEPICE SGGDMNVGQL 1200 ILALLV* |
| 3D Structure ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| PDB ID | Evalue | Query Start | Query End | Hit Start | Hit End | Description |
| 4a69_C | 3e-16 | 774 | 865 | 4 | 94 | NUCLEAR RECEPTOR COREPRESSOR 2 |
| 4a69_D | 3e-16 | 774 | 865 | 4 | 94 | NUCLEAR RECEPTOR COREPRESSOR 2 |
| Search in ModeBase | ||||||
| Regulation -- PlantRegMap ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Source | Upstream Regulator | Target Gene | ||||
| PlantRegMap | Retrieve | - | ||||
| Annotation -- Nucleotide ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Source | Hit ID | E-value | Description | |||
| GenBank | JX578805 | 1e-133 | JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence. | |||
| Annotation -- Protein ? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| Source | Hit ID | E-value | Description | ||||
| Refseq | XP_017984688.1 | 0.0 | PREDICTED: uncharacterized protein LOC18586364 isoform X1 | ||||
| TrEMBL | A0A061FMP2 | 0.0 | A0A061FMP2_THECC; Duplicated homeodomain-like superfamily protein isoform 1 | ||||
| STRING | EOY18596 | 0.0 | (Theobroma cacao) | ||||
| Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Hit ID | E-value | Description | ||||
| AT3G52250.1 | 0.0 | MYB family protein | ||||
| Link Out ? help Back to Top | |
|---|---|
| Phytozome | Thecc1EG043101t1 |
| Entrez Gene | 18586364 |
| Publications ? help Back to Top | |||
|---|---|---|---|
|
|||




