![]() |
PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
| Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
| Basic Information? help Back to Top | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| TF ID | Thecc1EG035565t1 | ||||||||
| Common Name | TCM_035565 | ||||||||
| Organism | |||||||||
| Taxonomic ID | |||||||||
| Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
|
||||||||
| Family | Trihelix | ||||||||
| Protein Properties | Length: 638aa MW: 70075 Da PI: 6.2329 | ||||||||
| Description | Trihelix family protein | ||||||||
| Gene Model |
|
||||||||
| Signature Domain? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
| 1 | trihelix | 94.7 | 9e-30 | 86 | 170 | 1 | 87 |
trihelix 1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87
rW++qe+laL+++r++m+ ++r+++ k+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g+ ++++++ +++fdqlea
Thecc1EG035565t1 86 RWPRQETLALLKIRSDMDVTFRDASVKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRTGKSDGK--AYRFFDQLEA 170
8********************************************************************975555..5*******85 PP
| |||||||
| 2 | trihelix | 104.1 | 1e-32 | 444 | 529 | 1 | 87 |
trihelix 1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87
rW+k ev aLi++r++++ +++++ k+plWee+s++m++ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
Thecc1EG035565t1 444 RWPKVEVEALIKLRTSLDAKYQENGPKGPLWEEISAAMKKLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 529
8*********************************************************************8.99***********85 PP
| |||||||
| Protein Features ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Database | Entry ID | E-value | Start | End | InterPro ID | Description |
| SMART | SM00717 | 4.7E-4 | 83 | 145 | IPR001005 | SANT/Myb domain |
| CDD | cd12203 | 5.70E-24 | 85 | 150 | No hit | No description |
| Pfam | PF13837 | 2.3E-19 | 85 | 171 | No hit | No description |
| PROSITE profile | PS50090 | 6.911 | 85 | 143 | IPR017877 | Myb-like domain |
| SMART | SM00717 | 4.4E-4 | 441 | 503 | IPR001005 | SANT/Myb domain |
| CDD | cd12203 | 2.82E-27 | 443 | 508 | No hit | No description |
| Pfam | PF13837 | 7.8E-23 | 443 | 530 | No hit | No description |
| Gene3D | G3DSA:1.10.10.60 | 2.5E-4 | 443 | 500 | IPR009057 | Homeodomain-like |
| PROSITE profile | PS50090 | 7.189 | 443 | 501 | IPR017877 | Myb-like domain |
| Gene Ontology ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| GO Term | GO Category | GO Description | ||||
| GO:0010192 | Biological Process | mucilage biosynthetic process | ||||
| GO:0044212 | Molecular Function | transcription regulatory region DNA binding | ||||
| Sequence ? help Back to Top |
|---|
| Protein Sequence Length: 638 aa Download sequence Send to blast |
MLGCGDTSVS VLGSSSGGGG GDVAAAAAVA TTVSSSGALD GRSEAAANML GSNGNNNNNN 60 NNTNNNSGDD DRGRVDEGDR SFGGNRWPRQ ETLALLKIRS DMDVTFRDAS VKGPLWEEVS 120 RKLAELGYHR SAKKCKEKFE NVYKYHKRTK DGRTGKSDGK AYRFFDQLEA LENISSIQSP 180 AAPPPPSPQL KPQHQTVMPA ANPPSLSHIT IPSTTLASLP QNIVPPNASF TVPSFPSTNP 240 TIQPPPPTTN PTIPSFPNIS ADLMSNSTSS STSSDLELEG RRKRKRKWKD FFERLMKEVI 300 QKQEDMQKKF LEAIEKREHE RLVREDAWRM QEMARINRER EILAQERSIA AAKDAAVMAF 360 LQKLSEQRNP GQAQNNPLPS QQPQPPPQAP PQPVPAVATA APPAATAAPV PAPAPPLLPL 420 PMVNLDVSKT DNGDQSYTPS SSSRWPKVEV EALIKLRTSL DAKYQENGPK GPLWEEISAA 480 MKKLGYNRNA KRCKEKWENI NKYFKKVKES NKKRPEDSKT CPYFHQLDAL YREKNKLDNS 540 SNELKPENSV PLLVRPEQQW PPPPSEPDDH QHDHATEDME SEQNQDEDEK DGDDEEEDEG 600 GDYEIVASKP VSMGTAAICP ASGSGSGNGA LEWRHLN* |
| Nucleic Localization Signal ? help Back to Top | |||
|---|---|---|---|
| No. | Start | End | Sequence |
| 1 | 280 | 285 | RRKRKR |
| 2 | 280 | 286 | RRKRKRK |
| 3 | 281 | 286 | RKRKRK |
| Functional Description ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Source | Description | |||||
| UniProt | Probable transcription factor that binds specific DNA sequence. {ECO:0000250}. | |||||
| Binding Motif ? help Back to Top | |||
|---|---|---|---|
| Motif ID | Method | Source | Motif file |
| MP00243 | DAP | Transfer from AT1G76880 | Download |
| |||
| Regulation -- PlantRegMap ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Source | Upstream Regulator | Target Gene | ||||
| PlantRegMap | Retrieve | Retrieve | ||||
| Annotation -- Nucleotide ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Source | Hit ID | E-value | Description | |||
| GenBank | JX963782 | 1e-104 | JX963782.1 Gossypium hirsutum clone NBRI_TRANS-1010 microsatellite sequence. | |||
| Annotation -- Protein ? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| Source | Hit ID | E-value | Description | ||||
| Refseq | XP_017981065.1 | 0.0 | PREDICTED: trihelix transcription factor GT-2 | ||||
| Swissprot | Q39117 | 1e-148 | TGT2_ARATH; Trihelix transcription factor GT-2 | ||||
| TrEMBL | A0A061FJ90 | 0.0 | A0A061FJ90_THECC; Duplicated homeodomain-like superfamily protein isoform 1 | ||||
| STRING | EOY16707 | 0.0 | (Theobroma cacao) | ||||
| Orthologous Group ? help Back to Top | |||
|---|---|---|---|
| Lineage | Orthologous Group ID | Taxa Number | Gene Number |
| Malvids | OGEM4849 | 25 | 53 |
| Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Hit ID | E-value | Description | ||||
| AT1G76880.1 | 1e-92 | Trihelix family protein | ||||
| Link Out ? help Back to Top | |
|---|---|
| Phytozome | Thecc1EG035565t1 |
| Entrez Gene | 18592605 |
| Publications ? help Back to Top | |||
|---|---|---|---|
|
|||




