PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA04g13540
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family Trihelix
Protein Properties Length: 666aa    MW: 73888.4 Da    PI: 6.8869
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
CA04g13540       genome    PEP       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    95.4     5.3e-30    60       144    1            87
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW++qe+ aL+++r+em++ +r+++lk+plWeevs+km++ g++rs+k+Ckek+en+ k++k++k+g+ ++++++  ++++f+qlea
  CA04g13540  60 RWPRQETIALLKIRSEMDHVFRDSSLKGPLWEEVSRKMADLGYHRSSKKCKEKFENVYKYHKRTKDGRASKADGK--SYRFFEQLEA 144
                 8********************************************************************975555..5*******85 PP

2      trihelix    98.2     7e-31      469      553    1            86
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                 rW+k ev  Li++r++++ +++++  k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ l+
  CA04g13540 469 RWPKAEVESLIKLRTNLDMKYQENGPKGPLWEEISSSMKKIGYNRNAKRCKEKWENINKYFKKVKESSKKR-PEDSKTCPYFHMLD 553
                 8*********************************************************************8.99*********998 PP

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50090             7.12        53       117    IPR017877      Myb-like domain
SMART              SM00717             0.0042      57       119    IPR001005      SANT/Myb domain
CDD                cd12203             4.26E-25    59       124    No hit         No description
Pfam               PF13837             2.8E-19     59       145    No hit         No description
PROSITE profile    PS50090             7.584       462      526    IPR017877      Myb-like domain
SMART              SM00717             0.0041      466      528    IPR001005      SANT/Myb domain
Gene3D             G3DSA:1.10.10.60    1.4E-4      468      525    IPR009057      Homeodomain-like
CDD                cd12203             6.21E-28    468      533    No hit         No description
Pfam               PF13837             3.2E-21     468      555    No hit         No description
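The nine hits above fall into two clusters along the protein. The following is a minimal Python sketch that merges overlapping intervals to make the two regions explicit. It assumes the Start and End values are inclusive residue positions as read from the rows above; `merge_intervals` is a hypothetical helper written for this illustration, not a PlantTFDB or InterProScan function.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals into regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the current region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            # No overlap: open a new region.
            merged.append([start, end])
    return [tuple(region) for region in merged]

# (Start, End) pairs for the PROSITE, SMART, CDD, Pfam and Gene3D hits.
hits = [
    (53, 117), (57, 119), (59, 124), (59, 145),
    (462, 526), (466, 528), (468, 525), (468, 533), (468, 555),
]
regions = merge_intervals(hits)
print(regions)
```

The two merged regions bracket the two trihelix signature domains reported at 60 to 144 and 469 to 553.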
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 666 aa
MLGVSGIISD GDGTGGENPE SGGGGATSGG SSEIGLGGGG GSSSGGFMIE EGEKNSGGNR  60
WPRQETIALL KIRSEMDHVF RDSSLKGPLW EEVSRKMADL GYHRSSKKCK EKFENVYKYH  120
KRTKDGRASK ADGKSYRFFE QLEALENTQS HHSLLPPSNS RPPPPPLEAT PINMAMPMPT  180
SNVQVQASQG TGTPHLVTVS STPPPPPPNI PFAPSHQNVQ LSSPVAPSSQ PAVNPINNIP  240
HQVNANLPAH QNISAMSYST SSSTSSDEDI QRRHKKKRKW KDYFEKFTKD VINKQEESQR  300
KFLEKFEQKE RDRMVREETW KAQEMARLNR EHDLLVQERA MAAAKDAAVI SFLQKITEQK  360
NIQIPNSINV ALPSAQVQLQ MPENPPPAPA PTHSPQPQQQ TQPTVVISPA PQPSPALVVP  420
VSLPMTIPAP APALMQSLPL TPPVPAKNVE LAPKNDNGGE SHSPASSSRW PKAEVESLIK  480
LRTNLDMKYQ ENGPKGPLWE EISSSMKKIG YNRNAKRCKE KWENINKYFK KVKESSKKRP  540
EDSKTCPYFH MLDALYKEKA KNETSSLTSG FPLNPENNPM EPIMARPEQQ WPLPRHHQQH  600
QHHQQHESSR MDHDHDHESD NMDEDDHDDE DEDEDEENAY EIVANKQQPS MASANTTTTT  660
ATTTV*
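The block above uses a common flat layout: groups of ten residues, a running position at the end of each full line, and `*` as the stop marker. The following is a minimal Python sketch for turning such a block into one contiguous string, shown here on the first and last lines only. The function name `flatten_sequence_block` is an illustrative assumption and not part of any PlantTFDB tooling.

```python
import re

def flatten_sequence_block(block):
    """Join a numbered, space-grouped protein block into one string.

    Assumes each line holds residue groups followed by an optional
    running position, and that '*' marks the stop codon.
    """
    residues = []
    for line in block.splitlines():
        # Drop the trailing position counter, if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Keep letters only; this discards spaces and the '*' stop marker.
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues)

sample = """MLGVSGIISD GDGTGGENPE SGGGGATSGG SSEIGLGGGG GSSSGGFMIE EGEKNSGGNR  60
ATTTV*"""
flat = flatten_sequence_block(sample)
print(len(flat), flat[:10], flat[-5:])
```

Pasting the full block in place of `sample` should work the same way, as long as every line follows this layout.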
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      270      277    QRRHKKKR
2      270      278    QRRHKKKRK
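One way to check how these Start and End values index into the protein is to locate each motif in the sequence line covering residues 241 to 300. The sketch below does that in Python; the variable and function names are illustrative. On this record the motif begins at residue 271 when counting from 1, so the listed Start of 270 appears to be a 0-based offset, whereas the Signature Domain coordinates match 1-based positions. Treat that reading as an inference from this entry, not as documented PlantTFDB behaviour.

```python
# Residues 241-300 of CA04g13540, copied from the sequence block with spaces removed.
segment = "HQVNANLPAHQNISAMSYSTSSSTSSDEDIQRRHKKKRKWKDYFEKFTKDVINKQEESQR"
segment_start = 241  # 1-based position of the first residue in `segment`

def locate(motif):
    """Return the 1-based inclusive (start, end) of `motif` in the full protein."""
    offset = segment.find(motif)
    if offset < 0:
        raise ValueError(f"{motif!r} not found in segment")
    start = segment_start + offset
    return start, start + len(motif) - 1

for motif, listed_start, listed_end in [("QRRHKKKR", 270, 277), ("QRRHKKKRK", 270, 278)]:
    start, end = locate(motif)
    # Subtracting 1 converts 1-based positions to 0-based offsets.
    matches_zero_based = (start - 1, end - 1) == (listed_start, listed_end)
    print(motif, (start, end), matches_zero_based)
```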
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_016569487.1    0.0        PREDICTED: trihelix transcription factor GT-2-like
TrEMBL    A0A1U8GDS5        0.0        A0A1U8GDS5_CAPAN; Uncharacterized protein
TrEMBL    A0A2G2ZPT7        0.0        A0A2G2ZPT7_CAPAN; trihelix transcription factor GT-2-like
STRING    XP_009618661.1    0.0        (Nicotiana tomentosiformis)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA2402                24             59
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G76880.1    6e-48      Trihelix family protein