PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA12g00370
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family Trihelix
Protein Properties Length: 403 aa    MW: 45363 Da    pI: 6.3323
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
CA12g00370       genome    PEP       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    79.2     6.2e-25    292      375    1            86
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                 rW+k+ev+aL+ +r+ +++++ +g +k+  Weev++ +++ g+ r+pk+Ckekwen+nk+yk++ ++ k r +++ ++cpyf++l+
  CA12g00370 292 RWPKSEVQALVSVRTCLDHKFLKG-AKGSVWEEVADGLAKMGYIRTPKKCKEKWENINKYYKRTIDSGKTR-PKNYRSCPYFHELD 375
                 8**********************9.99******************************************98.888899******98 PP
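
The Start/End and HMM Start/HMM End values (292 to 375 and 1 to 86) can be cross-checked against the alignment rows themselves. The following is a minimal sketch and not part of PlantTFDB: the helper names `ungapped_length` and `span_length` are made up for illustration, the two rows are copied from the alignment above, and it assumes `-` and `.` are the only gap characters.

```python
# Alignment rows copied from the record: HMM consensus and query protein.
HMM_ROW = "rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle"
QUERY_ROW = "RWPKSEVQALVSVRTCLDHKFLKG-AKGSVWEEVADGLAKMGYIRTPKKCKEKWENINKYYKRTIDSGKTR-PKNYRSCPYFHELD"


def ungapped_length(row: str) -> int:
    """Count residues in an alignment row, skipping gap characters."""
    return sum(1 for ch in row if ch not in "-.")


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range."""
    return end - start + 1


# Both rows must cover the same number of alignment columns.
assert len(HMM_ROW) == len(QUERY_ROW)

print("query residues:", ungapped_length(QUERY_ROW), "expected:", span_length(292, 375))
print("HMM positions: ", ungapped_length(HMM_ROW), "expected:", span_length(1, 86))
print("gaps in query: ", QUERY_ROW.count("-"))
```

The two gaps in the query row account for the difference between the 86 alignment columns and the 84 residues spanned on the protein.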

Protein Features
3D Structure
Database           Entry ID    E-value     Start    End    InterPro ID    Description
CDD                cd12203     5.27E-20    291      354    No hit         No description
Pfam               PF13837     5.5E-20     291      376    No hit         No description
PROSITE profile    PS50090     7.073       292      348    IPR017877      Myb-like domain
Sequence
Protein Sequence    Length: 403 aa
MEPPLTDGAV RSSNDVASFA VNLSPFPQST TNAVYQTSVI PPPEIGQLPV RKLRPVRSYV  60
DNMPCNNSNS DTMMTCSSDL GCMSQFPAQN ASTVLPREGD FVDSLAGNGV PCLGNSCSEA  120
KAEISAVQQM KSEFHADCLN LISQARILEA EFSSSSDDSE SSEAIEEHLN RKRKRKTRNS  180
IKLYLEDVVR RLMDKQEQMH KQLIEMIEKK EHERIAREEA WKQQEVERAK RDGELRAEET  240
SRNLVLISFL ENLLGEEFQI PKSSEISSVV KDEGEFHGQE ADARSSDPCN RRWPKSEVQA  300
LVSVRTCLDH KFLKGAKGSV WEEVADGLAK MGYIRTPKKC KEKWENINKY YKRTIDSGKT  360
RPKNYRSCPY FHELDTLYKN GLLNHGAGNC VKSETESKNA EE*
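
To work with the sequence programmatically, the position counters, spacing and trailing `*` have to be stripped first. The snippet below is a rough sketch under that assumption; `clean_sequence` is a hypothetical helper, not a PlantTFDB tool. It also slices out the trihelix domain using the 1-based, inclusive coordinates 292 to 375 reported for the signature domain. On this block the cleaned sequence comes to 402 residues, so the stated length of 403 appears to include the terminal `*`.

```python
import re
from typing import Tuple

# Sequence block copied verbatim from the record (position counters included).
RAW_BLOCK = """
MEPPLTDGAV RSSNDVASFA VNLSPFPQST TNAVYQTSVI PPPEIGQLPV RKLRPVRSYV  60
DNMPCNNSNS DTMMTCSSDL GCMSQFPAQN ASTVLPREGD FVDSLAGNGV PCLGNSCSEA  120
KAEISAVQQM KSEFHADCLN LISQARILEA EFSSSSDDSE SSEAIEEHLN RKRKRKTRNS  180
IKLYLEDVVR RLMDKQEQMH KQLIEMIEKK EHERIAREEA WKQQEVERAK RDGELRAEET  240
SRNLVLISFL ENLLGEEFQI PKSSEISSVV KDEGEFHGQE ADARSSDPCN RRWPKSEVQA  300
LVSVRTCLDH KFLKGAKGSV WEEVADGLAK MGYIRTPKKC KEKWENINKY YKRTIDSGKT  360
RPKNYRSCPY FHELDTLYKN GLLNHGAGNC VKSETESKNA EE*
"""


def clean_sequence(raw: str) -> Tuple[str, bool]:
    """Return (residues, has_stop) from a numbered, space-grouped block."""
    letters = re.sub(r"[\s\d]", "", raw)  # drop whitespace and position counters
    has_stop = letters.endswith("*")      # '*' marks the translated stop codon
    return letters.rstrip("*"), has_stop


protein, has_stop = clean_sequence(RAW_BLOCK)

# Convert 1-based inclusive coordinates (292..375) to a Python slice.
domain = protein[292 - 1:375]

print("residues:", len(protein), "stop symbol present:", has_stop)
print("domain:", domain)
```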
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      170      175    RKRKRK
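
The NLS coordinates can be checked by searching for the motif in the protein sequence. This is a small sketch using only the first 180 residues copied from the Sequence section; the variable names are illustrative. On this record the motif is found at zero-based positions 170 to 175, which correspond to residues 171 to 176 in 1-based numbering. That suggests the NLS table may use a different offset from the signature domain coordinates (292 to 375, which are 1-based), though this is an observation from the data, not documented behaviour.

```python
import re

# First three lines of the protein sequence, copied from the record.
RAW_PREFIX = """
MEPPLTDGAV RSSNDVASFA VNLSPFPQST TNAVYQTSVI PPPEIGQLPV RKLRPVRSYV  60
DNMPCNNSNS DTMMTCSSDL GCMSQFPAQN ASTVLPREGD FVDSLAGNGV PCLGNSCSEA  120
KAEISAVQQM KSEFHADCLN LISQARILEA EFSSSSDDSE SSEAIEEHLN RKRKRKTRNS  180
"""
MOTIF = "RKRKRK"  # NLS sequence listed in the table

# Strip whitespace and position counters to get a plain residue string.
residues = re.sub(r"[\s\d]", "", RAW_PREFIX)

zero_based_start = residues.find(MOTIF)             # -1 if the motif is absent
zero_based_end = zero_based_start + len(MOTIF) - 1  # inclusive end
one_based = (zero_based_start + 1, zero_based_end + 1)

print("zero-based:", (zero_based_start, zero_based_end))
print("one-based: ", one_based)
```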
Binding Motif
Motif ID    Method    Source                     Motif file
MP00552     DAP       Transfer from AT5G47660    Download
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    HG975523    0.0        HG975523.1 Solanum lycopersicum chromosome ch11, complete genome.
Annotation -- Protein
Source    Hit ID                E-value    Description
Refseq    XP_016549592.1        0.0        PREDICTED: trihelix transcription factor GT-2
TrEMBL    A0A2G2Y4A8            0.0        A0A2G2Y4A8_CAPAN; Uncharacterized protein
STRING    Solyc11g012720.1.1    0.0        (Solanum lycopersicum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA12884               16             20
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G47660.1    4e-50      Trihelix family protein