PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID cra_locus_6071_iso_2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Apocynaceae; Rauvolfioideae; Vinceae; Catharanthinae; Catharanthus
Family Trihelix
Protein Properties Length: 538 aa    MW: 61006 Da    pI: 5.1692
Description Trihelix family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
cra_locus_6071_iso_2  genome  MPGR    -
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  75.4   9.1e-24  48     128  1          86
                             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtse 74 
                                          rW+++e+laL+++r++m+ ++r++ lk+plW+evs    e gf+rs+ +Ckek+en+ k++k++k+ +++r+++
  cra_locus_6071_iso_2_len_2158_ver_3  48 RWPREETLALLKIRSDMDLAFRDSTLKAPLWDEVSG---ELGFHRSARKCKEKFENIFKYHKRTKDCRSGRQNG 118
                                          8**********************************6...78*****************************9777 PP

                             trihelix  75 ssstcpyfdqle 86 
                                          +s  +++f+qle
  cra_locus_6071_iso_2_len_2158_ver_3 119 KS--YRFFEQLE 128
                                          76..******97 PP

2    trihelix  95.9   3.7e-30  359    443  1          86
                             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtse 74 
                                          rW+k ev aL+++r++++ +++++ lk+plWeevs +m++ g+ r++k+Ckekwen+nk+y+++ke++k+r +e
  cra_locus_6071_iso_2_len_2158_ver_3 359 RWPKAEVEALVRLRTNLGMQFQDNGLKGPLWEEVSLAMKKLGYDRNAKRCKEKWENINKYYRRVKESQKRR-PE 431
                                          8********************************************************************97.9* PP

                             trihelix  75 ssstcpyfdqle 86 
                                          ss+tcpyf+ l+
  cra_locus_6071_iso_2_len_2158_ver_3 432 SSKTCPYFHLLD 443
                                          *********987 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   0.0035    45     104  IPR001005    SANT/Myb domain
PROSITE profile  PS50090   6.992     47     102  IPR017877    Myb-like domain
CDD              cd12203   1.37E-19  47     109  No hit       No description
Pfam             PF13837   7.2E-15   47     129  No hit       No description
SMART            SM00717   0.026     356    418  IPR001005    SANT/Myb domain
PROSITE profile  PS50090   7.422     358    416  IPR017877    Myb-like domain
Pfam             PF13837   1.0E-21   358    445  No hit       No description
CDD              cd12203   5.66E-23  358    423  No hit       No description
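The eight hits listed above cluster around two stretches of the protein. The following is a minimal sketch of how overlapping hits could be collapsed into regions. It assumes the Start/End columns are 1-based and inclusive, and that the values are read as SMART 45 to 104, PROSITE 47 to 102, and so on. `HITS` and `merge_regions` are illustrative names and are not part of PlantTFDB.

```python
# (database, start, end) as read from the Protein Features table in this record.
# Reading of the coordinates is an assumption where the source columns run together.
HITS = [
    ("SMART", 45, 104),
    ("PROSITE profile", 47, 102),
    ("CDD", 47, 109),
    ("Pfam", 47, 129),
    ("SMART", 356, 418),
    ("PROSITE profile", 358, 416),
    ("Pfam", 358, 445),
    ("CDD", 358, 423),
]


def merge_regions(hits):
    """Merge overlapping [start, end] intervals (1-based, inclusive)."""
    merged = []
    for start, end in sorted((s, e) for _, s, e in hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


regions = merge_regions(HITS)
print(regions)
```

With the coordinates as read here, the hits merge into two regions, 45-129 and 356-445, which bracket the two trihelix signature domains reported at 48-128 and 359-443.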
Sequence
Protein Sequence    Length: 538 aa
MVGESSVFLD NSGGGGDGSG GAASSAAEEM VFGGGGEEER GRGEAGNRWP REETLALLKI  60
RSDMDLAFRD STLKAPLWDE VSGELGFHRS ARKCKEKFEN IFKYHKRTKD CRSGRQNGKS  120
YRFFEQLEMF DNQPSLPSPA LNQIQSYAVE TTSATEPMVI KPTSVSLDFV ASRPTQSLNM  180
DALTPSTSTT SSSGRDSEGS IKKKRKLVDY FEKLMREVLE KQENLQNQLL NALEKCERER  240
MAREEAWKKQ QMDRIRKEQE ILAHERAIAA AKDAAVMAFL QKISEQTIPM QFPETPVPVT  300
GKHAGTDQVK TPSPLPENID KRDTVVENNI NKSDSVIEKA IEQQENGANE NFSQSSASRW  360
PKAEVEALVR LRTNLGMQFQ DNGLKGPLWE EVSLAMKKLG YDRNAKRCKE KWENINKYYR  420
RVKESQKRRP ESSKTCPYFH LLDSLYERKS NRVEQNPDWS GANLKPEDIL MQMMNRQQQQ  480
HQQPQQPQSL IEDGLRENMD QNREDEAEED DEEEDDENGN GYELVANKPS SVASMGVS
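The sequence block is printed in groups of ten residues with a running position at the end of each line, so it needs cleaning before coordinates can be applied to it. The sketch below strips the spacing and counters, checks the stated length, and slices out the two trihelix signature domains. It assumes the domain Start/End values are 1-based and inclusive; `clean_sequence` and `region` are hypothetical helper names, not part of any PlantTFDB tool.

```python
import re

# Sequence block copied from this record (groups of 10, running position at line end).
RAW = """
MVGESSVFLD NSGGGGDGSG GAASSAAEEM VFGGGGEEER GRGEAGNRWP REETLALLKI  60
RSDMDLAFRD STLKAPLWDE VSGELGFHRS ARKCKEKFEN IFKYHKRTKD CRSGRQNGKS  120
YRFFEQLEMF DNQPSLPSPA LNQIQSYAVE TTSATEPMVI KPTSVSLDFV ASRPTQSLNM  180
DALTPSTSTT SSSGRDSEGS IKKKRKLVDY FEKLMREVLE KQENLQNQLL NALEKCERER  240
MAREEAWKKQ QMDRIRKEQE ILAHERAIAA AKDAAVMAFL QKISEQTIPM QFPETPVPVT  300
GKHAGTDQVK TPSPLPENID KRDTVVENNI NKSDSVIEKA IEQQENGANE NFSQSSASRW  360
PKAEVEALVR LRTNLGMQFQ DNGLKGPLWE EVSLAMKKLG YDRNAKRCKE KWENINKYYR  420
RVKESQKRRP ESSKTCPYFH LLDSLYERKS NRVEQNPDWS GANLKPEDIL MQMMNRQQQQ  480
HQQPQQPQSL IEDGLRENMD QNREDEAEED DEEEDDENGN GYELVANKPS SVASMGVS
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
assert len(seq) == 538  # matches the stated protein length

# Start/End of the two trihelix signature domains listed in this record.
domains = {1: (48, 128), 2: (359, 443)}
for number, (start, end) in domains.items():
    print(number, start, end, region(seq, start, end))
```

Under the 1-based assumption, the first slice begins with RWPREETLALL and ends with YRFFEQLE, and the second begins with RWPKAEVEAL and ends with CPYFHLLD, in agreement with the alignments shown in the Signature Domain section.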
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    201    206  KKKRKL
Binding Motif
Motif ID  Method  Source
MP00244   DAP     Transfer from AT1G76890
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_027075443.1      0.0      trihelix transcription factor GT-2-like
TrEMBL  A0A068UW86          0.0      A0A068UW86_COFCA; Uncharacterized protein
STRING  Solyc11g005380.1.1  1e-162   (Solanum lycopersicum)