PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG91951.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 932aa    MW: 102814 Da    PI: 6.2726
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG91951.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix37.18.3e-12320393268
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +++ aLi+a+r        m+++++r k++k +W +v+++++  g+e ++++C +kw+nl ++ kk+ + +
  GBG91951.1 320 WSVEHIIALIRAKRdqdvhlqGMGHAYGRMKARKWKWNDVARRLKNVGVEWKAEKCGKKWDNLMQQLKKVHHFQ 393
                 9*************66666666666666889**************************************98765 PP

Sequence ? help Back to Top
Protein Sequence    Length: 932 aa     Download sequence    
MADVFRQLPL LLPVVFLLVV VVVFHVCFLG NAPRRRRRRT LPMKPVRTDS RQGSPRGGGE  60
QVVSRCVIGS SSSVCRLSLG VGNSALPPHL QPLPDSSDEE EREGRARTVP LGSGSTQEWA  120
WTELCGGSVG VHEQSFTELL RPGLDGEERH GGVNLSSAVA GTIDFHLQER AGSPTFHAVT  180
VSGVCCVQPR TSTWHCCGGG GNLDDVGDDR DGRLLWAEQR RELREGREEA IRRRVERLRM  240
DRQAEEVEEP DAGLPSEEDD DENKGEGGDG NGVHASPSEN SDMGGKGGKT QAKSGNGRGR  300
PKKAQAKPND GEAEEKRNFW SVEHIIALIR AKRDQDVHLQ GMGHAYGRMK ARKWKWNDVA  360
RRLKNVGVEW KAEKCGKKWD NLMQQLKKVH HFQSSSGGID FFQLSAKERA SKGFNFNMNR  420
VVYDEIEGLT GFNETINSRN VADTGASRGV RLPSTSNGDP EAVGDADAGA GGDDEEEGST  480
RGSSQTTGSP GAFGKRKSTR QQTFEAMTDC MEKHGALMAA TMESASXRQC SIQLRXXEAL  540
EXEVQAVIDV SAKRSTAPQP RGEAVPVVRE VADGAKAGDG GAVGEDDEAL VNKLRGQRAE  600
AKAMEVTARL WTDDIRFWND TRGHKIIQII HEARVYLVDV ATGVQPSMIR RSINLPHSSI  660
PQKKIEDGSE LRAAKERALK VGNIAKRAIH GWIFKSDSRH KGYHLAYQYA LNHTATDIAR  720
VMWVSEDWRS LVSPMVIRNT LELGMELPLW FVGANVVDRH QDDECATYQE TIAQRLVCDF  780
NNVVEAAQVM DSGRVSYERL KSLAEAMRYL LAAAVWIMRM AEDDSRSHYD SWVFVQLTAK  840
TTLLASMDRH FDSRRHVLQA ATVMTDKLGR PPPTFAPPPL YIPDWASKCG VTFNHDATLS  900
SPMEATKMEW IDIGPPEEED DDDEDVDGAE GG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13439RRRRRR
23440RRRRRRT
33541RRRRRRT
4297305GRGRPKKAQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-07Trihelix family protein