PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG74258.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 764aa    MW: 84047 Da    PI: 9.1825
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG74258.1 genome NCBI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 trihelix 39.1 2e-12 147 222 2 70
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+ + + +Li+a+r+ + +l+       r k ++ +W +v+ ++++ g+ r++ +C +kw+nl +++kk+ + ++ 
  GBG74258.1 147 WSVDDIITLIRAKRDQDAHLQgmghayaRMKPREWKWLDVATRLKKVGVDREADRCWKKWDNLMQQFKKVHHFQGL 222
                 999***********777766643333336789999**********************************9887665 PP

Sequence
Protein Sequence    Length: 764 aa
MQSSSPLSAG SSAGRRLGEC RETAPAVADV GDARDGREVW AEQRQLMRSV REESTTRGVQ  60
RLRVGEDGHN GEEAVGDAHD PDWNDNGAEG GEDDAGYISP SKQAAAMGGR GGKTKSYAGN  120
GRRGKRTAGR GSDAAGDVDG EGGRHFWSVD DIITLIRAKR DQDAHLQGMG HAYARMKPRE  180
WKWLDVATRL KKVGVDREAD RCWKKWDNLM QQFKKVHHFQ GLSGKQDFFQ LSGKDRMSKG  240
FNFNMDRAVY DEILGSTAKN HTINPKNVAD TGAQGGVRLP SASSADPKSV GDGDGGAEHD  300
DGDDGSTKRS LQTTGGAGGF GKRKSTSKQT FEAMTECMEK HGALMASTME SNSKRHCSIA  360
IRQCKALKAE IEVQKKQYAA SDEATQQRGE GSNVGEGATA GVKAGDVAGE GDDGAALVNR  420
LRQRNTREGM EAAAKLWVDD LRFWNEREGF AIVKLIAEVR GYLVAVARGE QPPPIRRSIV  480
LPHNSIPQHK ITDDSELNAA KERALKVQGI ALRVIHGWVF KSQNRQRGYH AAYQYALNLA  540
ATDIACAMWM GEDWRHCVSS MVAHHTLDLD MKLPLWFVGA DVEDRHEDDG LAAYQEASIQ  600
RLVGAFTSAV IIAEATDGGC VSHERLKTMA DAMRMMLTAT MWLMQMAGDD HRVHYHAWVF  660
VRRPTHGEAD ARRIHASLLR RASAHRASCD RNHRQVGKSP HHFDRPANVC PRMGIHRCEV  720
ITRRHAVFPD GGEEGGLVGH RPPGRRRRRE RRRIGKRGGV DDAF
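The numbered layout above (groups of ten residues, with a running position count at the end of each full line) can be collapsed back into a plain sequence string. The following is a minimal sketch, assuming tokens are separated by whitespace and that any all-digit token is a position counter. `parse_numbered_block` is a hypothetical helper name, not something PlantTFDB provides. Only the last two lines of the record are used, which cover residues 661 to 764.

```python
def parse_numbered_block(block: str) -> str:
    """Join residue groups from a numbered sequence block into one string.

    Assumption: all-digit tokens are position counters, everything else
    is sequence.
    """
    groups = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the running position count, e.g. "720"
            groups.append(token)
    return "".join(groups)


# Last two lines of the record (residues 661 onward).
tail_block = """\
VRRPTHGEAD ARRIHASLLR RASAHRASCD RNHRQVGKSP HHFDRPANVC PRMGIHRCEV  720
ITRRHAVFPD GGEEGGLVGH RPPGRRRRRE RRRIGKRGGV DDAF
"""

tail = parse_numbered_block(tail_block)
fragment_start = 661  # 1-based position of the first residue in tail_block

# Position of the last residue; this should agree with the stated length.
last_position = fragment_start + len(tail) - 1
print(len(tail), last_position)
```

Under these assumptions the fragment ends at position 764, which agrees with the length given in the record.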
Nuclear Localization Signal
NLS
No. Start End Sequence
1 745 752 RRRRRERR
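The Start and End columns appear to be 1-based, inclusive positions in the protein sequence. A short Python sketch of how such coordinates map onto a string slice, assuming that convention; `slice_by_position` is a hypothetical helper, and the fragment below is the final sequence line of the record (residues 721 to 764) with the spaces removed.

```python
def slice_by_position(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start."""
    lo = start - fragment_start          # 0-based index of the first residue
    hi = end - fragment_start + 1        # exclusive upper bound for slicing
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range is outside the fragment")
    return fragment[lo:hi]


# Residues 721-764 of GBG74258.1, copied from the sequence block.
last_line = "ITRRHAVFPDGGEEGGLVGHRPPGRRRRRERRRIGKRGGVDDAF"

nls = slice_by_position(last_line, 721, 745, 752)
print(nls)
```

The slice reproduces the sequence listed in the table, which supports reading the coordinates as 1-based and inclusive.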
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT2G35640.1 4e-08 Trihelix family protein