PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG92712.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 938aa    MW: 102182 Da    PI: 9.4559
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG92712.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix45.32.3e-14325398268
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +++ aLi+a+r+ + +l+       r k ++ +W +v+++++  g+ r+p++C +kw+nl +++kk+ + +
  GBG92712.1 325 WSVEHIIALIRAKRDQDAHLQgmghayaRMKPREWKWNDVARRLKNVGVDRKPEKCGKKWDNLMQQFKKVHHFQ 398
                 9*************7776666433333367899************************************98765 PP

Sequence ? help Back to Top
Protein Sequence    Length: 938 aa     Download sequence    
MADSFRKLPL LLLAIFLLVV VVVLHVCFVG SAPRRQRRRT SSMKAVRTDS GQGSPRGGGE  60
QVVSRCAVGS SSPACRLSLG VGNSSLPPHL QPLLDSSDEE EREGRARTVP LGSGSTQEWS  120
WTELCGGSGG VHSQSFTELL APGLDGEEGH GGSNVSSGLS TGRCGSQTRT VIVNPHVGDD  180
GGQLTAVDRS LKSAGAQQWQ GRATSISRST HGRPSFMQSP APVAEERRKI REGREEATRR  240
GVERLRMDRQ AEEVEEPHAG LPSEDDDNDG AGEAGDGNGG YASPSQNSNG GWKGGKTKAT  300
SGNGHGRPKK AQAKANDGEE KRNFWSVEHI IALIRAKRDQ DAHLQGMGHA YARMKPREWK  360
WNDVARRLKN VGVDRKPEKC GKKWDNLMQQ FKKVHHFQSS SGGIDFFQLN GKERARHGFN  420
FDMDLAVYGE IEGSSGSNET INPKNVADTG ARGGVRLPST TTADPQAVGD ADAGAGGEDE  480
DEGSTHGSSQ TSGSPHGFGK RKSTRQQTFE AMMECMDKHG ALMAATMESA NKRQCSIQLR  540
QYLLAAISEV DGSTFFLRAF HSIFNTRHAI SSRSVPSRRV LGKQVREGEV APKKGRHDPA  600
KRRRGIQGGK AAVGREVEAD WVDAQERREE DDDFEEVDEQ TLVRKVKQRT GGAIRIKRVG  660
AAEVAPDQRQ TPSSKRSEQA VGVASSSQAV VDQSTMRSPA PQPRGEAVQG ASAVADVAKA  720
GDGGAASEDD EPQVMKLRGQ RPEAKAMEPS AIRRSISLPH SSIPQKKIKD ASELRAAKER  780
ALKVESIAKR AIHGWIFKSD SRHKGYHLEY QYALNHAATD MARAMWALED WRSLVSPMAI  840
RNTLELVMKL PLWFVGANVV DRHQDDECAA YQEAIAQRLV RDFTNVVEMA QAMDSGRVSY  900
ERLKSLAEAM RYLLAAAAWI MRMAGDNARS HFDAWVFM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13439RRQRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.15e-07Trihelix family protein