PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG82771.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1037 aa    MW: 112467 Da    pI: 6.6531
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG82771.1       genome    NCBI      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  29.8   1.6e-09  312    386  2          69
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+   + aL++a+r        ++ ++ r k ++ +We+v  ++++ g++r ++ C +kw+nl +++kk+ + ++
  GBG82771.1 312 WSVGDTVALVRAKRdqdlyiaGLGISFARMKTAAWKWEDVRVRLQAMGVTRDAVDCGKKWDNLMQQFKKVHKFQN 386
                 8888889999999966666656666777899**************************************987665 PP
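
The domain coordinates in this record (Start 312, End 386) are 1-based and inclusive, so they can be checked against the protein sequence. The following Python sketch does this for the two sequence rows covering residues 301 to 420. The helper names `clean_block` and `slice_region` are hypothetical and are not part of any PlantTFDB tool.

```python
import re

# Residues 301-420 of GBG82771.1, copied from the numbered sequence listing
# of this record (the trailing numbers are position counters).
BLOCK_301_420 = """
MDDDGEGGRN FWSVGDTVAL VRAKRDQDLY IAGLGISFAR MKTAAWKWED VRVRLQAMGV  360
TRDAVDCGKK WDNLMQQFKK VHKFQNLSGG KDYFRLASRD RRSEGFSFVM DRSMCDEMEA  420
"""


def clean_block(block: str) -> str:
    """Keep only letters, dropping spaces, newlines and position counters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_region(residues: str, first_pos: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive).

    `first_pos` is the sequence position of residues[0], so a fragment
    of the full protein can be sliced with full-length coordinates.
    """
    last_pos = first_pos + len(residues) - 1
    if not (first_pos <= start <= end <= last_pos):
        raise ValueError("requested region lies outside the fragment")
    return residues[start - first_pos : end - first_pos + 1]


fragment = clean_block(BLOCK_301_420)
domain = slice_region(fragment, first_pos=301, start=312, end=386)
print(len(domain), domain)
```

The extracted region is 75 residues long (386 - 312 + 1) and should match the uppercased GBG82771.1 row of the alignment, because that row contains no gap characters.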

Sequence
Protein Sequence    Length: 1037 aa
MVGRLPNGRP AGTSGGSINR QRSAGSVAKQ PYDPTLYAHL PSHEIPLPPS DDDGGEARSP  60
TLPLGSGSTQ DWAATQSCGG RGVETQWSYT SLLNEGLCDD DGDATVDLSF QLSSSSGAAA  120
THTRIINSHP GGDCEENTQG AVCGPRDGGL PQSVRDGGGD RNNSSLSVGG GFGARKRPDR  180
MRLSPAARSG PDATRGRQPA EDLIGGGPDV QRDGRQVWVE CRQELHRGET ETITRGVQGL  240
HVDEVEDVAV DEGQGCDDVD GEDGCKSDDL PDIRPLGRRA TRGGSGAKKG PATKPRRTKN  300
MDDDGEGGRN FWSVGDTVAL VRAKRDQDLY IAGLGISFAR MKTAAWKWED VRVRLQAMGV  360
TRDAVDCGKK WDNLMQQFKK VHKFQNLSGG KDYFRLASRD RRSEGFSFVM DRSMCDEMEA  420
MTKGDHTIHP TNLADTGAAG GVQMPAGAGG SGGTMGGDGG GETADEEQGS TKNSSFSAGS  480
GGGYGKRKNM RQQTFKAVAD VMEKHGALMG SIMDSTSKRQ CSMMSRQCEI LESEVELQRK  540
HYAAADEANR MIIKLSLRHI QYWRSVHTVT SDEIRAGRHL PFVLDAPRSH RMVAPVFALH  600
EWDFCTVVVD RARRSSSKHK DVHAEQTVLI VADNSTSFGG GRRHVPKLKR LRSEEPSPKD  660
PVRRGRYWAA SNEDEDDDVF TTEEEAADDT VAAPRGSSLQ TKNDQAAPRR LLTPPPERQQ  720
GRGQSNTKAK EIVVDVGDED NKPLESLRQR NAMQGATATG VRLRAAHEER PPQGAMPSTP  780
SQPRLCNTAA DGGSTERGGG AVAQQEARVA SAGAGAGCSG NVAVVAGARE EVPVVDREVA  840
RGENKGEKED DDPLLSRERR GGLARDLADR ARLWVDDKAF WTTGKGRRLY DIVHRMREHF  900
EAVASGVQAP VVSRSVVMPK SATTLTRIVD PAQLQQAIIR RGAAENIVLR LLHGWVFKSG  960
NRPRGYNLAF QYALESVATN LVRVMWYGEE WSNVVSVPVC AHTIDLNMDM PLWFVGMNID  1020
DRPEDDDMAT YQESTVM
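
The listing above groups residues in blocks of ten and ends each full row with a running position counter. Below is a minimal Python sketch for turning such a listing into a plain sequence string while checking the counters. It uses only the first two rows of this record, and the function name `parse_numbered_sequence` is a hypothetical helper, not a PlantTFDB API.

```python
# First two rows of the protein sequence listing, as an excerpt.
EXCERPT = """
MVGRLPNGRP AGTSGGSINR QRSAGSVAKQ PYDPTLYAHL PSHEIPLPPS DDDGGEARSP  60
TLPLGSGSTQ DWAATQSCGG RGVETQWSYT SLLNEGLCDD DGDATVDLSF QLSSSSGAAA  120
"""


def parse_numbered_sequence(text: str) -> str:
    """Join residue blocks into one string and validate row counters.

    A row may end with an integer giving the cumulative residue count;
    rows without a counter (such as a short final row) are accepted.
    """
    chunks = []
    total = 0
    for line in text.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        counter = int(tokens.pop()) if tokens[-1].isdigit() else None
        chunk = "".join(tokens)
        total += len(chunk)
        if counter is not None and counter != total:
            raise ValueError(f"counter {counter} does not match {total} residues")
        chunks.append(chunk)
    return "".join(chunks)


sequence = parse_numbered_sequence(EXCERPT)
print(len(sequence))
```

Applied to the complete listing, the same check would be expected to give 1037 residues, the length stated in the record, since the last counter is 1020 and the final row holds 17 residues.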