PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG64812.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 809aa    MW: 88867.7 Da    PI: 6.2011
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG64812.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix331.6e-10207283270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi+a+r+ +++l        r k k  +W++v k++ + g+ +r+++ C +kw+nl +++k + + +++
  GBG64812.1 207 WTMEHMIALIRAKRDQDSHLAglahttgRMKTKTWKWDDVEKRLVHMGVtSRKAVDCGKKWDNLYQQFKTVHKFMGE 283
                 **************888888855554335689999*********99999799*****************99876665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 809 aa     Download sequence    
MGSQLHRQAS TPTYTDLLEG RTPAGYDPGV LDLNFGLRSG SAEEVTRTVV VNPASGTTHT  60
PAPVTTRTGG SVPCRTTTAG GTVDGRRPNE EWSATEVVGR KFWDDHRRQS REASTXXIXR  120
GVAKITVGAD DILGDEDGAV AEDCEADDGA GNDDEEDDEE MEIRPVGRKR GGSRAAKKLP  180
ETRTGRRGKK GVEDASAGEG SKSRDFWTME HMIALIRAKR DQDSHLAGLA HTTGRMKTKT  240
WKWDDVEKRL VHMGVTSRKA VDCGKKWDNL YQQFKTVHKF MGESGKPNFF TLTPGERKER  300
GFDFRMDERV YSEMAAMTRS DHTIHPTNLA DTGATGGVQM SGPRGGRNES GGSEGGGDGQ  360
DDDQGSTRDS ISGGGVGGGS KWKNVRQQTF DTIAHVMKEH GSLMATTVDS ASKRQCSILT  420
RQCDILEREV EVLKEHYVKA DQANLMMCEA LMEIAKAIRE RSNGKREGND GGDRPLLPRG  480
KGAPKEDELE EKAKLWFDCD AFWGQGPGKP LREAVGECTN YFVAIANGDA GAEPPSMLIM  540
PPNDVPRFKI DDPAQRDPAL RRARSVERVV LRTIHGWIFK SQSRSTGFSR AESYITVDFA  600
TDLARAVWQA LEWSRVVSPA LVYHTLAMKM DVPLWFAGVK IEDRPEDDDM AAWQEATVVL  660
LAECWTDALW CGQWADGGRV KQERLSRLAG SLRALLCAVM WIMRMGGDDD RSHYEAWSYA  720
SMVAKPTMIA AGSYIFNWRR HVVDSANLVL NRLGKAHLTL GDYPQCIPEW CDCGLAFGHN  780
AALKNVAEAA KHGWIGSGPA ADDDGDDGK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.17e-06Trihelix family protein