PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG60471.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 993aa    MW: 108350 Da    PI: 6.8403
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG60471.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix33.11.4e-10289365270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi+a+r+ +++l        r k k  +W++v k++ + g+ +r+++ C +kw+nl +++k + + +++
  GBG60471.1 289 WTVEHMIALIRAKRDQDSHLAglahttgRMKTKTWKWDDVEKRLVQMGVtSRKAVNCGKKWDNLYQQFKTVHKFMGE 365
                 **************888888855554335689999*********99999899*****************99876665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 993 aa     Download sequence    
MRRTSKTARK MERVQTRTTL SVGAAGSSRQ RCRSAEAVRR PYDLALYSHL PSHEIPLPPS  60
DDDGDDPRCS TLPLGSGSTQ DWMGSQLHRQ ASTPTYTDLL EGHTPAGYDA GLVIISFGLR  120
SGSAEEVTRT VIVNPGSGRT RTPAPVTTRT GGSVPCRTAT AGGTRDGRRP NEEWLATEVV  180
GRKFWDDHRR QSREASTAGI TRGVAKITVG ADDILGDEDG AVAEDCEADD GAGGDDEEDD  240
EEMEIRPVGR KRGGSRAAKK FPETQTRRRG KKGVEDGSAG EGSKSRDFWT VEHMIALIRA  300
KRDQDSHLAG LAHTTGRMKT KTWKWDDVEK RLVQMGVTSR KAVNCGKKWD NLYQQFKTVH  360
KFMGESGKPN FFTLTPGERK ERGFDFQMDE GVYSEMAVMT RSDHTIHPTN LADTGPRGER  420
GWTGRRSRVH EGFYQRQWCC RGQQTEECET TDLRHDCRRD EGARESDGDN RGRREQETMP  480
PRGASARGRK DGGGGDGGET NKGRGHIPKS KRQRINDASS SQTDDFLADE VAMVDAQGTE  540
GVARLGFGRD GVAREQLQAL KRSVVGGAVG MVPRTPNTAG VVVTGARVPG QLSSAAGQGQ  600
PRQPLPLQHV LASGGGHMVT AQKGDATASA SQAAAADHTA KGGVVEDGKR EGNDGDDRPL  660
LPRGKGAPKG DDLEEKAKLW VDCDAFWGQG PGKPLREAVG ECTDYFVAIA NGDAGVEPPS  720
MLIMPPNDVP RFKIDDPAQR DPALXRARSV ERVVLXTIHG WIFKSQSRST GFSRAESYIT  780
VDFATDLARA VWQALEWSRV VSPAPVYHTL AMKMDVPLWF AGVKIEDRPE DDDMATRQEA  840
TVLLLAECWT DAIWCGQWAD GGRVKQERLS RLADSLRALL CAVMWIMRMG GDNDRSDYEA  900
WSYASMIAKP MMIAAGSYIF NWRRHVVDSA NLVLDCLGKA HLTMGDYPQC ILEWCDCGLA  960
FGHNAALKNA AEAAKHGWIG SGPPTDDDGD DGK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1509514KSKRQR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-06Trihelix family protein