PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG71018.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1473aa    MW: 160490 Da    PI: 6.3417
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG71018.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix34.26.8e-11899974270
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+ ++  aL+ a+r        m++++ r k ++ +W++v + +++ g+ rs + C +kw+nl +++kk+ + +++
  GBG71018.1 899 WSVDHMVALVSAKRdqdahleGMGHAFARMKPREWKWQDVHETLKKVGVARSGENCGKKWDNLMQQFKKVHRFMQE 974
                 999999********77777777777888999**************************************9887665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1473 aa     Download sequence    
MVCMRTLLSR GGTVPGDVIN DWERGISAAR TMSHQAGGVP LVVPCSVPWV VCFHDAKAAK  60
LDLHTATWSL NALTDRIAGS CVLNDTRRRG CLARRERSIA AGGPTADAVW NPSEKDHTLA  120
EKSSCSCLHT TSDLAVNWNA DESRGNGSHS KSNQASESLA FSRTSVKDIM AVHLAAEEQL  180
SADSARGYSS EDGPEPSMQT TTVRDDSGEG HLQARQMEER GLQFWIGDAF YRAESALSRD  240
LSTLAAAVYR DKHGRLTVID AMAGCGVRAA RYLSQAGADF VWANDASIAN HLPLVQNLSR  300
CSQLEACNGS ARTLGSGPEV EWGMEAATEG VAPGGDSTWR VSHDDVVRVL SSRRSGPSRE  360
DINGAANVKD DHFDIVDIDS FGSDAWMFSS ALQAVRYGGM IYVTCTDGLT AGGHQPLTSL  420
ASYGAMVLRL PYANEIGIRM LIGGLVRQAA MWGLHAWPVF SHYSYHGPVF RAMMRIQKIK  480
GSHSWPSEWY AFIGYCHSCG ESRVVQWEDL GNVFCACRQD AGRPMQLTGP LWVGPLHTEE  540
DVREMESKAE SWGWITSGQT KEAGGSRSKG KSLDLRTLLE IMLEECQPGL SPYYTSLDLI  600
SRFGRIQTPR RDALVQALRQ EGYIAGRSHV ERSSVKTNAP MAECIRNCGH VLITVVVVVM  660
CRSARGVPAM VPDEEDAHAD VNLLFGLCSG SSTAMTRKLI INPHPDGDWG DAALVGRSGG  720
GSSIGSPATK KTRDRHCLQS KSAPATRPEV VRSQRMQLPS PLSGGQVSGR RSGGGGVDAR  780
FAVDVDERTG RRVWAEHRQQ LRGDREDAIT RGVRQLHVGG SDTDKEAAVE VDDFEDMDEE  840
DEDNEEAAVD IRPVGRTSMG GWGRSKKAYA AHGRQTKKTA SGGSDGEGDI DVDSGRNFWS  900
VDHMVALVSA KRDQDAHLEG MGHAFARMKP REWKWQDVHE TLKKVGVARS GENCGKKWDN  960
LMQQFKKVHR FMQESGKPDY FQLTGKERRS HGFNFVMERA VYEEIKGSTT KNHTIHERNV  1020
ADTGAPGGVE MSSGSWGGRG DGGGVGDVGG EPQEEEDGST KVKETESSTS LTPRRQAVQD  1080
EAVSGAFQRA VGSCGTGGAG GVVAAAGAGG AVLTRAAGGA AHAVNNTDDE DEEPLANRVW  1140
RGNGPIVLAE QTRLWVDDNR FWNDTEDNRM YKIVNGTRNY LVSVVRGVNP LSRSVVLPHS  1200
SIAQGTITDE SQLREATERA AKVQRVVMRV IHGWIFKSTS RSQGYTAAYG YLLQHVATDI  1260
TRAMWCAEEW SVCVSSVVCH VTLELGMQVP LWFVSTHIED RPEDDELAIY QEETLQRLVA  1320
GFTSAVVVAE LQDGGRLSHD RLKNVAEGMK VFVVVVMWLM RMSGHDLRSH YAASFFVQLA  1380
AKPTLLASMH CSFDARRHIT QAANVVTERL CKPPMVLADS PHYILDWASC GVKFGHDANL  1440
PSPDDAKRLD WLGTGPLEEE DDDDGEQQKV GGR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110371044GGRGDGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-07Trihelix family protein