PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG65279.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 998 aa    MW: 110092 Da    pI: 8.2894
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG65279.1       genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    29       2.8e-09    251      326    2            70
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+   + aL++++r        ++++l   k ++ +We+v ++++  g++r+++ C +kw+nl +++kk+ + ++ 
  GBG65279.1 251 WSVGDTIALVRVKRdqdvyivGLGHSLAHMKTREWKWEDVRARLEIMGVTRKAIDCVKKWDNLMQQFKKVHKFQNL 326
                 7888888999999966666666777888899**************************************9987665 PP

Sequence
Protein Sequence    Length: 998 aa
MLTCCAFCFP DDRMDRRLTC GRPAGMTARA TNRQRSAGSV AKQAYDPMLY AHLPSHEIPL  60
PPSDDEGEDA RSSTLSLGSG STQDWAPTQS CGGWRAESPW SYTSLLNEGL CNDDGNAAVD  120
LSVQLSSSSG AAATHTRIIN PHPDGDCAEQ LWAECGQALR QAGTETITRG LQPLHVDEGD  180
NAAVQEPLDC DDADGDEDCN FDDLPEIRPL ASKVRNSGAS AKKGPTPRNR RNKKMDDNTG  240
GSDGEGGWNF WSVGDTIALV RVKRDQDVYI VGLGHSLAHM KTREWKWEDV RARLEIMGVT  300
RKAIDCVKKW DNLMQQFKKV HKFQNLSGRK DYFKLASSAR RSEGFSFVMD PSVYDDMEAM  360
TKRDHTIHPK NLADTGKTGG VRCRQAQGLE GNPWPMRVEG RHPMPAVPFF CFFLLDVLHV  420
LGECFMRDLR SQERQGHVLK GLCGSRIVAD GGGVGGAALV VGGGGGGEVV GERMGGGMGL  480
LCANAVWNVV VTCVVVDDAD RRRSDVGARP SAPCYRGVCA PSRIKVVRFW RLQHRRSPHV  540
LHSEHNHAEH PIAFLLLALD RCRVLAADIS RLLVGDFFSV VNDHRRGSSR LLVMSTFRVV  600
YNARIPHRGA LRRRRRRRAE EEEAPKDIVS TLQGSGCQRS SDHIVARRLV TPPPEAQHVR  660
ARDTPKAKEV VDVGGEDDEP LESPRQRNVT RGAMATVVRA RGATNKRPPQ GGLPSMPSQL  720
HPRNTADEGG SMERGGGGEI QQEARVVGGG GGSLMRVREL QAMWHPRRER ERSFQLWSGR  780
WHVARTMRSA SACRPPSVVM LTSSTSLTRI VDPAQLQQAI SRAAAVENIA LRILHGWVFK  840
SGNCPRGYNL VFQYALESVA TDIARAMWYG EEWSNVVSAV VCAHAIHLSM DLPLWFTGAN  900
IEDRPEDDDM AAYQESTVIC IAHAFRVAVQ MGVNVDGGFI SHDRLSSVAN CFRLLLAPSI  960
WLMRMSGDDL RSHYEAFYFA KLFAKPTLVA FMHPSFKH
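The sequence above is laid out in groups of 10 residues with a running position count at the end of each full row. A minimal sketch of how such a block might be joined back into one contiguous string is shown below; the function name `clean_sequence_block` is hypothetical and not part of PlantTFDB, and the example uses only the last two rows of this record (residues 901 to 998).

```python
import re


def clean_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Hypothetical helper: assumes each row holds residue letters in
    space-separated groups, optionally followed by a position count.
    """
    residues = []
    for line in text.splitlines():
        # Drop the trailing running position count (e.g. "  960"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Drop the spaces between the 10-residue groups.
        residues.append(re.sub(r"\s+", "", line))
    return "".join(residues)


# Last two rows of the record, copied verbatim.
block = """IEDRPEDDDM AAYQESTVIC IAHAFRVAVQ MGVNVDGGFI SHDRLSSVAN CFRLLLAPSI  960
WLMRMSGDDL RSHYEAFYFA KLFAKPTLVA FMHPSFKH"""

tail = clean_sequence_block(block)
print(len(tail))  # 98 residues, i.e. positions 901 to 998
```

Applied to the full 17-row block, the same function should return a string whose length can be compared against the stated 998 aa.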
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      604      616    RIPHRGALRRRRR
2      608      617    RGALRRRRRR
3      612      617    RRRRRR
4      612      618    RRRRRRR
5      613      618    RRRRRR
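The Start and End columns appear to be 1-based, inclusive positions in the protein sequence. The sketch below checks that reading against residues 601 to 620 of the sequence listed in this record; `slice_1based` and `find_mismatches` are hypothetical helper names, and the 1-based inclusive convention is an assumption being tested, not something the page states.

```python
def slice_1based(region: str, start: int, end: int, region_start: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from a region
    whose first character sits at protein position region_start."""
    return region[start - region_start : end - region_start + 1]


# Residues 601-620, copied from the sequence block of this record.
REGION_START = 601
REGION = "YNARIPHRGALRRRRRRRAE"

# (start, end, sequence) rows from the NLS table.
NLS_ROWS = [
    (604, 616, "RIPHRGALRRRRR"),
    (608, 617, "RGALRRRRRR"),
    (612, 617, "RRRRRR"),
    (612, 618, "RRRRRRR"),
    (613, 618, "RRRRRR"),
]


def find_mismatches(rows, region, region_start):
    """List the rows whose coordinates do not reproduce the listed sequence."""
    return [
        (start, end, expected)
        for start, end, expected in rows
        if slice_1based(region, start, end, region_start) != expected
    ]


print(find_mismatches(NLS_ROWS, REGION, REGION_START))  # [] when all rows agree
```

All five predicted signals overlap within positions 604 to 618, so they describe one arginine-rich stretch rather than five separate sites.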
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    2e-07      Trihelix family protein