PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG68314.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 698 aa    MW: 75573.5 Da    pI: 9.7367
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source
GBG68314.1     genome  NCBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  45     2.8e-14  443    516  2          68
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +++ +Li+a+r+ + +l+       r k ++ +W +v+k+++  g+ r+p++C +kw+nl +++kk+ + +
  GBG68314.1 443 WSVKHIITLIRAKRDQDAHLQgmghayaRMKPREWKWNDVAKRLKNVGVDRKPEKCGKKWDNLMQQFKKVHHFQ 516
                 9*************7766666433333367899************************************98765 PP
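
The alignment above can be cross-checked against the coordinates in the domain table. The following is a minimal sketch, assuming the usual HMMER-style conventions: a `.` in the model line marks an insert column, lowercase residues in the sequence line are insertions relative to the model, and a `-` in the sequence line would mark a deletion. The function name `span_lengths` is hypothetical and introduced only for illustration.

```python
# Model (HMM) line and target sequence line, copied from the alignment above.
HMM_LINE = "WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege"
SEQ_LINE = "WSVKHIITLIRAKRDQDAHLQgmghayaRMKPREWKWNDVAKRLKNVGVDRKPEKCGKKWDNLMQQFKKVHHFQ"


def span_lengths(hmm_line: str, seq_line: str) -> tuple[int, int]:
    """Return (model positions covered, sequence residues covered).

    Assumes '.' in the model line is an insert column and '-' in the
    sequence line is a deletion, so neither consumes a position.
    """
    if len(hmm_line) != len(seq_line):
        raise ValueError("alignment rows must have the same number of columns")
    hmm_positions = sum(1 for c in hmm_line if c != ".")
    seq_residues = sum(1 for c in seq_line if c != "-")
    return hmm_positions, seq_residues


hmm_len, seq_len = span_lengths(HMM_LINE, SEQ_LINE)

# Coordinates are 1-based and inclusive, so a span covers end - start + 1 positions.
print(hmm_len == 68 - 2 + 1)      # HMM Start 2, HMM End 68
print(seq_len == 516 - 443 + 1)   # Start 443, End 516

# Lowercase residues in the sequence line are the insertions relative to the model.
inserted = "".join(c for c in SEQ_LINE if c.islower())
print(inserted)
```

The seven inserted residues account for the difference between the 74-residue sequence span and the 67 model positions.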

Sequence
Protein Sequence    Length: 698 aa
MSKPVASPRC TPRCNSGLGF SRGGKPANGR QMRIFEAAQR VQKTTASVCR QARVAVCRTS  60
QLRCRPCSLP SSAVVVVHRM ADSFRKLPLL LLAIFLLVVV VVLHVCFVGS APRRRRRRTT  120
PMKAVRTDSG QGSPRGGGEP VVSRRVVGSS SPACRLSLGV GNSSLPPHLQ SLPDSSDEEK  180
REGRALTVPL GSGSTQKRSW TELCGGSGGV HGQSFTELLA PGLDGEEGHG GSNLSSGLST  240
GRCGSHTRTV IVNPHVGDDD EQLTVVDRSS KSASAQQWQG RATYISRSTQ GRPSFMQSPA  300
PGCAASNLGR RRGIVEEGGG YLDDVVDDRD GRLVWAEERR KIREGREEAI RRGVERLRMD  360
RQVEEVEEPH AGLPSEDDDN DGAGEAGDGN GGYASPSQNN DGGGKRGKTK ATSGNGRGRP  420
KKAQAKAKDG EGDGDAEEKR NFWSVKHIIT LIRAKRDQDA HLQGMGHAYA RMKPREWKWN  480
DVAKRLKNVG VDRKPEKCGK KWDNLMQQFK KVHHFQSSSG GIDFFQLNGK ERVRHGFNFN  540
MDRALYDEIE GSSGFNETIS PKNIADTGAR GGVRLPSTTT ADPEAVGDVD AGAGGEDEDE  600
GSTRGSSQTS GNPHGFGKRK STRQQTFEAM TECMDKHGAL MAATMDSASK RQCSIQLRQC  660
EALEAEVQVQ KTHYAASDEV SKLMCHALLD IAKAIRER
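
The blocked layout above (ten-residue groups with a running position counter at the end of each line) has to be flattened before the 1-based coordinates used elsewhere in this record can be applied. The following is a minimal sketch of one way to do that; the helper names `parse_blocked_sequence` and `region` are hypothetical and not part of any PlantTFDB tooling.

```python
import re

# Protein sequence exactly as displayed above, including spacing and counters.
BLOCKED = """
MSKPVASPRC TPRCNSGLGF SRGGKPANGR QMRIFEAAQR VQKTTASVCR QARVAVCRTS  60
QLRCRPCSLP SSAVVVVHRM ADSFRKLPLL LLAIFLLVVV VVLHVCFVGS APRRRRRRTT  120
PMKAVRTDSG QGSPRGGGEP VVSRRVVGSS SPACRLSLGV GNSSLPPHLQ SLPDSSDEEK  180
REGRALTVPL GSGSTQKRSW TELCGGSGGV HGQSFTELLA PGLDGEEGHG GSNLSSGLST  240
GRCGSHTRTV IVNPHVGDDD EQLTVVDRSS KSASAQQWQG RATYISRSTQ GRPSFMQSPA  300
PGCAASNLGR RRGIVEEGGG YLDDVVDDRD GRLVWAEERR KIREGREEAI RRGVERLRMD  360
RQVEEVEEPH AGLPSEDDDN DGAGEAGDGN GGYASPSQNN DGGGKRGKTK ATSGNGRGRP  420
KKAQAKAKDG EGDGDAEEKR NFWSVKHIIT LIRAKRDQDA HLQGMGHAYA RMKPREWKWN  480
DVAKRLKNVG VDRKPEKCGK KWDNLMQQFK KVHHFQSSSG GIDFFQLNGK ERVRHGFNFN  540
MDRALYDEIE GSSGFNETIS PKNIADTGAR GGVRLPSTTT ADPEAVGDVD AGAGGEDEDE  600
GSTRGSSQTS GNPHGFGKRK STRQQTFEAM TECMDKHGAL MAATMDSASK RQCSIQLRQC  660
EALEAEVQVQ KTHYAASDEV SKLMCHALLD IAKAIRER
"""


def parse_blocked_sequence(text: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_blocked_sequence(BLOCKED)

print(len(protein))               # 698, matching the stated length
print(region(protein, 443, 516))  # the trihelix signature domain span
print(region(protein, 113, 118))  # RRRRRR
print(region(protein, 416, 424))  # GRGRPKKAQ
```

Because the record uses 1-based inclusive coordinates, the slice subtracts one from the start and leaves the end unchanged.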
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    113    118  RRRRRR
2    113    119  RRRRRRT
3    114    120  RRRRRRT
4    416    424  GRGRPKKAQ
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  1e-08    Trihelix family protein