PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG78468.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 968aa    MW: 106059 Da    PI: 8.1201
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG78468.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix34.17.3e-1193169270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi+a+r+ + +l        r k k  +W++v k++++ g+ +r+++ C +kw+nl +++k + + ++k
  GBG78468.1  93 WTVDHMIALIRAKRDQDLHLAglghnngRMKTKTWKWDDVEKRLMQMGVtSRKAEDCMKKWDNLYQQFKTVHKFMGK 169
                 **************766666544433336789999*************9899******************9987776 PP

Sequence ? help Back to Top
Protein Sequence    Length: 968 aa     Download sequence    
MSTITRGVAN IDVRADDVFG DCDGAGGEDC AAGDESNDND EDGYGEMEIR PIGRKRGGSR  60
ATNKSTQPRA GRRGKKGAGD TSIGEVAKSR DFWTVDHMIA LIRAKRDQDL HLAGLGHNNG  120
RMKTKTWKWD DVEKRLMQMG VTSRKAEDCM KKWDNLYQQF KTVHKFMGKS GKPNFFTLTP  180
GERKERGFDF QMDDRVYSEM KAMSRGNHTI HPTNLVDTGV AGGVQMPCPR GGRNESGGSE  240
GCEDGQDDDQ GSTRDSTFNG VGGGGGDKRK NVRQQTFDTI ADVMKDHENL MATTVDIASK  300
RQCSILTRQS DILERELEVQ KEHYVKADHA NFMMGGLLWT ARRVHGVLRE HGMYKRRVRA  360
ALRTTVIHFP RLQRHCLCTE SSLQACTSTS PLHASAFDSA GSAPSCASST CASSNVAIIV  420
TAPWPVPLPA HHPRASSATC PSSTSTRPPS TWSFIDVLLH PRGCRSRFAD EMAPRNASGK  480
GRKDSGVGDG GETQKGRGHV PKSKRQRVDQ ASSEHNDDFQ AEEVVMVDPQ GTAGAMRLGF  540
GRDGVSREQL SAIKQSLVVG GAGTLPRTPK AAGVLITEAR VLRQLPVVGG QGHPRQPLPL  600
QQAGASGGGK APSVQKGDAT ASDTHTAAVD HSSRSVVAEG AEEARVDDVR RDDGRRDGKR  660
EGDDDDDRPL VTRLKGAAKE DDVEERSKLW VDCNAFWGQG PRKPLREAIE DPAQRKPALR  720
RARNVEKLVL RTIHGWIFKS SSRSTGFARA ESYISVDLVT DVARAVWQGY EWSKVVSPAM  780
VYHTLAMKMD VPLWFAGVKI VDRPEDDDMA ARQETTVLRV ADCWTNAVWC GQWADDGCVK  840
QERLSRLADC LRALLSACMW IMRMGGDNDQ SNYEAWFYVS MVAKPTMIAA GSYIFDWRRN  900
IVDTANLVLD RIGKAHLTLG DYPHCIPKWS DCGLVLRHNA TPKNAAEAAK HGWIGSGPPT  960
EDDADDGS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1502507KSKRQR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.18e-06Trihelix family protein