PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG79081.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 936aa    MW: 102262 Da    PI: 4.4145
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG79081.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix35.91.9e-11319393269
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ +  l+L++ +r        ++++++r + k+ +We+++k+m+  g  r + +C +kw+nl + ykk+++ ++
  GBG79081.1 319 WSLDDQLLLVRCKReqdmhlaGLGHNYGRVRTKEWKWEDIAKRMANAGSPRDADECMKKWDNLFQNYKKVQRFQN 393
                 7888888888888833333333444555778**************************************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 936 aa     Download sequence    
MQAGAPQVAG QGGSLPSTAA PRRQYDPSSY SHLQSWETPL PPSDEEVEAE ELGMLPLASG  60
STQLLFSQTL IAGGSASKEG GEFTSLLEAG LDHDDDEKVD LRFGLSSGSA REASRTFIIE  120
EQPSPRYLQR PRRGHTEQST LRGGASLTAC VRPSSTARHR GPSAPSIDPL PKTLPTRSGV  180
AAAAARISEV AAAPARNSFG RSNMSNPTPR PEDEFRGDPA CRPVVRPQPT VENITEGVSN  240
MRAHNDGVDE NGGAGDDADD GFREDVEAVD DDGDAPIRPL GKAGGRGRGR GRGGGRGRSA  300
GRGSRVADDD DGEKSATYWS LDDQLLLVRC KREQDMHLAG LGHNYGRVRT KEWKWEDIAK  360
RMANAGSPRD ADECMKKWDN LFQNYKKVQR FQNASGEADF FRLSNEERKD HNFMFRMERA  420
LYNEIHGGML GNHTISLPTS QTQGGASSRF SFRSGRMAPR RGSGGQARGK KRDEITEETA  480
GVSGGGRHMT KTKRSRQDAG SVSGWHLDGD DWGRPNDGEG HNEPGYLAME EVARMEQDQI  540
TPVTTPRGRN REDAGTGTSV ATTHVQQAVG ERGSGGAATP RVQRALGERG SSSAGDAAML  600
GDQAQVRGAA AVVGEAAGTP CETGGGGRKA EEAARTLEIP RAKKRKAMED NKPLVNTVRK  660
GEVVKELADR AKLWVDDKSF WRSGEGCRLY NIVNNARKYL VAIARGVPLP KVPKSVALPR  720
SKVTMTRLID SAQLQGAMNR ASKCQNVVMR VLHGWVFKSG SRERGYTLAF QYLLESVATD  780
FVRAMWLGED RSNVISPAIY AHTLELRMDL PLWYSEDADG DDDDDVDDDD DDDDDDDHHN  840
DDDDDDDDDD GDDDDDDDDH DDDDRDDDND DDCDDDDDDD DDDDDDDDDD NDDHDNDDDD  900
DDDDNDEDED DNRSEVARAR YFFLCVFGQV GRCSTL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1288297RGRGRGGGRG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-08Trihelix family protein