PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG75461.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1814aa    MW: 192510 Da    PI: 6.6752
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG75461.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix30.59.4e-106777342279
    trihelix  22 rrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                 ++g ++ + W +++++mr++g+er+  qC+ +w+nl++ y+ + +++ ++   ++s +
  GBG75461.1 677 EEGPRANGAWHRIARAMRAEGLERTWDQCQTRWKNLRRWYRLVVNHDHSQIGCHRSYW 734
                 34568899*********************************99998887643333355 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1814 aa     Download sequence    
MTRGDWDPLE RLRGLLRTRA MCLDALLQPR QPLSGPHLWT DTPYVAVRMT HGGAGMAGAL  60
QPINNGSITD LGERPRGGDA GVLCSRGSDL GGTGHVSVLC TRGTDLGACR PGACTRESSD  120
FSACTRGTDL RACTRGTDVP ACTRGTDVRA CTRSTDVRAC TRGTDVRACT RGTDVRACTG  180
GTDVRACTRD SNAAGRIEAA AVCTRASEAD LRAGGSGSPP VVDAACTRGD PRGHGGDPRG  240
ALKVGESGAR GRGDDVVVGT RGRGSDLDVG ARGSGGGVDL CTRGGGADDV RTSGSGVGGQ  300
RTLGSGSLAV GAPRPVAEEQ TISEAASRLF GQPCENQQST PTGLFRNEKE KDNANASVVV  360
VPHVPRVCGR EQSNVRPVFV RKRGAGSGGG GKDRGRKDAG GVDGRRKNGG GKDGGIKNGG  420
GKDGGIKNGG GKDGGIKNGG GKNGGGKDGG IQNGEAAHRD DVAPRGGKDG GRNNGEGKEG  480
GGSDGGPARR DDVGPIYGLD DQEKDKNARC AMGPRGGRVA REKEETDKRM TVRKEACRDT  540
CCDDDVAILT KPATRVDVGI SVVMTRRSVR GGDLVDPSVT RTGSRQADGV EATDCPSLIG  600
SGGESREEKG DEVAGDVADG KRVRARRIYR DRWGDYDTTV LIGLLGEERS ASRGLMGGRG  660
GGGGGGGGQE QGQSTEEEGP RANGAWHRIA RAMRAEGLER TWDQCQTRWK NLRRWYRLVV  720
NHDHSQIGCH RSYWSMNPSE RCEANLNFDL RRNWFAAIGA YLGGGDLDMP RRRRPGGGSA  780
PAPAPAPAAA SQGGIGHVSG PGNSANAPRS PDLHSVRQQQ QQQQQQQQHH TNVEASTFQK  840
LPRVVTQSTG EWTHQSGGEG WRSGDGGGEG RRSGDGGGGL QALNNNGEGS SSWRKTSDDE  900
SHRRMDACHV SCSRRRRAPD DGSGDYNSSS SSVARSHNSD IRGCVEGTPS PPSISSPSRL  960
VVVEAAGNEG RSGDCRSAGE RAELVENVSG GGHYEHCEEY VEARSGDGRV SGSPLHRHPQ  1020
KDDVVAATRE GVDFADLACH NAEGAHDCHV SCSRIHRQHQ DKVGVGDGEG RGGGCAHPDE  1080
GGRSGDDDYV AGSRMHREHH ESEGGGGYGE GVSGGCTYQG DDGRSGDCCR VSYSRVHSRQ  1140
HRDDIVDVGN AAGAGCACQK GEPLSNEEDG ELQKSDKEGN GEPSSKEKGV LQSDKQGSEG  1200
WREKKRDENS SRREKKSEET VRWDPGYLSG CRGVKRTRSV EDYSEEGVLQ SDKQGSGGGR  1260
EKREKKSDEN SSRREKSRED TLRSERGHVS GCGGVRRTRS VEDYSEEGVL QSDKEGSGGG  1320
REKRQKKSDE NSSRREKTSE DTLRSERGHV SGCGGVRRTR SVEDHNSFGA QVDGDTVHLP  1380
PSPPSPLPYP SPKRRRGTAA SDVMAAMKEE EDSSSRDLVK GETIREGEGG LKECHVSDLE  1440
QLPLKEEWAE LGAAMSKDMT ELRAAMTEFR AVRIWTRRTA AVGISTLELE GRGGRSPCIC  1500
KDMAAELPAA TSTKAMEKDS SGVVMRQQQT SEEAGGLKEC HVSDLEQLPR KEEESGELPA  1560
AAVRTKAMED DSSGAVIGEP TREGVGLKEC HVSDVEQLPL KEESSGKLPA AAGRTKAMEE  1620
DSSGAVIGEP TREGVGLKEC HVSDVEQFPL DKQWSEFRAA LSKDMAELRA AMIAFRSGIS  1680
EFCAARRWQP SFGLQGDGGR APCSCTDMAP ELRRAATSTM AMEEGSSGVV TREEPTSDVM  1740
GLKECHVSGV EQLPLKREWA ELRDALREDM SELRAATAEL RATCQILERC DTWHEAPAGV  1800
SVLGEEAIMY QSRV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1439446GGGKNGGG
2440447GGGKNGGG
3657664GGGKNGGG