PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG70567.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C2H2
Protein Properties Length: 745aa    MW: 83722.8 Da    PI: 6.4828
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG70567.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H221.84.9e-07307329123
                 EEETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                 y+C+ Cg sF+ + +L +H+r+H
  GBG70567.1 307 YTCGMCGASFRKPAYLLNHQRKH 329
                 9*********************9 PP

2zf-C2H219.62.5e-06337360323
                 ET..TTTEEESSHHHHHHHHHH.T CS
     zf-C2H2   3 Cp..dCgksFsrksnLkrHirt.H 23 
                 Cp   C++s++r+++L+rH+r+ H
  GBG70567.1 337 CPieGCDRSYKRSDHLNRHLRScH 360
                 9999******************66 PP

3zf-C2H215.16.5e-05367392123
                 EEET..TTTEEESSHHHHHHHHHH.T CS
     zf-C2H2   1 ykCp..dCgksFsrksnLkrHirt.H 23 
                 ++Cp  dC+ ++  +snL rHi++ H
  GBG70567.1 367 FSCPypDCKVTCAYRSNLIRHIKNqH 392
                 89*********************988 PP

4zf-C2H2110.0013579602323
                 ET..TTTEEESSHHHHHHHHHH.T CS
     zf-C2H2   3 Cp..dCgksFsrksnLkrHirt.H 23 
                 C    Cg+ F+++ +L  H r+ H
  GBG70567.1 579 CAaeGCGMKFPDPRTLSEHCRNtH 602
                 77779****************988 PP

5zf-C2H219.13.5e-06633658123
                 EEET..TTTEEESSHHHHHHHHHH.T CS
     zf-C2H2   1 ykCp..dCgksFsrksnLkrHirt.H 23 
                 + C+  dC++sFs+ snL +Hir+ H
  GBG70567.1 633 HACTyeDCKRSFSNGSNLATHIRSyH 658
                 569999*****************999 PP

6zf-C2H212.40.00048665683220
                 EETTTTEEESSHHHHHHHH CS
     zf-C2H2   2 kCpdCgksFsrksnLkrHi 20 
                 +Cp C+k+ +++ +L+rH+
  GBG70567.1 665 VCPLCSKTLRDRRTLERHL 683
                 7*****************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 745 aa     Download sequence    
MDYGLLPIDY GKGDKLYLTT ISKTTTSSKD VARDEDWHCK GNVVESGGAT EQEEEEEEEE  60
EEEEEKEEEE KEEEEKSVEL KKEKKEKNKK KKKKSENSRT VCEEAKDWKG MRSGKERKEK  120
KGRKLRGGGG EYALGGRVES QEEEEEEEEE EEEEGGGEER GYYVGGGEGG VGEGEEGGGE  180
EIGVVEEAGG GVGQKSSPSS ASPSSSPSCS SSSSFSYSSS SFRCSSSSSS SSSSSSASWM  240
PARTQRMERK PRMQRKESTE RKESKERKES KERGSCATDG QTTTRRRKSN CLSSGGTMIS  300
SAQKRYYTCG MCGASFRKPA YLLNHQRKHT GEKPFLCPIE GCDRSYKRSD HLNRHLRSCH  360
EYGESSFSCP YPDCKVTCAY RSNLIRHIKN QHPSSKGSPM EGGGGGGGGG GGGGGGGEEG  420
NGESNKQKRR RKEDEEVCNK RRGMSSSQRA QEEEEEEEEE EEEEVGRQGG GAEGMGEENM  480
LKRRRKAEET GMKRKRRRKD SISTENKQEL KEEEEEEEEE EEEEEEEEDD DDEVRRRVKD  540
YIMDRTGIVG GCLVLFPNPS RRRLHQLDTL AATYASCICA AEGCGMKFPD PRTLSEHCRN  600
THRHVKCSVC SIQITRAKYK AHLRTHNLDR PRHACTYEDC KRSFSNGSNL ATHIRSYHLR  660
TYEVVCPLCS KTLRDRRTLE RHLGAVHSSP RDAEKKEGRE RTRKEVPFDF IETLATGKTC  720
LQKADADSEP KKSKRTKKTK SKTRT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
181124KKEKKEKNKKKKKKSENSRTVCEEAKDWKGMRSGKERKEKKGRK
289132KKEKKEKNKKKKKKSENSRTVCEEAKDWKGMRSGKERKEKKGRK
390133KKEKKEKNKKKKKKSENSRTVCEEAKDWKGMRSGKERKEKKGRK
490120KKKKKSENSRTVCEEAKDWKGMRSGKERKEK
591134KKEKKEKNKKKKKKSENSRTVCEEAKDWKGMRSGKERKEKKGRK
691121KKKKKSENSRTVCEEAKDWKGMRSGKERKEK
7284289TRRRKS
8481499LKRRRKAEETGMKRKRRRK
9482499KRRRKAEETGMKRKRRRK
10483500KRRRKAEETGMKRKRRRK
11484501KRRRKAEETGMKRKRRRK
12493498KRKRRR
13493499KRKRRRK
14494498RKRRR
15494500RKRRRKD
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G72050.21e-15C2H2 family protein