PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Bobra.4_1s0081.3.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family GATA
Protein Properties Length: 830 aa    MW: 89209.7 Da    pI: 8.1869
Description GATA family protein
Gene Model
Gene Model ID       Type    Source     Coding Sequence
Bobra.4_1s0081.3.p  genome  Phytozome  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    43.2   5.3e-14  54     87   1          34
                GATA  1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34
                        C +Cg  ++p WRrgp  +++LCnaCG+++  kg
  Bobra.4_1s0081.3.p 54 CDHCGRRNSPAWRRGPADKPHLCNACGVRFLGKG 87
                        9***************99999********97766 PP
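
The aligned segment can be checked against the four-cysteine spacing usually described for GATA-type zinc fingers. The following is a minimal sketch, assuming the commonly cited C-X2-C-X17-20-C-X2-C pattern; the regular expression and the variable names are illustrative and are not part of PlantTFDB.

```python
import re

# Residues 54-87 of Bobra.4_1s0081.3.p, copied from the alignment above.
domain = "CDHCGRRNSPAWRRGPADKPHLCNACGVRFLGKG"

# Assumed GATA-type zinc-finger spacing: C-X2-C-X17-20-C-X2-C.
zinc_finger = re.compile(r"C.{2}C(.{17,20})C.{2}C")

match = zinc_finger.search(domain)
# Length of the loop between the two cysteine pairs, or None if no match.
loop_length = len(match.group(1)) if match else None
print(match.group(0) if match else "no match", loop_length)
```

For this segment the pattern matches with an 18-residue loop between the two cysteine pairs.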

Sequence
Protein Sequence    Length: 830 aa
MRQVYERMSL HGSLRSWDAG QAGRGSQLNR KRRRKETPIC ILHSTGGASA GGACDHCGRR  60
NSPAWRRGPA DKPHLCNACG VRFLGKGTLE GYMPGMKQRE DFKLPDDNLC GALDEEGGDD  120
TDSAKTTMPQ LSGKTRCRSQ RTPRRNTTPR SDMVSGPPAV ADAESDDGRA ASCYSCWADC  180
VRMALSHHPG NARKVVSFLE NACGLAMAEP LSSPHFLAEL NRVRDLYWMD KNRAMDALKH  240
LLEDEAALTL SGAEALLTLS GASPMPDDRK RDALCSTEDV NSRDSSWAVC EECGRARETV  300
CWLPVWWRYL CGDQDYSKNP ELTCKDVGDF VAPRGLPRSE VSPVIRRGIR GQGPQNRFLP  360
ASVGRSNARR TSGVADMSTE GFKVVDNPVY DGYGSPKSPR HKPFTGLRSS FTDSLSASDF  420
EFLRWLPQGW RMDFGWQRLG EPGRVSPAGL NYFAPDGSGP YQSQEDVLAR IAAQPSFRPP  480
QPGATAATFT PGPAFHRSPS GSVLVSMCGG AVSTTGEAPP GAIHKDTPMV QAEMQSYILQ  540
AETPITEAPH LARSEVSLTH LPSAAGEPSS EPSPLSAKEP AKGGPRSKKA RTKRSGPSHP  600
RQRQAKTFEV DGDFCFGVRE LVPNTFELGG WPRDLQQSIN VVYKPAKAAI DGTVPDAALR  660
GQGKMTPPQG LLQKEGMREL EPAQAHAGSN DKGRATQTTL VKHKSEPVGK LHGQALTELH  720
GPPSLLPPPR LLPPARPALG MTDELVFCPV EGPPLIVATP EGGSVANPRG LLRAVSDPSS  780
AAKAYAANAL EQAGASPEPH DLKEVDWKVL CPPGPCCAVP CLPAVTGYL*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    30     34   RKRRR
2    30     36   RKRRRKE
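
The NLS coordinates can be cross-checked against the protein sequence. This is a minimal sketch, assuming Start and End are 1-based and inclusive (the convention the domain alignment coordinates also appear to follow); the helper `slice_1based` is a hypothetical name introduced here for illustration.

```python
# First 60 residues of the protein, copied from the sequence block above
# with the grouping spaces and the trailing position counter removed.
n_terminus = "MRQVYERMSLHGSLRSWDAGQAGRGSQLNRKRRRKETPICILHSTGGASAGGACDHCGRR"


def slice_1based(sequence, start, end):
    """Return residues start..end, with both coordinates 1-based and inclusive (assumed)."""
    return sequence[start - 1:end]


nls_1 = slice_1based(n_terminus, 30, 34)
nls_2 = slice_1based(n_terminus, 30, 36)
print(nls_1, nls_2)
```

Both slices reproduce the listed signals (RKRRR and RKRRRKE), which supports reading the coordinates as 1-based and inclusive.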
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G17570.3  5e-11    GATA family protein