PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG90767.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family GATA
Protein Properties Length: 1707aa    MW: 185344 Da    PI: 7.1677
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG90767.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA59.54.3e-1913351369135
        GATA    1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35  
                  C+ Cgt kTplWR+gp+g+k+LCnaCG++++k+++
  GBG90767.1 1335 CVECGTMKTPLWRNGPRGPKSLCNACGIRFKKERK 1369
                  ********************************986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1707 aa     Download sequence    
MTLLTRSQTA GMDQKANETG EAYEARLAAI MTESKQRAEA SAAARKTREA EAERLRQITK  60
DQRQKHAAAA AKDADEERVR RREILFREET ALHAQARDWR QEAENGDSVD YGTRIALLLN  120
SVTDLLATCM AQQEDIHSLD HANQALTKRI QQLEQRPVAT SSAGPSDLVD RVNILEIDVG  180
TLKTETQRLD QQATCAYERT DEIGLCFLHT VAAAKSQPTD LSSDPRVVRL LDEFADIFES  240
PTGVVPDRPI SQEVILEAGV VLPKGCIYRM SEEQLTVLRA ELDDLLDKGW IRPSSSPYGA  300
PVLFVQKKNK DLRLCIDYRK LNAQTVKNAG PLPRIGDLLE RLGDAEFFSK LDLKSGYHQI  360
SIRPQDRYKT AFKTWCGHFE WVVMPFGLTN APTTFRAAMT NEFRAMLDWF VLVFLIYSRT  420
LEEHLEHLRR VLETLRRAKY KAIRDKCKFV RQELEYLGHF VTPQGISPLS DKIQAIQDWP  480
EPRNITDVRS FLGLAGYYQR FIKGYSKITA HLSKLQCEDR PFDFGTDVRE SFLALKAALL  540
SAEVLRIYDR LLPTRVTTDA SGYGIGHGVA STEEKGEALR LGADIPEATG SSSEARRQQA  600
LVVQGGSGGR EPANAECGFA MASNGQREKG FEPSTSAAVC SGANSRVQNV IESWGALSMI  660
VEAKEGGGHG HGYEQKDGEY EREEDKRQCG KEEQAISRDG VTASDSSSFS RDVSGLSKYA  720
GSAFASCPQQ QLHAYNTAQQ HLRVGDLCHI HEGSNSCHEP VTAKDDQPTL DRILLNQHEM  780
HASQPRSPQP KRCRRASDQS LPARNGPVAT DSLTTQAGLV EEGRADEREI SRGNERREVG  840
PETTAVVSMV EAVAGNKDNG WDNGNVGFVN AWDAKQRMNC PKAGEEEMGK GATKDNIYEN  900
KSAARGSGSG NRDESDAVDM STVDDKGTGY GGSNEVQWTA EMTATVDGTA TSAIRPEVGL  960
VSWKGHGDRS KALTRASAPC PAVGSKRAYM RRGTNSDMHD RSTKNSKFLN AKAQRGRKEG  1020
SAKMAEQVDR RRAAAAAEGE SGQVSSGSGL GMGSGLKPNG RKVCTGRGGA CLAALRVHAT  1080
GGGLGMVVGV SLQQLSRSSQ RNVGLATRHF GHGSPNRTVK QTKKKLAASS SPPQPLAAAS  1140
APKRQGLNSI EGKKGVRKTS SVRQGWRRES AQRVLKKRIA MQVGREEKRR RGGGLLRNQV  1200
GHGRESSSGG TSPSPNLSKN SDDDDDEEEE DEEDEEDEEE EYNDDDNYEE EDEEDEEEEE  1260
EEECPGKRLA WAVRKRNNHD SSGKAEVATS AEQHLDRRDL GRGGLVNATH CGEKVGRRNR  1320
AEEPVGTKPG SARVCVECGT MKTPLWRNGP RGPKSLCNAC GIRFKKERKA LALAEAQAQA  1380
QAQANAEGNE QQENSSRGGK LVSSAPCRKG KTTVGSAVKS KKHLHLKNQK RKTAGETAAS  1440
VTDASDGTIA TTVSMEAVKA DVNAAPTEGC QCSSRQHSPC PFASDPPPSA SLQTQTTSDP  1500
PPVSLPTQTG HAHHKMTNYS SQCHVADAPD KAVSSAKLDA NKEEGNGNAT GIGAQDDGHG  1560
GDGHSAIRGG EMVAECGDRV GSSADSSVTL QGPPAMVSGP LKRKWKRMMG RSDPAVHESM  1620
QIQRVSIAGS GHSRCVELPV NWVAQGKKKV RERNGMAAIM DGMPMDDQHG EGKNNLLLEM  1680
EFGGCGGTAD EVQGAILLMT LFKGCPA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111671178RRESAQRVLKKR
211761191KKRIAMQVGREEKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G36620.12e-15GATA family protein