PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY48145.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family GATA
Protein Properties Length: 626aa    MW: 67609.1 Da    PI: 5.6508
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY48145.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA49.94.5e-16289324134
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkg 34 
                 C++Cg ++  Tp++Rrgp+g+++LCnaCGl+++ kg
  GAY48145.1 289 CTHCGISSksTPMMRRGPSGPRSLCNACGLFWANKG 324
                 *****99999***********************998 PP

2GATA48.71e-15478514135
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C++Cg+++  Tp +Rrgp g++tLCnaCGl+++ kg+
  GAY48145.1 478 CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKGT 514
                 *******99*************************997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 626 aa     Download sequence    
MPTQTGPANL QTSTADTSLP SLRQQPRPAA AVLTALPTLC EPPSISSSHL PPPCLQPSTT  60
IIATVSIVRH LVSGRPLRDG NSPSQIHSHE GFTQNLQKVF PFGNASKSPS FGPISPMYGQ  120
SQSMNISSQM SGGGAAADED DVSVAADDHH LSYDPHSALE NGIVVVEDVA HDSGYATGGN  180
ELSNSSQLTL SFRGQVYVFD SVTPDKGIAD YPAKCTQPQR AASLDRFRQK RKERCFDKKV  240
RYSVRQEVAL RMQRNKGQFT SAKKCEGGAL GWSNAQDPGQ DDSPSETSCT HCGISSKSTP  300
MMRRGPSGPR SLCNACGLFW ANKGALRDLG KKMEDQPLTP AEQGEGEVND SDCGTAAHTD  360
NELVQAVLLL LGGRDIPTGV PTIEVPYDQS NRGVVDTPKR SNLSRRIASL VRFREKRKER  420
CFDKKIRYSV RKEVAQRMHR KNGQFASLKE SSGASPWDSS QDGIQDGTPR PETVVRRCQH  480
CGVSENNTPA MRRGPAGPRT LCNACGLMWA NKGTLRDLSK GGRSLSMDQL EPETPMDVKP  540
SIMEGEFSGN QDELGTPEDP AKAVNQGSDN PSIDPDEEDM HGAAEDLTNS LPMGLVHSSA  600
DDDEQEPLVE LANPSDTDID IPSNFD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1396401DTPKRS
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G24470.37e-65GATA family protein