PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_023922215.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family GATA
Protein Properties Length: 1134 aa    MW: 123662 Da    pI: 7.4082
Description GATA family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_023922215.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    GATA    52.7   6e-17    996    1027  1          32
            GATA    1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrk 32  
                      C+nC+t+ Tp WRrgp+g++ LCn+CGl+++k
  XP_023922215.1  996 CVNCHTKVTPEWRRGPSGQRDLCNSCGLRWAK 1027
                      ******************************99 PP
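
The aligned query segment can be checked against the cysteine spacing commonly described for GATA-type zinc fingers (Cys-X2-Cys-X17-20-Cys-X2-Cys). The sketch below is only an illustration: the regular expression is an assumed simplification, not the profile HMM that PlantTFDB uses, and `loop_length` is a hypothetical helper name.

```python
import re

# Query segment copied from the alignment above (residues 996-1027).
QUERY = "CVNCHTKVTPEWRRGPSGQRDLCNSCGLRWAK"

# Assumed pattern: Cys-X2-Cys-X(17-20)-Cys-X2-Cys.
# This is a simplification for illustration, not the database's HMM.
ZNF_PATTERN = re.compile(r"C.{2}C(.{17,20})C.{2}C")


def loop_length(segment):
    """Return the number of residues between the two Cys pairs, or None."""
    match = ZNF_PATTERN.search(segment)
    return len(match.group(1)) if match else None


print(loop_length(QUERY))  # 18
```

Under this assumed pattern the segment matches with an 18-residue loop between the two cysteine pairs.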

Sequence
Protein Sequence    Length: 1134 aa
MYRYGYHEDG YDVNGMGQTE EEGAGNYMLG QDVSMSGGQS LDEIVNQSAK IIRRQSMPQQ  60
QQLSQQYSTS PGMDPSLGAD VQRATAMMDF GGASPAVQTG AYQFNSNIMD QGGGMMVSPM  120
QIQMPSQTTQ LSQAQGAPQH RGSGDNPQSS GDLTDLSNNY NNSYDAMGHS SGAFSASPAH  180
QRSSMDMNMG ASQLNHNLRL SMDYAVDQGM NTPMSGAGMN MDMFGPVQYS NQMMNSPMQQ  240
ASSQAVSQGS NVQQRQMSQG SVSRSASVAT GQLHPPSRAH SMHTTSEKSP AQHGQTHNAP  300
SSLLAQQGAH NQFQTSGSQQ NQASNLSRNS DSYQNYQQQA QSQTQSQPEL AQDQNMQQTT  360
SGPHSSSYDG VNGPVPINLT TYNPNNQGFK WETPEGGWPS TLVGRPHAQS SYKNAYSATG  420
FDMLGVLMRV ATRPNPQINI GAVDLSCAFV VCDAEMDDTP IVYCSDNFER LTGYTKHMIL  480
GRNCRFLQSP DGQVSPGIRR KYVDDDSVLY LKNMTNLQRE AQISLINYRR GGQPFMNLLT  540
MIPITWDSDK IKFFVGFQVD LVEQPNAVTD KNPGGSYSIN YQRGMTMPPY VMPAPESVSK  600
TELGQTVSRE DVSTILATVG NGESEYAKRI WDKVLLENTD DVVHVLSLKG LFLWISPSSA  660
RVLEYEPGEL IGTALSSVCH PSDIVPVTRE LKDTSSAASV NVVFRIRRKV SGYMWFEGHG  720
SLHTEQGKGR KSIIMVGRER PVYTLSKKDL DGLGSIGENE LWSKLSTSGM FLFVSSNVRQ  780
LLDRQPDELV GVSIQTLMRS ESKAEFGRIL EHARTGMKAS VKHEMINRRG QVLSAYTTVY  840
PGDAQEGQKP TFILAQTRLI KFSRGAQMVR NNSTYSKSER NTSEPAPAKV AVHTGLSGSG  900
TSPNSVVSSG QYRNTDGTVV TYAGQFGLVI GHQDQSLASE DNLFDELKTT KSSSWQYEIR  960
QLEKRNRILA EEVQSLIAAK KKRKRRKGAG NMQKDCVNCH TKVTPEWRRG PSGQRDLCNS  1020
CGLRWAKING RVSPRTNSVY SGGAASDKAS KASASPLHQS NLPHQQLPSP FGSTAAVKSE  1080
FPTPNPSSCV VPTSEGRESQ PPPSKALRLS GDHSASMEGA SGVPASIEED VEPD
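
The sequence block is printed in groups of ten residues with a running position counter at the end of each line. A minimal sketch of how such lines could be joined and then sliced with the 1-based, inclusive coordinates used in the domain table follows. The helper names `parse_block` and `subseq` are hypothetical, and only the two lines covering residues 961 to 1080 are used as input.

```python
def parse_block(lines):
    """Join sequence lines, dropping spaces and trailing position counters."""
    groups = []
    for line in lines:
        for token in line.split():
            # Keep residue groups; skip the numeric counter at line end.
            if token.isalpha():
                groups.append(token)
    return "".join(groups)


def subseq(seq, start, end, first_pos=1):
    """Return residues start..end (1-based, inclusive).

    first_pos is the sequence position of seq[0], which lets an
    excerpt be addressed with full-length coordinates.
    """
    return seq[start - first_pos:end - first_pos + 1]


# Two lines copied from the record (residues 961-1080).
EXCERPT = [
    "QLEKRNRILA EEVQSLIAAK KKRKRRKGAG NMQKDCVNCH TKVTPEWRRG PSGQRDLCNS  1020",
    "CGLRWAKING RVSPRTNSVY SGGAASDKAS KASASPLHQS NLPHQQLPSP FGSTAAVKSE  1080",
]

excerpt_seq = parse_block(EXCERPT)
gata_domain = subseq(excerpt_seq, 996, 1027, first_pos=961)
print(len(excerpt_seq), gata_domain)
```

The slice for positions 996 to 1027 reproduces the query line of the GATA alignment, which suggests the Start and End columns are 1-based and inclusive.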
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    980    986  KKKRKRR
2    981    987  KKKRKRR
3    981    986  KKRKRR
4    982    987  KRKRRK
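
The listed coordinates can be cross-checked against the protein sequence. The sketch below assumes 1-based, inclusive positions and uses only the sequence line covering residues 961 to 1020; `check_rows` is a hypothetical helper, not part of any PlantTFDB tooling.

```python
# Residues 961-1020 of the protein sequence, spaces removed.
REGION = "QLEKRNRILAEEVQSLIAAKKKRKRRKGAGNMQKDCVNCHTKVTPEWRRGPSGQRDLCNS"
REGION_START = 961

# (No., Start, End, Sequence) as listed in the NLS table.
NLS_ROWS = [
    (1, 980, 986, "KKKRKRR"),
    (2, 981, 987, "KKKRKRR"),
    (3, 981, 986, "KKRKRR"),
    (4, 982, 987, "KRKRRK"),
]


def check_rows(region, region_start, rows):
    """Map row number -> (observed residues, matches listed sequence)."""
    results = {}
    for number, start, end, listed in rows:
        # Convert 1-based inclusive coordinates to a Python slice.
        observed = region[start - region_start:end - region_start + 1]
        results[number] = (observed, observed == listed)
    return results


for number, (observed, ok) in check_rows(REGION, REGION_START, NLS_ROWS).items():
    print(number, observed, "match" if ok else "differs from table")
```

Under these assumptions rows 1, 3 and 4 reproduce the listed sequences, while positions 981 to 987 read KKRKRRK rather than the KKKRKRR shown for row 2.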
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT4G36620.1   1e-08    GATA family protein