PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023929738.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family GATA
Protein Properties Length: 1112aa    MW: 121430 Da    PI: 7.1866
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023929738.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA52.28.2e-179921023132
            GATA    1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrk 32  
                      C+nC+t+ Tp WRrgp+g++ LCn+CGl+++k
  XP_023929738.1  992 CANCHTKVTPEWRRGPSGQRDLCNSCGLRWAK 1023
                      ******************************99 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1112 aa     Download sequence    
MYSYGYHEDG YDSNGMGQTE DGSPGNYMLG QDVAMTGGQS LDEIVNQSAK IIGRQSMPPQ  60
QLSQQYSASP SLDQSLGADV RQGTTMMDFG GTSPVVQTGP FQFNSNIMDQ GGGMMVSPMQ  120
MQMSSQTTQM PQAQSSTSHR GSADNSQPSG DLTNLNNNYT NNYNPMRHSS GTFSASPAHQ  180
RSSIDMNMGV SQLNHNLGLS MEYGADQDMN TPMSGGGINM DMFSPAQYSN QMMNSPMQQA  240
PSHGVSQGSN VQQRQMSQES ASRSASIATG QLHPPSRTHS MHTNSEKSPT QLGPALNVSG  300
SLLAQQIGHN QFQTSSPQQN QAHNRSLNSD QYQSYQQQQE HSRTGSAQDH DMQQTTSDAP  360
STGFDGINGP VPINLTSYNP NNQGFQWETP EGGWPSTLVG RPHAQSTYKN AYSSTGFDML  420
GVLMRVATRP NPQINIGAVD LSCAFVVCDA EKDDTPIVYC SDNFERLTGY TKHMILGRNC  480
RFLQSPDGQV SPGIRRKYVD DDSVLYLKNM TNLQREAQIS LINYRRGGQP FMNLLTMIPI  540
TWDSDKIKFF VGFQVDLVEQ PNAVTDKNAD GSYSINYQRG MAMPPYVMPA SESVSKTELG  600
QTVSREDVST LLSTVGNGES EYAKRIWDKV LLENTDDVVH VLSLKGLFLW ISPSSARVLE  660
YEPSELMGTA LSSVCHPSDI VPVTRELKDT SSAASVNVVF RIRRKVSGYM WFEGHGSLHT  720
EQGKGRKSII MVGRERPVYT LSKKDLEGLG GIGESELWSK ISTSGMFLFV SSNVRQLLDR  780
QPDELVGISI QTLMRSESKA DFGRILEHAR TGMKASVKHE MINRRGQVLS AYTTVYPGDA  840
QEGQKPTFVL AQTRLIKFSR GAQMVRNNSG YSKSERNPSE SALATGTISA RQGGSGTSPN  900
SAVSSRHYRN TDGTAATYAG QFGLVIGHQD QSLASEDNLF DELKTTKSSS WQYEIRQLEK  960
RNRILAEEVQ SLIAAKKKRK RRKGAGNMQK DCANCHTKVT PEWRRGPSGQ RDLCNSCGLR  1020
WAKINGRVSP RTNSVYSSGA ASDKASKASA SPLHVKSEFP TPAPTPSAVP TSEGQEGQPP  1080
PSKAVRLSGD HSASMEGASG VPDSIREDVE PD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1976982KKKRKRR
2977983KKKRKRR
3977982KKRKRR
4978983KRKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G36620.14e-09GATA family protein