PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TRIDC5BG020600.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 468 aa    MW: 50638.7 Da    pI: 9.3778
Description GATA family protein
Gene Model
Gene Model ID      Type    Source         Coding Sequence
TRIDC5BG020600.2   genome  EnsemblPlants  View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    35.5   1.4e-11  110    144  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5BG020600.2 110 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 144
                       99******************************997 PP

2    GATA    32.6   1.1e-10  200    234  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5BG020600.2 200 CLHCGSSEPPLWIEGSMGRREVCTACGMRYKKGRL 234
                       99******************************986 PP

3    GATA    55.1   1e-17    304    338  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5BG020600.2 304 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 338
                       *******************************9886 PP

Sequence
Protein Sequence    Length: 468 aa
MAGGGGAGGK EDGGTEPLLL SVLALPATAL PAVVSRLEAA VPRKARSYLP RNVPSAWWSL  60
RIPFIQPLPP AGDPANEEEG RRFPRPQRVQ VAPSLDPGTA DKPPKRLKRC LHCKAVETPQ  120
RRSGPMGRGT LCNACGVWYS KNGTLPEHLP VSSPIVDSPL ENPIWEPEVP GAIYLVRKSA  180
TERMPPRTEA APAPRPGTSC LHCGSSEPPL WIEGSMGRRE VCTACGMRYK KGRLLPECRP  240
AECSVTDSRQ ESPVINSPPE SPIWEPEAPP SVHLPRKPSK KKKRRRSRSE APSAPWPANK  300
GKRCQHCGSS ETPQWREGPK GRATLCNACG VRYRQGRLLP EYRPMASPTF VPTKHANSHR  360
KVLQLHRTRQ SNDEHPSPLP ADSVTNLPPI RDELPTTSTA GLASEDPTDA PGYTDNPINV  420
PSSLDSLLLD GPSAPLIILM LGNFKLSITE GHCTGPSSDQ TSSVNSTI
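
The domain coordinates in the Signature Domain table can be checked against the protein sequence above. The following is a minimal sketch, not part of the PlantTFDB record: it assumes the sequence block is copied as displayed (10-residue groups with running position counts) and that Start/End are 1-based, inclusive positions. The names `RAW`, `clean_sequence`, `region`, and `GATA_DOMAINS` are hypothetical helpers introduced only for this illustration.

```python
import re

# Sequence block as displayed in the record: 10-residue groups,
# with a running position count at the end of each full line.
RAW = """
MAGGGGAGGK EDGGTEPLLL SVLALPATAL PAVVSRLEAA VPRKARSYLP RNVPSAWWSL  60
RIPFIQPLPP AGDPANEEEG RRFPRPQRVQ VAPSLDPGTA DKPPKRLKRC LHCKAVETPQ  120
RRSGPMGRGT LCNACGVWYS KNGTLPEHLP VSSPIVDSPL ENPIWEPEVP GAIYLVRKSA  180
TERMPPRTEA APAPRPGTSC LHCGSSEPPL WIEGSMGRRE VCTACGMRYK KGRLLPECRP  240
AECSVTDSRQ ESPVINSPPE SPIWEPEAPP SVHLPRKPSK KKKRRRSRSE APSAPWPANK  300
GKRCQHCGSS ETPQWREGPK GRATLCNACG VRYRQGRLLP EYRPMASPTF VPTKHANSHR  360
KVLQLHRTRQ SNDEHPSPLP ADSVTNLPPI RDELPTTSTA GLASEDPTDA PGYTDNPINV  420
PSSLDSLLLD GPSAPLIILM LGNFKLSITE GHCTGPSSDQ TSSVNSTI
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# (Start, End) pairs copied from the Signature Domain table.
GATA_DOMAINS = [(110, 144), (200, 234), (304, 338)]

seq = clean_sequence(RAW)
print("length:", len(seq))
for start, end in GATA_DOMAINS:
    print(start, end, region(seq, start, end))
```

Under these assumptions, each printed slice should match the TRIDC5BG020600.2 row of the corresponding alignment in the Signature Domain section, and the cleaned length should equal the 468 aa stated in the record.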
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    280    285  KKKKRR
2    280    286  KKKKRRR
3    281    287  KKKKRRR
4    281    286  KKKRRR
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G32890.1  2e-30    GATA family protein