PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OGLUM02G32070.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family MYB
Protein Properties Length: 2082aa    MW: 232303 Da    PI: 7.3093
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OGLUM02G32070.1genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding41.62.9e-1318421889148
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                       +g+WT++Ed++lv      G  +W+++++  g+ R++k+c++rw +yl
  OGLUM02G32070.1 1842 KGPWTADEDQKLVTFLLSNGHCCWRLVPKLAGLLRCGKSCRLRWTNYL 1889
                       79******************************99************97 PP

2Myb_DNA-binding54.13.6e-1718981939447
                       S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                        ++eE++l +d+++qlG++ W++Ia++++ gRt++++k++w+++
  OGLUM02G32070.1 1898 LSEEEEKLVIDLHEQLGNR-WSKIAARLP-GRTDNEIKNHWNTH 1939
                       599****************.*********.************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2082 aa     Download sequence    
MQLRLHIPPS VDLFPFPQTA RAHQVRSATL LNPRSEQAHR TTPVAGDGSG ASAMDGAHGQ  60
RQPMSPAISA SAVLPQQRQM QLHHHHHHPA RSAIADLFTL YLGMNSKQRI EDPTRETSNK  120
LQKRVTAMNR DLPPRDEQFI SDFEQLHMQF PDQEQLQAVT ESVLISFVLQ CSSHAPQSEF  180
LLFATRCLCA RGHLRWDSLL PSLLNVVSSV EVPMGQGVSV TTGGPATSSS SAIAVPNAPS  240
FHPSNPTSPL SAMNTIGSPT QSGIDQPIGA NVSPIKGAEF SSPGQLGLTA RGDQSRRGAE  300
ISYLHHLSCR IILAGLESDL KPATHAVIFQ HMVNWLVNWD QRPHGVDQAD ALQLQTLRLE  360
RPLHEWMHLC LDVIWILVNE DKCRVPFYEL VRSNLQFLEN IPDDEALVSI IMEIHRRRDM  420
VCMHMQMLDQ HLHCPTFATH RFLSQSYPSI AGESVANLRY SPITYPSVLG EPLHGEDLAN  480
SIPKGGLDWE RALRCLRHAL CTTPSPDWWR RVLLVAPCYR QHPQQSSTPG AVFSPDMIGE  540
AVADRTIELL RLTNSETQCW QDWLLFADIF FFLMKSGCID FLDFVDKLAS RVTNSDQQIL  600
RSNHVTWLLA QIIRIEIVMN TLSSDPRKVE TTRKIISFHK EDKSLDPNNI SPQSILLDFI  660
SSSQTLRIWS FNTSIREHLN SDQLQKGKQI DEWWKQMTKA SGERMIDFTS LDERAMGMFW  720
VLSFTMAQPA CEAVMNWFTS VGVADLIQGP NLQPNERMTM MRETYPLSMS LLSGLSINLC  780
LKLAFQLEET IFLGQNVPSI AMVETYVRLL LITPHSLFRP HFTTLTQRSP SILSKSGVSL  840
LLLEILNYRL LPLYRYHGKS KALMYDVTKI ISMIKVKRGE HRLFRLAENL CMNLILSLRD  900
FFLVKKELKG PTEFTETLNR ITIISLAITM KTRGIAEVEH IIYLQPLLEQ IMATSQHTWS  960
EKTLRYFPPL IRDFLMGRMD KRGQAIQAWQ QAETTVINQC NQLLSPSAEP TYVMTYLSHS  1020
FPQHRQYLCA GAWMLMNGHL EINSANLARV LREFSPEEVT ANIYTMVDVL LHHIQLELQR  1080
GHQIQDLLSK AITNLAFFIW THELLPLDIL LLALIDRDDD PYALRLVINL LERPELQQRI  1140
KAFCTSRSPE HWLKNQPPKR VELQKALGNH LSGKERYPPF FDDIAARLLP VIPLIIYRLI  1200
ENDATDIADR VLAVYSTFLA FHPLRFTFVR DILAYFYGHL PSKLIVRILN VLGVSTKTPF  1260
SESFAQYLAS SNSSICPPPE YFANLLFGLV NNVIPPLSCK SKSNPSDAAG STARTTYNKP  1320
HTSSAGGISN SDGQRAFYQN QDPGSYTQLV LETAAIEILS LCVPASQIVS SLVQIIAHVQ  1380
AMLIQSNSGH GMSGGLGQNS GVPTSSGGGV EPVGANRPNT TASGINASNF VSRSGYSCQQ  1440
LSVLMIQACG LLLAQLPPEF HTLLYAEAAR IIKDCWWLAD SSRPVKELDS AVGYALLDPT  1500
WASQDNTSTA IGNIVALLHS FFSNLPHEWL ESTHTVIKHL RPVNSVAMLR IAFRILGPLL  1560
PRLAFARPLF MKTLALLFNV LGDVFGKNSQ ASPPVEASEI ADIIDFLHHA VMYEGQGGPV  1620
QSTSKPKLEI LTLCGKVMEI LRPDVQHLLS HLKTDPNSSV YAATHPKLLP PHSCTADYGR  1680
QYYIEVGFVN AEGSWLYTII GSTQADLIKH NKGERAVTQL LKDVDPLQTA VVLTLRRQRP  1740
RRELSPDYTC TRNSILEAWR ARRLAADVSD GLTPSRLRLA SAFPLLAASI NPRAAPEPLQ  1800
IKTRVSSSGS VHPSREIQIA SQIRDQRVMG RQPCCEKVGL KKGPWTADED QKLVTFLLSN  1860
GHCCWRLVPK LAGLLRCGKS CRLRWTNYLR PDLKRGLLSE EEEKLVIDLH EQLGNRWSKI  1920
AARLPGRTDN EIKNHWNTHI KKKLKKMGLD PVTHRPVMSL AQPDPLKQQQ QQQEPSVSGG  1980
TGADDKEEEE ETPTSAQPQG VACAASSASA VSSSCSSSAS ASAATPGADV DWPGLFEVDA  2040
ILDIDWAGLL SACGDDGGCS AIGVDMLFDQ CSDVGFDQDV WM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
117361743RRQRPRRE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G66230.15e-79MYB family protein