PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID BGIOSGA000782-PA
Common Name OsI_03822
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa
Family HD-ZIP
Protein Properties Length: 759aa    MW: 82008 Da    PI: 6.0196
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
BGIOSGA000782-PA  genome  RIS     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  20.5   8.1e-07  96     127  1          32
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeeree 32 
                       r++ +++t+eq++++e+lF+++++p++++r++
  BGIOSGA000782-PA  96 RKNYHRHTAEQIRIMEALFKESPHPDERQRQQ 127
                       788999************************98 PP

2    START     174.8  5.2e-55  255    481  2          206
                       HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
             START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                       la +a++elv + +++ep+Wv+ +    +++n+de+++ f ++++         ++ea+r++g+v  ++++lv +++d+  +W+  ++    k
  BGIOSGA000782-PA 255 LATRALDELVGMCSSGEPVWVRGVetgrDILNYDEYVRLFRRDHGgsgdqmagWTVEASRECGLVYLDTMQLVHTFMDVD-KWKDLFPtmisK 346
                       78899*************************************999***********************************.99998888888* PP

                       EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEE CS
             START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskv 164
                       a+tle is+       g+lqlm+aelq l+p+vp R+ +f+Ry+++l a++w+ivdvS d+ +     ss vR+ + pSg+lie+  ng++k+
  BGIOSGA000782-PA 347 AATLEMISNReddgrdGVLQLMYAELQTLTPMVPtRELYFARYCKKLAAERWAIVDVSFDESETGVHASSAVRCWKNPSGCLIEEQNNGRCKM 439
                       ********99999***********************************************9999998************************** PP

                       EEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 165 twvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       twveh+ ++  ++  l+r +  sg+a+ga++wva+lq qce+
  BGIOSGA000782-PA 440 TWVEHTRCRRCTVAPLYRAVTASGVAFGARRWVAALQLQCER 481
                       ****************************************97 PP

Protein Features
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           1.23E-5    79     127  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   1.5E-8     82     127  IPR009057    Homeodomain-like
CDD              cd00086            7.08E-4    93     127  No hit       No description
PROSITE profile  PS50848            40.04      245    484  IPR002913    START domain
SuperFamily      SSF55961           7.42E-32   246    481  No hit       No description
CDD              cd08875            2.15E-111  249    480  No hit       No description
SMART            SM00234            1.1E-46    254    481  IPR002913    START domain
Pfam             PF01852            2.3E-45    255    481  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  2.1E-4     293    445  IPR023393    START-like domain
SuperFamily      SSF55961           4.81E-5    503    608  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0009957  Biological Process  epidermal cell fate specification
GO:0010062  Biological Process  negative regulation of trichoblast fate specification
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 759 aa
MGTNRPRPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDG VEEGDEEVQA AGEAAVEISS  60
ENAGPGCSQS QSGGGSGEDG GHDDDDGEGS NKKRRRKNYH RHTAEQIRIM EALFKESPHP  120
DERQRQQAVQ ERHENSLLKS ELEKLQDEHR AMRELAKKPS RCLNCGVVAT SSDAVAAATA  180
ADTREQRLRL ENAKLKAEIE RLRGTPGKSA ADGVASPPCS ASAGAMQTNS RSPPLHDHDG  240
GFLRHDDDKP RILELATRAL DELVGMCSSG EPVWVRGVET GRDILNYDEY VRLFRRDHGG  300
SGDQMAGWTV EASRECGLVY LDTMQLVHTF MDVDKWKDLF PTMISKAATL EMISNREDDG  360
RDGVLQLMYA ELQTLTPMVP TRELYFARYC KKLAAERWAI VDVSFDESET GVHASSAVRC  420
WKNPSGCLIE EQNNGRCKMT WVEHTRCRRC TVAPLYRAVT ASGVAFGARR WVAALQLQCE  480
RMVFAVATNV PTRDSTGVST LAGRRSVLKL AHRMTSSLCR TTGGSRDMAW RRAPKGGSGG  540
GGDDDIWLTS RENAGDDPGE PQGLIACAAA STWLPVNPTA LLDLLRDESR RPEWDVMLPG  600
KSVQSRVNLA KGKDRTNCVT AYAARPEEEE ERGGKWVLQD VCTNPCESTI AYAAIDAAAL  660
QPVIAGHDSS GVHLLPCGFI SVMPDGLESK PAVITASRRG GEASGAGSLV TVAFQVPASP  720
SAAAATLSPD SVEAVTVLVS STLRNIRKAL GCDSCEEEF
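
The grouped, position-numbered block above is convenient to read but awkward to feed to other tools. The following is a minimal sketch, assuming Python, of one way to collapse such a block into a plain residue string and re-emit it as FASTA. The helper names `block_to_sequence` and `to_fasta` are illustrative and are not part of PlantTFDB; only the first two lines of the block are included to keep the example short.

```python
import textwrap


def block_to_sequence(block: str) -> str:
    """Join the 10-residue groups, dropping the trailing position counters."""
    # Residue groups are purely alphabetic; the counters (60, 120, ...) are not.
    groups = [tok for tok in block.split() if tok.isalpha()]
    return "".join(groups)


def to_fasta(seq_id: str, seq: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record with fixed-width lines."""
    lines = textwrap.wrap(seq, width)
    return f">{seq_id}\n" + "\n".join(lines) + "\n"


# First two lines of the sequence block, copied verbatim from the record.
block = """
MGTNRPRPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDG VEEGDEEVQA AGEAAVEISS  60
ENAGPGCSQS QSGGGSGEDG GHDDDDGEGS NKKRRRKNYH RHTAEQIRIM EALFKESPHP  120
"""

seq = block_to_sequence(block)
fasta = to_fasta("BGIOSGA000782-PA", seq)
print(len(seq))
print(fasta)
```

Pasting the full block in place of the two-line excerpt should give a string whose length matches the stated 759 aa, which is a quick check that no residues were lost in copying.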
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    90     97   NKKRRRKN
2    91     96   KKRRRK
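
The Start and End values in the NLS table do not appear to use the same convention as the domain table. Checked against the protein sequence, the NLS motifs line up when Start and End are read as 0-based inclusive indices, while the Homeobox alignment start (96) lines up as a 1-based position. The sketch below, assuming Python, shows that check on residues 1 to 120 of the record; the variable and function names are illustrative only.

```python
# Residues 1-120 of BGIOSGA000782-PA, copied from the sequence block.
first_120 = "".join("""
MGTNRPRPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDG VEEGDEEVQA AGEAAVEISS
ENAGPGCSQS QSGGGSGEDG GHDDDDGEGS NKKRRRKNYH RHTAEQIRIM EALFKESPHP
""".split())

# Rows of the NLS table: (No., Start, End, Sequence).
nls_rows = [(1, 90, 97, "NKKRRRKN"), (2, 91, 96, "KKRRRK")]


def matches_zero_based(seq: str, start: int, end: int, motif: str) -> bool:
    """True if motif occupies seq[start..end] with 0-based inclusive indices."""
    return seq[start:end + 1] == motif


def matches_one_based(seq: str, start: int, end: int, motif: str) -> bool:
    """True if motif occupies positions start..end with 1-based inclusive positions."""
    return seq[start - 1:end] == motif


for no, start, end, motif in nls_rows:
    print(no,
          matches_zero_based(first_120, start, end, motif),
          matches_one_based(first_120, start, end, motif))

# The Homeobox alignment begins "RKNY" at position 96; this matches 1-based numbering.
print(first_120[96 - 1:96 - 1 + 4] == "RKNY")
```

In 1-based terms the two motifs therefore cover residues 91 to 98 and 92 to 97, which overlap the start of the Homeobox domain at residue 96.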
Expression -- UniGene
UniGene ID  E-value  Expressed in
Os.75353    0.0      panicle
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AP003256  0.0      AP003256.3 Oryza sativa Japonica Group genomic DNA, chromosome 1, PAC clone:P0460E08.
GenBank  AP003274  0.0      AP003274.4 Oryza sativa Japonica Group genomic DNA, chromosome 1, PAC clone:P0512C01.
GenBank  AP014957  0.0      AP014957.1 Oryza sativa Japonica Group DNA, chromosome 1, cultivar: Nipponbare, complete sequence.
GenBank  CP012609  0.0      CP012609.1 Oryza sativa Indica Group cultivar RP Bio-226 chromosome 1 sequence.
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_015644459.1   0.0      homeobox-leucine zipper protein ROC9
Swissprot  Q5JMF3           0.0      ROC9_ORYSJ; Homeobox-leucine zipper protein ROC9
TrEMBL     B8A9T3           0.0      B8A9T3_ORYSI; Uncharacterized protein
STRING     ONIVA01G36080.1  0.0      (Oryza nivara)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP799               38           147
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein