PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sof003489
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Saccharinae; Saccharum; Saccharum officinarum complex
Family HD-ZIP
Protein Properties Length: 244aa    MW: 26678.1 Da    PI: 5.2261
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PUT-157a-Saccharum_officinarum-22994PU_refplantGDBView CDS
PUT-157a-Saccharum_officinarum-22996PU_unrefplantGDBView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.51.3e-193387256
               T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
               +++ +f++eq++ Le++F+ + ++  +++ +LA++lgL+ rqV +WFqN+Ra++k
  Sof003489 33 KNKKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWK 87
               67789*************************************************9 PP

2HD-ZIP_I/II121.83.5e-3934125293
  HD-ZIP_I/II   2 kkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelke 93 
                  +k+r+s+eq+k+LE++F +++kLep++K +lareLglqprqva+WFqn+RAR+k+kqlE++y+aL++ ydal  + e+L+ke+++L ++l++
    Sof003489  34 NKKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLEREYSALRDDYDALLCSYESLKKEKHALLKQLEK 125
                  79*************************************************************************************99986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.6E-192090IPR009057Homeodomain-like
SuperFamilySSF466894.28E-192491IPR009057Homeodomain-like
PROSITE profilePS5007117.3292989IPR001356Homeobox domain
SMARTSM003891.9E-163293IPR001356Homeobox domain
PfamPF000465.3E-173387IPR001356Homeobox domain
CDDcd000865.74E-173390No hitNo description
PRINTSPR000318.0E-66069IPR000047Helix-turn-helix motif
PROSITE patternPS0002706487IPR017970Homeobox, conserved site
PRINTSPR000318.0E-66985IPR000047Helix-turn-helix motif
PfamPF021836.1E-1389130IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 244 aa     Download sequence    Send to blast
MDGEDDVPEW MMEVGGAGGK GGKGXXGGGP LDKNKKRFSE EQIKSLESMF ATQTKLEPRQ  60
KLQLARELGL QPRQVAIWFQ NKRARWKSKQ LEREYSALRD DYDALLCSYE SLKKEKHALL  120
KQLEKLAEML HEPRGKYGGN ADAGAGDDVR SGVGGMKEEF TDAGGAALYS SEGGGGGGGK  180
LAHFTDDDVG ALFRPSPQPT AAGFTSSGPP EHQPFQFHSS CWPSSTTEQT CSSSQWWEFE  240
SLSE
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12028GGKGXXGGG
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sof.115430.0crown| inflorescence| root| seed
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, leaves, nodes, internodes, flowers and embryo. {ECO:0000269|PubMed:10732669, ECO:0000269|PubMed:17999151}.
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, leaves, nodes, internodes, flowers and embryo. {ECO:0000269|PubMed:10732669, ECO:0000269|PubMed:17999151}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds to the DNA sequence 5'-CAAT[AT]ATTG-3'. {ECO:0000269|PubMed:10732669}.
UniProtProbable transcription factor that binds to the DNA sequence 5'-CAAT[AT]ATTG-3'. {ECO:0000269|PubMed:10732669}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002462710.11e-155homeobox-leucine zipper protein HOX6
SwissprotQ651Z55e-98HOX6_ORYSJ; Homeobox-leucine zipper protein HOX6
SwissprotQ9XH355e-98HOX6_ORYSI; Homeobox-leucine zipper protein HOX6
TrEMBLC5X6571e-153C5X657_SORBI; Uncharacterized protein
STRINGSb02g030660.11e-154(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G46680.11e-42homeobox 7
Publications ? help Back to Top
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]