PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400033095
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 569aa    MW: 63905.8 Da    PI: 6.78
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
PGSC0003DMP400033095  genome  PGSC    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  27.7   4.5e-09  2      27   31         56
                          HHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox 31 eeLAkklgLterqVkvWFqNrRakek 56
                          +eL++kl+L+  q+k+WFqNrR++ k
  PGSC0003DMP400033095  2 KELSNKLELEPLQIKFWFQNRRTQIK 27
                          79********************9988 PP

2    START     140.1  2.3e-44  104    308  4          204
                           HHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
                 START   4 eeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                            +a++el+++a+ +ep+W          +n++e+ +kf++++       ++ a+r+s++v+m++ +lve+++d++  W + +      a
  PGSC0003DMP400033095 104 RAAMYELLQMAQMGEPLWLPNIdgvnNDLNEEEYKRKFPRGNEpkpngIKTTASRESVLVTMNHINLVEIFMDTN-HWARFFSsivlTA 191
                           579*****************99999999***********99989**9999*************************.*******555555 PP

                           EEEEEECTTEEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEE CS
                 START  80 etlevissggalqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwv 167
                            t++v++  g    ++ae+q++sp  p Rd +fvR++ +  +g wvivdvS+d++   p    + R+ ++pSg++i++ sn  skvtwv
  PGSC0003DMP400033095 192 RTMDVLD--GSTKMIYAEFQVPSPQIPnRDCYFVRSCNKIVDGLWVIVDVSLDHT---P----ITRCWKRPSGCVIKQISNDISKVTWV 271
                           5555554..7888999************************************986...3....57************************ PP

                           E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXX CS
                 START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqc 204
                           eh++ ++ l+h  ++ +v+s+la+gak+w + l+rqc
  PGSC0003DMP400033095 272 EHIEADDTLVHTFYKTFVNSSLAFGAKRWISILDRQC 308
                           ************************************9 PP
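
The Start and End columns above can be used to cut a domain out of the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the alignment rows above); the function name `domain_subsequence` is hypothetical and not part of PlantTFDB.

```python
def domain_subsequence(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    # Python slices are 0-based and end-exclusive, so shift only the start.
    return protein[start - 1:end]

# First 30 residues of the protein sequence listed in this entry.
n_terminus = "MKELSNKLELEPLQIKFWFQNRRTQIKNQD"

# Homeobox domain: Start 2, End 27 in the table above.
homeobox = domain_subsequence(n_terminus, 2, 27)
print(homeobox)
```

The printed fragment can be compared against the query row of the Homeobox alignment as a check on the coordinate convention.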

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
CDD              cd00086            4.48E-6   1      29   No hit       No description
PROSITE profile  PS50071            11.418    1      29   IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60   1.3E-8    2      33   IPR009057    Homeodomain-like
SuperFamily      SSF46689           7.27E-7   2      30   IPR009057    Homeodomain-like
Pfam             PF00046            1.9E-6    2      27   IPR001356    Homeobox domain
PROSITE pattern  PS00027            0         4      27   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            31.146    92     313  IPR002913    START domain
SuperFamily      SSF55961           2.47E-30  98     310  No hit       No description
SMART            SM00234            3.7E-27   101    310  IPR002913    START domain
Pfam             PF01852            6.6E-36   104    308  IPR002913    START domain
CDD              cd08875            5.75E-87  106    308  No hit       No description
Gene3D           G3DSA:3.30.530.20  1.5E-7    169    290  IPR023393    START-like domain
SuperFamily      SSF55961           3.3E-14   331    522  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 569 aa
MKELSNKLEL EPLQIKFWFQ NRRTQIKNQD QHSENLSLRA ENDKLRAECV WLSEAINNGC  60
PNCGDHGFRL GETPNNEQYL RLENARLQEE VVHISRQYIA KVIRAAMYEL LQMAQMGEPL  120
WLPNIDGVNN DLNEEEYKRK FPRGNEPKPN GIKTTASRES VLVTMNHINL VEIFMDTNHW  180
ARFFSSIVLT ARTMDVLDGS TKMIYAEFQV PSPQIPNRDC YFVRSCNKIV DGLWVIVDVS  240
LDHTPITRCW KRPSGCVIKQ ISNDISKVTW VEHIEADDTL VHTFYKTFVN SSLAFGAKRW  300
ISILDRQCGR LASAEATNLP QSNIIHTLST GRKSALKLGE RMIIDYISGV SGTTTHQWTT  360
FTRSGYNTND VQVMTRQSIN DPGRPRGLVL CASTSIWLPV LPKLVFDFLG NENTRGKWDI  420
LSNVGTIQQV THIANGTEIG NSISILRVNS PNPAQNDMLI FQESITDPTG SFIVYAPIDI  480
RAIDMVLCGG NPDGVPLLPS GFAIFPDGPS SSTNYEISDY SGSFLTISFQ ILVHNVPTAN  540
ISPQSIASVN KLMFCTIDKI KNALFLNF*
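
The sequence block above is grouped in tens with a running position counter at the end of each full line. The following is a minimal sketch of turning such a block into a plain residue string, assuming the trailing `*` is a stop marker rather than a residue; the function name `parse_sequence_block` is hypothetical, and the sample holds only the first and last lines of the block.

```python
import re

def parse_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one residue string."""
    # Remove whitespace and the position counters at the end of each line.
    residues = re.sub(r"[\s\d]", "", block)
    # Drop the terminal '*' (assumed here to mark the stop codon).
    return residues.rstrip("*")

# First and last lines of the block, as an abbreviated sample.
sample = """\
MKELSNKLEL EPLQIKFWFQ NRRTQIKNQD QHSENLSLRA ENDKLRAECV WLSEAINNGC  60
ISPQSIASVN KLMFCTIDKI KNALFLNF*
"""

sequence = parse_sequence_block(sample)
print(len(sequence), sequence[:10], sequence[-8:])
```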
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  PGSC0003DMP400033095
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975446  1e-144   HG975446.1 Solanum pennellii chromosome ch07, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006342404.1        0.0      PREDICTED: homeobox-leucine zipper protein ROC7-like
Swissprot  A2YR02                0.0      ROC7_ORYSI; Homeobox-leucine zipper protein ROC7
Swissprot  A3BPF2                0.0      ROC7_ORYSJ; Homeobox-leucine zipper protein ROC7
TrEMBL     M1BN38                0.0      M1BN38_SOLTU; Uncharacterized protein
STRING     PGSC0003DMT400048971  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA1721855
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  0.0      protodermal factor 2
Publications
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]