PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID NNU_020098-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family HD-ZIP
Protein Properties Length: 811aa    MW: 89525.3 Da    PI: 6.2355
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
NNU_020098-RA  genome  CAS     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64.5   1.5e-20  125    180  1          56
                    TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
       Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                    +++ +++t+ q++e+e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  NNU_020098-RA 125 KKRYHRHTARQIQEMEALFKECPHPDDKQRMKLSQELGLKPRQVKFWFQNRRTQMK 180
                    688899***********************************************998 PP

2    START     166.3  2.1e-52  337    562  2          206
                    HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
          START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                    la +a++elvk+  a++p+Wv+ss  + g evl  +e ++              +++ea r+s++v+m++ +lv  +ld + +W e ++    ka 
  NNU_020098-RA 337 LAMAAMDELVKMCHATDPLWVRSS--NGGREVLNLEEHARMfpwpmnvkqhnteFRIEATRDSALVIMNSITLVDAFLDAN-KWVELFPsivsKAR 429
                    7889********************..777777777766666777888899999****************************.************** PP

                    EEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE CS
          START  81 tlevissg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwve 168
                     ++vi+sg      g l lm+ae+q  splvp R+  f+Ry++q +++g+w+ivd  +d+  ++  ++s+ R +++pSg++ie+++ng+s+vtwve
  NNU_020098-RA 430 NVQVITSGvsghasGSLLLMYAEFQIQSPLVPtREAHFLRYCQQnMEEGTWAIVDFPIDNFHDNL-QASFPRYRRRPSGCIIEDMPNGYSRVTWVE 524
                    ***************************************************************98.9***************************** PP

                    -EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
          START 169 hvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                    +++ +++ +h++++++v+sg+a+ga++w+a lqrqce+
  NNU_020098-RA 525 QAEIEDKPVHQIFNHFVNSGMAFGAQRWLAVLQRQCER 562
                    ************************************97 PP

Protein Features
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          4.6E-20    110    183  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.5E-21    112    176  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.216     122    182  IPR001356    Homeobox domain
SMART            SM00389           2.7E-19    123    186  IPR001356    Homeobox domain
Pfam             PF00046           5.1E-18    125    180  IPR001356    Homeobox domain
CDD              cd00086           1.23E-18   125    183  No hit       No description
PROSITE pattern  PS00027           0          157    180  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           46.46      327    565  IPR002913    START domain
SuperFamily      SSF55961          1.73E-33   330    564  No hit       No description
CDD              cd08875           9.62E-120  331    561  No hit       No description
SMART            SM00234           5.9E-40    336    562  IPR002913    START domain
Pfam             PF01852           3.7E-44    337    562  IPR002913    START domain
SuperFamily      SSF55961          1.81E-13   579    782  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 811 aa
NKKRKKWRSG SSDAKKNHLI VSFFFWVNFC LVARMYGGGD CQVLSSLGGN VVSADPLFSS  60
PIRNPNLNFM ANLPFHPFSA IIPKEEIGIA KGKDEEMERS GSGSGHAEGP SLSGDEQENE  120
QQPKKKRYHR HTARQIQEME ALFKECPHPD DKQRMKLSQE LGLKPRQVKF WFQNRRTQMK  180
AQQDRADNVI LRTENENLKN ENFRLQAALR NIICPNCGGP AILGEVSFDE QHLRLENAKL  240
KEEVLFTDNE NIELLLQLER ISCVASRYSG RSIQALAPAP PLLLPSLDLD MGIYSRHFHE  300
PISNCTDIVP VAPLPENPQF AGGCMNMEQE KPLALELAMA AMDELVKMCH ATDPLWVRSS  360
NGGREVLNLE EHARMFPWPM NVKQHNTEFR IEATRDSALV IMNSITLVDA FLDANKWVEL  420
FPSIVSKARN VQVITSGVSG HASGSLLLMY AEFQIQSPLV PTREAHFLRY CQQNMEEGTW  480
AIVDFPIDNF HDNLQASFPR YRRRPSGCII EDMPNGYSRV TWVEQAEIED KPVHQIFNHF  540
VNSGMAFGAQ RWLAVLQRQC ERFASLMARN ISDLGDTADD TVRITTRKNT EPGQPNGTIL  600
GAVSTSWLPF PCHQVFDLLR DERRRSQLDV LSSGNSLHEV AHIANGSHPG NCISLLRINA  660
SSNSSQNVEL MLQESCTDTS GSLIVYSTMD VDAVQLAMSG EDPSYIPLLP IGFVVVPAGH  720
SSVSCNNGNG VPSPEEGNGQ APAGCLLTVG LQVLASTIPS AKLNLSSVTA VNNHICNAVH  780
QINAALSNSS NNSVGGGVSG GSSTEPAAAT D
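
The Start and End columns of the Signature Domain table can be checked against the sequence block above. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (this matches the Homeobox alignment row, which runs from 125 to 180). The helper names clean_sequence and slice_domain are hypothetical and not part of PlantTFDB; only the first three sequence rows (residues 1-180) are copied in to keep the example short.

```python
import re

# First three rows of the protein sequence block from this record
# (residues 1-180). The trailing numbers are position counters.
SEQUENCE_BLOCK = """
NKKRKKWRSG SSDAKKNHLI VSFFFWVNFC LVARMYGGGD CQVLSSLGGN VVSADPLFSS  60
PIRNPNLNFM ANLPFHPFSA IIPKEEIGIA KGKDEEMERS GSGSGHAEGP SLSGDEQENE  120
QQPKKKRYHR HTARQIQEME ALFKECPHPD DKQRMKLSQE LGLKPRQVKF WFQNRRTQMK  180
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}-{end} fall outside the sequence")
    return sequence[start - 1:end]


protein = clean_sequence(SEQUENCE_BLOCK)

# Homeobox domain coordinates taken from the Signature Domain table.
homeobox = slice_domain(protein, 125, 180)
print(len(homeobox), homeobox)
```

The extracted slice should be compared with the NNU_020098-RA row of the Homeobox alignment. The same approach applies to the START domain (337-562) once the full 811-residue sequence is pasted in.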
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    1      5    KKRKK
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010276832.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_010276833.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_010276834.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_010276835.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_010276836.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_019055645.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Refseq     XP_019055646.1  0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Swissprot  A2ZAI7          0.0      ROC3_ORYSI; Homeobox-leucine zipper protein ROC3
TrEMBL     A0A1U8BJ91      0.0      A0A1U8BJ91_NELNU; homeobox-leucine zipper protein ROC3-like isoform X1
STRING     XP_010276832.1  0.0      (Nelumbo nucifera)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G46880.1  0.0      homeobox-7