PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OIW20604
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade; genistoids sensu lato; core genistoids; Genisteae; Lupinus
Family HD-ZIP
Protein Properties Length: 973aa    MW: 108900 Da    PI: 7.2936
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OIW20604genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox62.46.8e-203287156
              TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
  Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
              +++ +++t++q+++Le++F+++++p++++r +L+++lgL  rq+k+WFqNrR++ k
  OIW20604 32 KKRYHRHTANQIQRLESMFKECPHPDEKQRMQLSRELGLAPRQIKFWFQNRRTQIK 87
              688999**********************************************9877 PP

2START176.81.3e-552324562206
               HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT CS
     START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg 88 
               +a  a++el+++ + +ep+W+kss    + +n d++ + f++ ++       ++ea+r+sgvv+m+  +lv  ++d + +W e ++     a t ev+ssg
  OIW20604 232 IASNAMEELIRLLQTNEPLWMKSStdgrDVLNIDTYERMFPKPNShsknpnVRIEASRDSGVVIMNGLTLVDMFMDPN-KWMELFPtivtMARTFEVLSSG 331
               67899*******************9999888888888888877778999999**************************.999999999999********** PP

               ......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.HHHH CS
     START  89 ......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp.hwll 181
                     g+lqlm+ e+q+lsplv+ R+f+f+Ry++q ++g w+ivdvS d +q+++   +  R+++lpSg+ i++++ng+s+vtwvehv++++++p h l+
  OIW20604 332 iigghsGTLQLMYEEMQVLSPLVStREFYFLRYCQQIEQGLWAIVDVSYDFPQDNQ-FVPQFRSHRLPSGCFIQDMPNGYSQVTWVEHVEVEDKTPvHRLY 431
               *******************************************************9.7999**************************************** PP

               HHHHHHHHHHHHHHHHHHTXXXXXX CS
     START 182 rslvksglaegaktwvatlqrqcek 206
               r l+ sgla+ga +w++ lqr ce+
  OIW20604 432 RNLLYSGLAFGAHRWLSNLQRMCER 456
               ***********************97 PP

3START139.24.3e-4454971061206
               HHHHHCCCGG.....CT-TT-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE- CS
     START  61 lveellddke.....qWdetla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRa 145
                v  +++d +     +W e ++     a t ev+ssg      g+lqlm+ e+q+lsplv+ R+f+f+Ry++q ++g w+ivdvS d +q+++   +  R+
  OIW20604 549 IVFNFFKDERkrpqnKWMELFPtivtMARTFEVLSSGiigghsGTLQLMYEEMQVLSPLVStREFYFLRYCQQIEQGLWAIVDVSYDFPQDNQ-FVPQFRS 648
               444555555555669999999999989*****************************************************************9.7999*** PP

               EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
     START 146 ellpSgiliepksnghskvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
               ++lpSg+ i++++ng+s+vtwvehv++++++p h l+r l+ sgla+ga +w++ lqr ce+
  OIW20604 649 HRLPSGCFIQDMPNGYSQVTWVEHVEVEDKTPvHRLYRNLLYSGLAFGAHRWLSNLQRMCER 710
               ************************************************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 973 aa     Download sequence    
MEYGVGDGGG SSGNHHHHHH DGGSSDSQRR KKKRYHRHTA NQIQRLESMF KECPHPDEKQ  60
RMQLSRELGL APRQIKFWFQ NRRTQIKAQH ERADNCTLRA ENDRIRCENI AYKEALKNMF  120
CPSCGGPPLH EDPYVDEQKL RMDNAQLKEE LERVSSIAAK YIGRPISQLP SIQPIHMSSL  180
DLSMGSFVTQ GLSVGPSLDL DLLPGNGTSS SMQNVPYQPT LSDMDNSLMS DIASNAMEEL  240
IRLLQTNEPL WMKSSTDGRD VLNIDTYERM FPKPNSHSKN PNVRIEASRD SGVVIMNGLT  300
LVDMFMDPNK WMELFPTIVT MARTFEVLSS GIIGGHSGTL QLMYEEMQVL SPLVSTREFY  360
FLRYCQQIEQ GLWAIVDVSY DFPQDNQFVP QFRSHRLPSG CFIQDMPNGY SQVTWVEHVE  420
VEDKTPVHRL YRNLLYSGLA FGAHRWLSNL QRMCERIACL MVSGNSTRDL GSVIPSAEGK  480
RSMMKLAQRM ITNFSASIST SCSNRWTTLS GLNEIGVRVI VHNSSHPGQP NGVVLSAATT  540
IWLPIPPQIV FNFFKDERKR PQNKWMELFP TIVTMARTFE VLSSGIIGGH SGTLQLMYEE  600
MQVLSPLVST REFYFLRYCQ QIEQGLWAIV DVSYDFPQDN QFVPQFRSHR LPSGCFIQDM  660
PNGYSQVTWV EHVEVEDKTP VHRLYRNLLY SGLAFGAHRW LSNLQRMCER IACLMVSGNS  720
TRDLGSVIPS AEGKRSMMKL AQRMITNFSA SISTSCSNRW TTLSGLNEIG VRVIVHNSSH  780
PGQPNGVVLS AATTIWLPIP PQIVFNFFKD ERKRPQWDVL SNGNAVQEVA HIANGSNPGN  840
CISVLRAFNT SQNSMLILQE SCIDSSGSLV VYCPVELSAI NIAMSGEDPS YIPLLPSGFT  900
IAPDGQTDQG HGDGASTSTN KNSSGGSLVT VAFQILVSSL PSSKLNKESV NTINNLIGTT  960
VQQIKAALNC HSS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12934RRKKKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73360.10.0HD-ZIP family protein