PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OIW09140
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade; genistoids sensu lato; core genistoids; Genisteae; Lupinus
Family HD-ZIP
Protein Properties Length: 1321 aa    MW: 146582 Da    pI: 6.4459
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OIW09140 genome EnsemblPlants View CDS
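The record does not say how the MW value under Protein Properties was derived. One common estimate sums the average residue masses and adds one water molecule for the free termini. The sketch below illustrates that approach. The function name and the mass table are illustrative assumptions, not PlantTFDB's documented method, so the result may differ slightly from the reported 146582 Da.

```python
# Average residue masses in daltons (free amino acid minus one water).
# Standard textbook values; assumed here, not taken from PlantTFDB.
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # one H2O for the N- and C-terminal groups


def estimate_molecular_weight(sequence: str) -> float:
    """Estimate the average molecular weight of a protein in daltons."""
    sequence = sequence.upper()
    unknown = set(sequence) - set(AVERAGE_RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


# Usage: pass the contiguous one-letter sequence of the protein.
print(round(estimate_molecular_weight("GA"), 2))
```

Applied to the full 1321-residue sequence from the Sequence section, this should land near the reported MW, assuming the database uses average masses.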
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 Homeobox 55.5 9.9e-18 571 629 3 57
               --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
  Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57 
               k  ++t+eq+e+Le+++ ++++ps  +r++L + +    +++ +q+kvWFqNrR +ek+
  OIW09140 571 KYVRYTAEQVEALERVYMECPKPSSLRRQQLIRDCpilsNIEPKQIKVWFQNRRCREKQ 629
               56789****************************************************97 PP

2 START 156.6 2e-49 710 917 2 204
               HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEEEEEEXXTT CS
     START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galqlmvaelqa 100
               +aee+++e+++ka+ ++  Wv+++ +++g+e++ +f+ s+++ g a+ra+g+v  +++  v+e+l+d++ W + +++ e+  ++  g  g+++l +++++a
  OIW09140 710 IAEETLTEFLSKATGTAVDWVQMPGMKPGPESVGIFAISQGCIGVAARACGLVSLEPT-KVAEILKDRLSWFRECRSLEVFTTVPAGngGTIELVYTQTYA 809
               789*******************************************************.8999999999****************9999************ PP

               XX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHH CS
     START 101 lsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwv 197
               +++l+p Rdf+++Ry+ +l+ g++v++++S++     p     +++vRae+l Sg+li+p+++g+s +++v+h +l+ ++++++l++l++s+ + ++k++ 
  OIW09140 810 PTTLAPaRDFWTLRYTTTLENGSLVVCERSLSGSGAGPDaaaAAQFVRAEVLSSGYLIRPCEGGGSIIHIVDHLNLQPWSVPEVLQPLYESSKVVAQKMTI 910
               ********************************99999988999********************************************************** PP

               HHTXXXX CS
     START 198 atlqrqc 204
               a+l++ +
  OIW09140 911 AALRYIR 917
               **99865 PP
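The Start and End columns of the domain table read as 1-based, inclusive positions on the protein, which is consistent with the residue numbers printed beside each alignment row. As a minimal sketch under that assumption, the helper below cuts such a region out of a sequence. The names `slice_region` and `first_position` are hypothetical, and the fragment is residues 541-660 copied from the Sequence section of this record.

```python
# Residues 541-660 of OIW09140, copied from the protein sequence listing.
FRAGMENT_541_660 = (
    "VLKNMAMSGTQHRESNSSGGSSIDKHLDSGKYVRYTAEQVEALERVYMECPKPSSLRRQQ"
    "LIRDCPILSNIEPKQIKVWFQNRRCREKQRKEASRLHTVNRKLSAMNKLLMEENNRLQKQ"
)


def slice_region(sequence: str, start: int, end: int, first_position: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_position is the protein coordinate of sequence[0], so the helper
    also works on a fragment that does not begin at residue 1.
    """
    last_position = first_position + len(sequence) - 1
    if not first_position <= start <= end <= last_position:
        raise ValueError("region lies outside the supplied sequence")
    return sequence[start - first_position:end - first_position + 1]


# Homeobox domain: Start 571, End 629 in the domain table.
homeobox = slice_region(FRAGMENT_541_660, 571, 629, first_position=541)
print(homeobox)
```

The printed region can be compared by eye with the OIW09140 row of the Homeobox alignment, where lowercase letters mark residues inserted relative to the HMM.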

Sequence
Protein Sequence    Length: 1321 aa
MSSESHSDQF HLPTLTSSLF DPEEDSSIHD PNGVVHGSSS AHSQPSDDVS IRAHTASSQP  60
STPSSSGYVA EIEDVALQNE IQELSIHGDD QYHNALAYSN HDELIGKPHR SDEDDASISW  120
RKRKKHFFVL SHSGKPIYSR YGDEHKLAGF SATLQAIISF VENGDFSKGQ TFFHPVFRGD  180
RVKLVRAGKH QVVFLVKGPI YLVCISCTEE PYESLVEQLE LIYGQMIVIL TKAVNKCFEK  240
NPKFDMTPLL GGTDIVFSSL IHSFSWNPAT FLHAYTCLPL AYATRQAADA ILQDVADSGV  300
LFAILMCRHK VISLVGAQKA TLHPDDMLLL ANFVMSSESF RQADTYLMLL TTSSDAFYHL  360
KDCRIHIEMV LLKSNVLSEV QRSLLDGGMR VEDLPPLPRF GSSQLGQNRL QLDSPDRLRE  420
PNSGIGGDAG LWHFLYRSIY LDQYVSSEFS SPINTPQQQK RLYRAYQKFF VSMHDKGIGP  480
HKTQFRRDEN YVLLCWVTQD FELYAAFDPL ADKHYPLSEK SYKYKNLLIV LFVKCVIVRL  540
VLKNMAMSGT QHRESNSSGG SSIDKHLDSG KYVRYTAEQV EALERVYMEC PKPSSLRRQQ  600
LIRDCPILSN IEPKQIKVWF QNRRCREKQR KEASRLHTVN RKLSAMNKLL MEENNRLQKQ  660
VSLLVCENGY MRQQLRIPSA RAADASCDSA VTTPQHYMRD ANTPAGFLSI AEETLTEFLS  720
KATGTAVDWV QMPGMKPGPE SVGIFAISQG CIGVAARACG LVSLEPTKVA EILKDRLSWF  780
RECRSLEVFT TVPAGNGGTI ELVYTQTYAP TTLAPARDFW TLRYTTTLEN GSLVVCERSL  840
SGSGAGPDAA AAAQFVRAEV LSSGYLIRPC EGGGSIIHIV DHLNLQPWSV PEVLQPLYES  900
SKVVAQKMTI AALRYIRQIA QETSGEVVYG LGRQPAVLRT FSQRLSRGFN HAVNGFNDDG  960
WSVVNCDGAE DIIISVNSTK NLSGTSNLAT PLTSLGGILC AKASMLLQNV PPAALIRFLR  1020
EHRSEWADFN IDAYSAASLK SGSYTYPGTR PTSFTGNQII MPLGHTIEHE EMLEVVRLEG  1080
HSLAQEDAFV SRDIHLLQIC SGIDENSVGP CSELIFAPID EMFPDDAPLV PSGFRIIPLD  1140
SKPGDKKDAT TGNRTLDLTS GLEVGLATSH AAGDASSCYT NRSVLTIAFQ FPFDSSLQDN  1200
VAGMALQYVR SVISSVQRVA MAISPSGIDP AAGLDMMETT LVALQDITLD KIFDESGRKA  1260
LFSDFAKIMQ QCKFLLHNNI SNVVAYDNDS ASKLNSFFYM YRCINLLLNF QVYNLTFPFD  1320
L
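The listing above prints the protein in groups of ten residues, with a running position counter at the end of each full line. To work with it programmatically, the groups have to be joined and the counters dropped. The following is a minimal sketch of that cleanup. The function name `parse_blocked_sequence` is hypothetical, and the sample is the first three lines of this record.

```python
SAMPLE_BLOCK = """\
MSSESHSDQF HLPTLTSSLF DPEEDSSIHD PNGVVHGSSS AHSQPSDDVS IRAHTASSQP  60
STPSSSGYVA EIEDVALQNE IQELSIHGDD QYHNALAYSN HDELIGKPHR SDEDDASISW  120
RKRKKHFFVL SHSGKPIYSR YGDEHKLAGF SATLQAIISF VENGDFSKGQ TFFHPVFRGD  180
"""


def parse_blocked_sequence(text: str) -> str:
    """Join residue groups into one string and validate the position counters."""
    residues = []
    length_so_far = 0
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running counter: must equal the residues read so far.
                if int(token) != length_so_far:
                    raise ValueError(
                        f"counter {token} does not match length {length_so_far}"
                    )
                continue
            if not token.isalpha():
                raise ValueError(f"unexpected token: {token!r}")
            residues.append(token.upper())
            length_so_far += len(token)
    return "".join(residues)


sequence = parse_blocked_sequence(SAMPLE_BLOCK)
print(len(sequence))
print(sequence[120:125])  # residues 121-125, the NLS reported for this record
```

Checking each counter against the running length catches dropped or duplicated groups, which are easy to introduce when copying the listing by hand.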
Nuclear Localization Signal
NLS
No. Start End Sequence
1 121 125 RKRKK
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT5G60690.1 0.0 HD-ZIP family protein