PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OIW16300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade; genistoids sensu lato; core genistoids; Genisteae; Lupinus
Family bHLH
Protein Properties Length: 1958aa    MW: 219636 Da    PI: 6.3698
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OIW16300genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH50.34.3e-16350396455
               HHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
       HLH   4 ahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                hn  ErrRRdriN+++  L++l+P++      K +Ka++Le+A+eY+ksLq
  OIW16300 350 VHNLSERRRRDRINEKMRALQQLIPNS-----NKTDKASMLEEAIEYLKSLQ 396
               6*************************7.....5******************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1958 aa     Download sequence    
MNNNIPSWNF ESDTFVTNQK RSIGPDQELV ELLWQNGQVV FNSQTQKKQV GNSSDMRQVQ  60
KNEGSTLRTS VPYGNSSNLS QEDETISWLQ YPLEDPLEQQ FSSNFLPEIA PFQVESYKPI  120
KQLEEENFAK IVQPSAPHAT SNSQSPNMKP SCVKEFQANP VPVPKFHVPD SSKQTNDLGR  180
TKKVPKFSGF SAPPNVSSVP ANALVGKKVS TNMSKNEGKE CSVMTVGSSH CGSNHIPQEP  240
GVSMVSSSGW ATTLSAETEA ARDYVQRTLP WSEKGKSEMV EPNLTSSSGG SGSSLGKTCS  300
LSTGNLGQKR KRTDAEESEE QSEATELKSG IGSKASQHAG SSRRNRAAEV HNLSERRRRD  360
RINEKMRALQ QLIPNSNKTD KASMLEEAIE YLKSLQLQLQ VMWMGGSMSP MMFPGIQHYM  420
SQMSMGMAAP SIQNSMQLPR VPIDQSMPQN PVLGAFNYQN QMQNSFLAEQ YARYMGCHLM  480
QSATQPMNAY RYGSQTLLQG QTMITPTTGA ANMDDPMSAK MGKPMNAYRY GSQTLLQGQT  540
MITPTTGAAN MDDPMSAKMG KCQKKISKLT NSRDCLRKAI KIQEQEINRL KKECEDERLR  600
TNTETEEKLK EYTARVSLEN QVSSLKSEIA KIQQKLDNDG VRDGNESIEG LQACLADKEK  660
EISELKELCE VEKIRAESER KNAEMEKKKV AEAQKLLEAE RSKERETSEL KEFFEAEKRR  720
AESEKKNAEK ERKKAAEAQK LLEAEKNNKV KEISQLKELL ETEKKRAESE RKSTEKEKKK  780
VAEAQKLLEA QKNKEEVISE LKELLEAEKK RAESWRNDVE KEKKKAAEAQ KLLEAQKNNK  840
EREISELMEL FEVEKERFES EKKNVEKEKK NASEARKLLE VEKNKNVEKG LQIARVEAEK  900
KMEEYRSQLG RLEKEVNETK AKLASKMYAF EEANKKFEAE KRKLLAEKRN LEMGMARANE  960
KLEGEKQKAN EERGRADSEV VNTEAQKGLA EDNWKKFMEE KGRADQMSQQ LEEDKRTIEG  1020
LKQKITELSS TRESIEMAGV TSDTVSKAES TKMKLLKSQL KLEKLRVKNA KQNFKLEASR  1080
HNILRHELGR LKIDSIQLVH RLDMLDASFS AVAESTHDYA KHDDLLYLQN SNVMRQVCNL  1140
DLSQMRSQFE NELRMQHILA LSGGNYSESI TGINSKSEPL VRGSNRTKLQ SSAVNSSSES  1200
FSDGQMMGSQ ETANNIPVTA SEKLNQEIFN ARQSLCNPFD KPVSEHHRKK RKGIHDIANL  1260
SSQNLPDLHG LFDERVDKCL EGGREMLHNP NNLQEKNDRA HKRRKKSHSE KVDMVPQMNG  1320
DGKTGREKSK AAAYQDSNVR RHTSCTAPDN LGTTLACGDM ICDAANDFDS IFFDKVADGN  1380
YMKLLELENA ADEEYFKRAM DSPLSPSLPE VLEEDMFCPR TDLFPPPSSN VINAEIISNE  1440
QTFNVYGVSS NLKNKPAQAS EHELVKLSHM STPEKSRDTQ LVEGGSGLSS KSVPDSTKLC  1500
FSFREKASVL LTLMLFNFVT VATMTFGKLW DGNLFPCMNS YAEHICTVMS DPEARILLLE  1560
NCSLQELLGL IEDFLIEGKV IVNNEVPAET LSDCDLRKNG DLDCATKFSS DVASSEQLVA  1620
GSIILASICA ATNHFGFLCE ASYDILRLCN WNSLVVLTIL HIFAYLSGEK FFVLDNFRLM  1680
ITVLKSLVMF LEGENLSVAP ASCLPSIDQL HTEFCVNAKC QFLEGAEPID IVACLLLEEI  1740
ESCWLQGIEQ GDLSDSRFTT DDHHAGQWSN QEGIQSLIST NCDVSCCLKR CMISATQPHA  1800
RKSSTFCHLG DVLSLVELVA NKMSWPWTDS KFIPQLLNML KSCVEENFVI AIMALLGQLG  1860
RIGVIAGGYG DRGVENLRCN LFAYLSRTTS MKCLSLQIAT ATALFGLLPF DPESLFHTNI  1920
SLPAYLKSVS DDAETLRKWF SGLDKDQQKL LSGILKPQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1355360ERRRRD
213021308KRRKKSH
313021330KRRKKSHSEKVDMVPQMNGDGKTGREKSK
413031331KRRKKSHSEKVDMVPQMNGDGKTGREKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G43010.23e-51bHLH family protein