PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP003675.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family bHLH
Protein Properties Length: 1210aa    MW: 136442 Da    PI: 8.7098
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP003675.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH26.41.2e-08155203455
                  HHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH   4 ahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                  +h+ +Er RR +iN+++  L+ ++P +   ++k +  a +L + ++Y++sLq
  PCP003675.1 155 SHSLAERVRRGKINERLRCLQNIVPGC---SNKTMGMAVMLDEIINYVQSLQ 203
                  8**************************...9999*****************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1210 aa     Download sequence    
MSEFTEDFQS IKPSFPFLDI DPNMELVNQF ADQFNIPSVM EYPSLNFHTY MPFSNDSYLF  60
SNQEPEFPAG NMVENFPKLE VHQIVADGNE SKKRIAVDDA SQSSSGISTP PVSETGITRK  120
NSAGRGKRVK SNEKEDEKPK EVVHVRARRG QATDSHSLAE RVRRGKINER LRCLQNIVPG  180
CSNKTMGMAV MLDEIINYVQ SLQNQIEFLS MRLAAASSFY DFNTSTDVTE TMQIAAELER  240
LKREAAGYGG VACLALYLFS SSFDHGCAYE KWETTALPGC TNDDFCLALM AATSNSHVAV  300
AAYLLPQLPH DFHANKSIGI LKPLSCSHCL PLYRECVKAQ GSRLVSLRLR GRLVEAVRRL  360
GAGVFDGVSQ ALLFWSIVFL KDHSTSKLRM KERRGKREDV EIGDGRERGK ERKKARKMWE  420
GWGKDGGRWG KRKKQRVEKE KKEGRMARMR FWFGACNLGW VLRNVAEIWR ISFAKAEIAE  480
KAAVFLNVHF VVEKCNIVKS GSSVKFKNCD IYEGSFKWLL GNRSPYDEEL EELERSPSAR  540
TNWVPELSPI ANIVVRRCSK ILGVPTTELR EGFNSEASES IKHPSCYARN FLEYCCFRAL  600
ALSTQVTGHL ADKKFRRLTY DMMIAWEAPA AASQPLLNLD EDLSVGVEAF SRIAPAVPII  660
ANVITSENIF KVLASSSDGR LQFFTYDKYL SGLERAIRKM RSQSESSLLS AMRSSKREKI  720
LEVDGTVTTQ PVLEHVGIST WPGRLILTDH ALYFEALRVV SYDKAKQYDL SDDLKQVVKP  780
ELTGPWGTRL FDKAVFYKSV SLSEPAIIEF PELKGHTRRD YWLAIIREIL YVHRFINKYQ  840
IKGIKKDEAL SKAVLGILRL QAIKEISSQT DLRYEGLLMF NLCDQLPGGD LILETMADMS  900
TFKEFDRSSK SKLGGGMYSI SALDMISNLG FSFGTSSSSP VEAGLAVGEI TVGEVTLLEK  960
AVKESKNNYE KVALAQATVD GVKVEGIDTN FAVMKELLFP MMELWTCLLS LALWEDPLKS  1020
LLFCCVFTYI ICRVSFHTTP VGFLSRLTQA ALFRGWLSYA FALTLIFIAV FMVLTRYFSE  1080
GKSTHEVKVL APPAMNTMEQ LLAVQNAISQ AEGIIQDGNV VLLKLRALLL SLFPQASERF  1140
AIALVVMALA LAFLPVKYIV LYIFLETFTC YSPVRRSSTE RWTRRLREWW FSIPAAPVIL  1200
EREKEEKKKK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1412434RKKARKMWEGWGKDGGRWGKRKK
2431442KRKKQRVEKEKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73830.12e-51bHLH family protein