PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP012295.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family bHLH
Protein Properties Length: 1812 aa    MW: 197265 Da    pI: 6.9575
Description bHLH family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
PCP012295.1    genome  GDR     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    HLH     45.9   1e-14    846    892  4          55
                  HHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH   4 ahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                   hn+ E+rRR+riN+++  L+ l+P++      K +Ka++L +++eY+k+Lq
  PCP012295.1 846 VHNMSEKRRRSRINEKMKALQNLIPNS-----NKTDKASMLDEVIEYLKQLQ 892
                  6*************************7.....5******************9 PP

2    HLH     38     2.9e-12  1083   1129 5          55
                   HHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH    5 hnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55  
                   h   Er RR++++s f++L  llP++      K +K++i+ +A++YIksL+
  PCP012295.1 1083 HTWTERGRRKKMRSMFSNLHALLPQL----PAKADKSTIVDEAIRYIKSLE 1129
                   6668*********************6....39*****************96 PP

Sequence
Protein Sequence    Length: 1812 aa
MQGQRGTIGS LPEMLDFDHG SASNDAALDQ QICWNSTRNP AERIPDYVQS PGDMNVVNSM  60
GQERQHLSRW CLGEPSSSNS QSEVSRDERK PELGWSSSVD PCVEAASRLE ERRYEPTNNL  120
SLGNNVNTIF TQSSSSDAIP QNLNLNAAFA GHGGDNSQVM ECPNTFKYNV SENEQIRPPS  180
GSNPFMLPSG TAGFLMGEND GRSGSSLEGR RVSCKRKAME GNIGQSSLSG SCSYFQHTDS  240
GGRTSVPAHY NAGGSLSIST PSEQGNSRLG VVGRALASDS LPDSNAGRSS EGTHRNFRVR  300
INPSNQQNSI PSNRFPSGRA ARHSSISSSH HSPSLLPVDH NMDLRPAPAL DNTSSQNPPV  360
VIHVPALPQN VQSLRWSGGS SSRTGSLSNS LVFGDRDAPQ PPPLEEGSSR SMARHILDHP  420
VFVPATELRN SARHPAGASN RNLPGGNASI PGNVASASRT GTSSGVNPAA APTWVPPHNS  480
HPQFPRRLSE YVRRSLFSAS GSEPGSHGTN YLPMRSGPAS SPEVGSSSGT GNQGHHQSHP  540
RSASWMERHG DGGLGLPYSL RTLAAAGGEG SSRLVSEICN VLGLMRRGEN LRFEDVMILD  600
QSVLFGVADI HDRHRDMRLD VDNMSYEELL ALEERIGNVN TGLSEETISK RLKHKKYVAV  660
GSPADTEPCC VCQVFFAIPW CCSIGLFKAP DSEEISNILS QLIHGHGSSA SSCMPFKPTY  720
MHSSVPPPIQ AATPSEVLIP EARHEDHRRF AQLVNQSGTD QRVARGNSNS AGVLDSSTAF  780
DFYDSGGYFT KEVKEGMESD GRRISSENDL GGYSCDSEKD LEEAADGQLN SAPPRSSSKR  840
SRAAEVHNMS EKRRRSRINE KMKALQNLIP NSNKTDKASM LDEVIEYLKQ LQLQVQMLTM  900
RNGLSLHPMC LPGVMQPMQL PHIFEEGSNK FPKSGKGISP FYGTHENSVL SAFNLSAGCM  960
ISNPPMVLPS VANVATSEAT FGFEPSIQAH YRPFSVPSSS KHSTYFAYTL YQSIVKLSCQ  1020
KTCGGGGGGS GSGELLKLEK VSSSCPTPTN TMEQKEVMIK ESNGKVGGRD RDGGGDESDH  1080
GVHTWTERGR RKKMRSMFSN LHALLPQLPA KADKSTIVDE AIRYIKSLEH TLQTTQTQGL  1140
DKFSALTVSN TELTCDTREA FLADHFHQGP PTSNNLALPA SLFPPPASFQ TWFSPNFVMN  1200
MSGNDAQISV CSPPKPGLLT TMFYILEKHK LDVVSAHVSS DRCHCMYMIH AHAGGACDNF  1260
PEALSVEYIF KLAAGGGVEL ESTMSGYSFL NDQLSKRTSI FGLHLWVVLG ICVGAAIVLV  1320
LFLISLWFTS RRNSASSKAK TFTHNSSTIP NVSKEIQEIR IDHARNHPSQ PDPKPTNHQS  1380
HPDPVADSDS SGARPQPLLL QQDDHHESPA GSAGRQRIHI EIGKDHRILY PEKGGGGGGS  1440
SHGSGEVRSG DQGMIAAPEV SHLGWGHWYT LRELEDSTNG FADENVIGEG GYGIVYRGVL  1500
EDNTIVAVKN LLNNKGQAEK EFKVEVEAIG RVRHKNLVRL LGYCAEGAHR MLVYEYVDNG  1560
NLEQWLHGDV GPTSPLVWES RMNIILGTAK GLTYLHEGLE PKVVHRDIKS SNILLDKQWN  1620
SKVSDFGLAK LLGSERSYVT TRVMGTFGYV APEYASTGML NERSDVYSFG ILIMEIISGR  1680
NPVDYSRAPG EVNLVEWLKA MVTNRNAEGV LDPRLPEKPS SRALKRALLV ALRCVDPNAQ  1740
KRPKMGHVVH MLEADEFPFR DESRAGREHG RSPRDAGRMD KRVIESGDSS GYESSAQTNM  1800
SLWRKQEVEE EH
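
The sequence above is printed in blocks of ten residues, with the position of the last residue at the end of each full line. The Start/End values in the Signature Domain and NLS tables appear to be 1-based and inclusive, since they match the coordinates printed beside the alignments. A minimal Python sketch, assuming that convention, for pulling a region out of one display line; the helper names `parse_block_line` and `slice_region` are hypothetical and not part of any PlantTFDB tool:

```python
def parse_block_line(line):
    """Split one display line into (residues, end_position).

    The trailing integer, when present, is the 1-based position of the
    last residue on that line. The final sequence line has no counter,
    so end_position is None in that case.
    """
    tokens = line.split()
    if tokens and tokens[-1].isdigit():
        return "".join(tokens[:-1]), int(tokens[-1])
    return "".join(tokens), None


def slice_region(line, start, end):
    """Return residues start..end (1-based, inclusive) from one display line."""
    residues, line_end = parse_block_line(line)
    if line_end is None:
        raise ValueError("line carries no position counter")
    line_start = line_end - len(residues) + 1
    if start < line_start or end > line_end or start > end:
        raise ValueError("region is not fully contained in this line")
    return residues[start - line_start:end - line_start + 1]


# Display lines copied from the record (positions 841-900 and 1021-1080).
LINE_841_900 = ("SRAAEVHNMS EKRRRSRINE KMKALQNLIP NSNKTDKASM "
                "LDEVIEYLKQ LQLQVQMLTM  900")
LINE_1021_1080 = ("KTCGGGGGGS GSGELLKLEK VSSSCPTPTN TMEQKEVMIK "
                  "ESNGKVGGRD RDGGGDESDH  1080")

hlh_domain_1 = slice_region(LINE_841_900, 846, 892)   # first HLH hit
nls = slice_region(LINE_1021_1080, 1067, 1075)        # predicted NLS

print(hlh_domain_1)
print(nls)
```

The first slice should agree with the ungapped PCP012295.1 row of the first HLH alignment, and the second with the sequence listed in the NLS table. A region that spans two display lines (such as the second HLH hit, 1083-1129) would need the lines concatenated first.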
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1067   1075  GGRDRDGGG
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G36930.1  2e-40    bHLH family protein