PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP004165.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family MYB
Protein Properties Length: 2671aa    MW: 305308 Da    PI: 9.9617
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP004165.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding44.63.3e-1423602407148
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                       +g W++eEde+l ++    G g+W+ Iar  g+ R++k+c++rw +yl
      PCP004165.1 2360 KGLWSPEEDEKLMRYMLTNGQGCWSDIARNAGLQRCGKSCRLRWINYL 2407
                       678*******************************************97 PP

2Myb_DNA-binding46.68.1e-1524132456146
                       TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                       rg+++++E+el+++ +  lG++ W+ Ia++++ gRt++++k++w++
      PCP004165.1 2413 RGAFSPQEEELIIHFHSILGNR-WSQIAARLP-GRTDNEIKNFWNS 2456
                       89********************.*********.***********96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2671 aa     Download sequence    
MRRKRRVPEV LRRLFGKRAR TLSATIVSLL PPHSASSVPD DCRFCKGRRC LSCTAPDGLS  60
FLLRPDDPSD YRHLLNHGYV VFAKNTPAVT RFSPDSHWSQ IEIVRAVIEA MMVEQPVSSN  120
VIYSGYDKSN QSSGIVELLT SSAWCLLLER VGDGIMVYLL RNASIFFPLL RTNHQQVTGP  180
PISKLCPKKL KCAPQPQWQQ SLPESYVPSL PIQSTPLSVV PIMRKKMRRK CRVPEVLRRL  240
FGKRARTLSD TIVSLLPPHY SSSVPDDCRF CKGRRCLSCT APDGLSFLLR PDDPSDYRHL  300
LNHGYVAFAK NTPAVTRFSP DSHWSQIEIV RAVIEAMMVE QPVSSNVICS GYDKSNQSSG  360
IVELLTSSAW CLLLERVGDG IMVYLLRNAS IFLPLLRTNH QQVTGLPISK LCPKKLKRAP  420
QPLWQQSLPE SYGPRKKRER EDNVHSLLKR QHLSSFSSDK TVGSAACCGC CSVCKDKHCH  480
NGTSTFTNIY EGDLNKELEQ GSQRLKKRAR PFSWQRRKKR RLLPSQETNF QDPSKTIIGD  540
KESLSSRLSY SSVHHHKKCS CLGLRIPRKV AKGAQIDRKF IFFNLEQSSS VFPRKRPRKK  600
REREDNVHSL LKRQHLSSFS SDKTIGSAAC SGCCSVCKDK HSHNGTSTFT DRYEGDLNKE  660
LEQGSQRLKK RARPFSWQRR KKHRLLPSQE TSFQDLSKTI IGDKESLSSR LSYCSVHHHK  720
KCSCLGLRIP RKVAKGAQID RKFMFFNLEQ SSSVFPRKHV LNSLKPNSVG SEFLVKSIFG  780
ISDTEGAMSK ICPRGSGLCL MGSACLYHSL VKLLKTLIRR AQHCHSLRLL DKHCGVSSPD  840
ATDSHSEAIK SYCLKSQVVS FMWAVCKRII PSDLLGTASN WRILRRNISK FIHLRRFENF  900
SLRQCMHKLK TSRFPFLSNK QYFCSMNNEA LKDMGGKGLD IHKGFPRLND AAHIVKQKVL  960
KSWIYWFFSS IIVPLLQANF YITESEHGRQ DVYYYRKSVW EKVKNKTISC MKDQRYYCLD  1020
DATTRRIIRK RLFGFSKLRI CPKECGVRLL ANLKASSKMP RKEFSLAEQS SGIVRRKKSL  1080
QKRVKFEYFK SVNSVLCDTH AVLKAVRLKE PEKLGSSVFD YNDVYRKLCP FVMGLKNGPT  1140
MMPDVFIVVS DVSKAYDTVD QDKLLCVMKD AIRTDEYFLK HSYEVLCTKE FLWVHENPAL  1200
LDQHTSLRFK SSALHRSLQS VLVNQICPRE SGLCLTGSAC LYHSLVKLLK TLIRRAQHCH  1260
SLRLLDKHCG VSSPDATDSH SEAIKSYCLK SQVVSFIWAV CRRIIPSDLL GTASNWRILR  1320
RNISKFINLR RFENFSLQQC MHKLKTSRFP FLSNKQYFCG MNSEAVKDMG GKGLDIHKGF  1380
PRLNDAAHIV KQKVLKSWIY WFFSSIIVPL LQANFYITES EHGRQDVYYY RKSVWEKVKN  1440
KTISCMKDQR YYCLDDATTR RIIRKRLFGF SKLRICPKEC GVRLLANLKA SSKMPRKEFS  1500
LAEQSSGIVR RKKSLQKGVK FEYFKSVNSV LRDTHAVLKA IRLKEPEKLG SSVFDYNDVY  1560
RKLCPFVMGL KNVPTMMPDV FIVVSDVSKA YDTVDQDKLL CVMKDAIRTD EYFLKHSYEV  1620
LCTKEFLWVH ENPALLDQHA SLRFKSSALH RSLQSVHFNQ EYSRSMKKEE LFFNLYQHVK  1680
RNVLQLDKKF YLQGVGIPQG SVLSSLLCSL YYGHLDRNVI FPFLEKIWEP ATIDSSRGHN  1740
FGDASAAQSA NEDGIASSST YILLRFIDDF LFISTSRNQA AGFFTRLQRG FRDYNCYMNE  1800
KKFCVNFDIQ HMPGIPSSRV YLGEDGISFI RWSGLLLNSC TLEVQADYTK YWNNHLRSTL  1860
TVSWQDQPGR HLKKKLCDYM KPKCHPIFFD SNINSASVVR LNIYQAFLLC AMKFHCYVRD  1920
LSYVWKLRPR SYANIIKRSL RYMHVLIKRR MRSVHADFHP ILQLEKGEVE WLGLYAYIQV  1980
LKRKQSRHKE LISLLTSKLL KHTISGSVSS QLSSPSSQLQ YLNSSLFLLY LPMSGTALEQ  2040
GRSIQEFFYH AREEGSEDSA VIALCFSCFT ETGLVLLNFA LESIISCPFY CVDRIHSSDY  2100
IFHKVCYVEV PLVLFGGSGN YASALYIAAV KANALEKVES EILAIVEAMK KSPTFSQFTR  2160
DLSVPADTRV KAIDQIAAEA KFSEITKNFL VVLAQNGRLR NLETISKRFG ELTMAHKGEV  2220
KAIVTSVIPL PAEEEKELKE TLQELIGQGK KVILEQKIDP SILGGLVIEF DKKVFDMSIK  2280
TRARQMERFL WPVGQVQGVR VNIMDDEISI HHINYFNQKN AKKASPLAIT MRKPESIGKD  2340
DVDGAKTINK KKMMMSKLRK GLWSPEEDEK LMRYMLTNGQ GCWSDIARNA GLQRCGKSCR  2400
LRWINYLRPD LKRGAFSPQE EELIIHFHSI LGNRWSQIAA RLPGRTDNEI KNFWNSALKK  2460
RLKNNNMSTS TSSPNDSDSS DPRDPVVGGT FMPMHDHDMM TMYNMDSSSS STTSMQAMAV  2520
NSNTLFNPFY MLDNRYDMAH ADDVVVNPTC FPHLPNTGDN QGHYGDYGNL EGGHKMGLEG  2580
DLFLPPLESR SIKNNLNGQL VADNISKKNS SISNYHFHNS CFNNTELIGF KVDEEEDMFG  2640
FGNINGGQGE SVRVGEWDFE GFMLDTSSFL S
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1506519KKRARPFSWQRRKK
2506521KKRARPFSWQRRKKRR
3515521QRRKKRR
4669682KKRARPFSWQRRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G08500.14e-66MYB family protein