PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pav_sc0001623.1_g310.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family CPP
Protein Properties Length: 1051 aa    MW: 115102 Da    pI: 6.3254
Description CPP family protein
Gene Model
Gene Model ID              Type    Source  Coding Sequence
Pav_sc0001623.1_g310.1.mk  genome  GDR     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     51.2   2.5e-16  619    656  3          40
                        TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                +k+CnCkk+kClk+YC+Cfaag++C+e+C C++C+N +
  Pav_sc0001623.1_g310.1.mk 619 TKRCNCKKTKCLKLYCDCFAAGVYCAESCACQGCFNIT 656
                                699*********************************76 PP

2    TCR     50.1   5.5e-16  696    734  1          39
                        TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                                ++k+gCnCkks ClkkYCeC++a++ Cs+ C+C++CkN 
  Pav_sc0001623.1_g310.1.mk 696 RHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCDGCKNV 734
                                589***********************************6 PP
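
The middle line of each alignment block can be tallied to summarize how closely a domain follows the TCR profile. The sketch below assumes the HMMER-style convention, which this record does not state: a letter marks a residue identical to the profile consensus, "+" marks a conservative substitution, and a space marks a mismatch. The function name tally_midline is hypothetical, and the input is the middle line of domain 1 copied from the alignment above.

```python
from collections import Counter


def tally_midline(midline):
    """Count identical, conservative, and mismatched columns in a midline.

    Assumed convention (not stated in the record): letter = identical to the
    profile consensus, '+' = conservative substitution, anything else
    (a space in this listing) = mismatch.
    """
    counts = Counter()
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["conservative"] += 1
        else:
            counts["mismatch"] += 1
    return counts


# Middle line of the domain 1 alignment (query residues 619-656).
midline_domain1 = "+k+CnCkk+kClk+YC+Cfaag++C+e+C C++C+N +"

counts = tally_midline(midline_domain1)
print(dict(counts), "columns:", len(midline_domain1))
```

The number of columns should equal End - Start + 1 from the domain table when the alignment contains no gaps, which gives a quick consistency check on the copied line.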

Sequence
Protein Sequence    Length: 1051 aa
MDSPEIRKIK ANTTSSSDSP PVQDSPVFSY INNLSPIQPV KASHLAQGFP VLNSPPLVFT  60
SPRINSHRDT SFLKRPQYQP LSSAEKPKTQ DEAKKFLDDP VDPKITQLHM RLITDSQDRE  120
TKIRVESQQC SSSEGVDEYL ADPSEVDCVD STQSASPCLK QSNNVPETFT GSKETTLYDN  180
KHNTGTDLGT EAKAPSEQAK EDLEGKQTFD AKPVKIIEQS DGDLPYDECP NEYLADPSEV  240
DCVDSTQSAS PCLKQSNNVP ETFTGSKETT LYDKKHNTGT DLGTEAKAPS EQAKEDLEGK  300
QTFDAKPVKI IEQSDGDLPY DECPNIESGL SIDNAYKREY RQHLHDQDRG GKHQDDCDHT  360
PQSPPGRLQI VQVYENSAEN VGAISKGMIG NMILHAPKAR SEQGGMHRRC LQFEEAPPCA  420
TGERDCSLSS IQEVNNSELP SGTGESKLVK LSYADLKATS KRQMGTSLPP RYGGNSPSTV  480
PKPSGIGLHL NSIVNAAPLV RGTTRSIKLA DHYIGVQVMK SASVMSSHLP DNVRSRSISL  540
NMVEKDSAGP EDRDESETSI TASSAVPQSP HTVVFEHHGP THEKRGFDAE NVDDYEECKQ  600
SSPKKKRKKT SSTKDSDGTK RCNCKKTKCL KLYCDCFAAG VYCAESCACQ GCFNITDYED  660
TVLETRQHIE SRNPLAFAPK IVQHEEEIQF TPSSARHKRG CNCKKSMCLK KYCECYQANV  720
GCSSGCRCDG CKNVYGRKGE YIPIEHGVGK DNVSDKAGKD RIESTFDEKL EMVATKKDIL  780
SAELYNSHNL TPLAPSFQCS DHANNVPKSP CLPTSYLPSP ESDLTIISSY ENSTRSPLRH  840
SESSDILLET SKELSDLGSY NWRVDYDNIG IVDTFSPRCD AAPTTCHITP MSDLCSMAMA  900
SSTSSKTSDW TNASQVQLCP GSHGLSSDSS LHRRSSPVTP MTRLGGTKSF QGLDFENGLY  960
DILQDDTPEI LKDTSTPIRS LKVSSPNKKR VSPPHSHNHE LGASSSGALR SGRKFILKAV  1020
PSFPPLTPCI GSKGGSIIQN MSNLQDKGRK K
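
The coordinates given elsewhere in this record (signature domain Start/End, NLS Start/End) are 1-based and inclusive, so they can be checked directly against the listing above once the spaces and running position counts are removed. The following is a minimal sketch under that assumption; the helper names parse_numbered_block and subsequence are hypothetical, and only the line covering residues 601-660 is used to keep the example short.

```python
import re


def parse_numbered_block(text):
    """Join a numbered sequence listing into one residue string.

    Keeping letters only drops the spaces between 10-residue groups,
    line breaks, and the running position counts at the end of each line.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def subsequence(seq, start, end, offset=0):
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `seq` in the full
    protein, so an excerpt can be sliced with full-length coordinates.
    """
    lo = start - offset - 1
    hi = end - offset
    if lo < 0 or hi > len(seq) or lo >= hi:
        raise ValueError(f"range {start}-{end} is outside the given sequence")
    return seq[lo:hi]


# Excerpt of the listing: residues 601-660 of the 1051 aa protein.
excerpt = parse_numbered_block(
    "SSPKKKRKKT SSTKDSDGTK RCNCKKTKCL KLYCDCFAAG VYCAESCACQ GCFNITDYED  660"
)

tcr_domain_1 = subsequence(excerpt, 619, 656, offset=600)  # first TCR domain
nls = subsequence(excerpt, 605, 609, offset=600)           # predicted NLS

print(tcr_domain_1)
print(nls)
```

With the full listing pasted into parse_numbered_block, the offset argument can be left at 0 and the resulting length compared with the stated 1051 aa.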
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    605    609  KKRKK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  3e-53    CPP family protein