PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028118351.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 956 aa    MW: 103930 Da    pI: 5.6841
Description CPP family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
XP_028118351.1    genome    NCBI      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.2   2.1e-15  528    566  3          41
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     +k+CnCkk+kClk+YC+Cfaag +C+e C C++C+N+ e
  XP_028118351.1 528 CKRCNCKKTKCLKLYCDCFAAGIYCAEPCACQGCFNRPE 566
                     79**********************************876 PP

2    TCR     48.2   2.2e-15  615    653  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C+kkYCeC++a++ Cs+ C+Ce+CkN 
  XP_028118351.1 615 RHKRGCNCKKSMCVKKYCECYQANVGCSSGCRCEGCKNV 653
                     589***********************************6 PP

Sequence
Protein Sequence    Length: 956 aa
MDSPEKVKTS TSIAAATTLP SDSVAAPDST IFGYISNLSP IKPVKAAPVV PGFPSLNSPP  60
IVFTSPHLNP LCETTYLKRS QCPQLSCEEL CQQDISDNKF EIGSNELEKS KTQSSSTLIP  120
CVERECDTKG SVPDQAASPP ECIDKYLADT VEVNCENSAH SANLSLIQSD DDMPQSIKDF  180
TDSKLDDEND IRQDAEILMG AFPATPELAG QDFQGKSSFN DEPVETDAKQ GGSEMPSCEC  240
PKVESGLSVE QQCELSVAQP SQGYEDCSEI TGEASNKIAD NIVLHDPKGQ HYSGMRRRCL  300
RFEEARGNIV ENSPGSRGSS NVVTSSSLPA NLADMEVFES SSLEISATSS GRHLINLTQP  360
IISVIPPRNN GNPRSSISKP SGIGLHLNTI VNAKPMSCGV SGSMQSAEKG YLNVQDGKLV  420
SNMDCHQPES TKNSSILSNV REKLPVSSED CSHETQVSVA TSYLPSQYLQ IVKSSTDIAP  480
LKQIEYETTL CDKRKSISKH AHAVESSNLS SPKKKRKKAS ATNDGDGCKR CNCKKTKCLK  540
LYCDCFAAGI YCAEPCACQG CFNRPEYEDT VLDTRKQIEF RNPLAFAPKI VQGLTDSSAN  600
SIGEDGNYST PSSERHKRGC NCKKSMCVKK YCECYQANVG CSSGCRCEGC KNVYGKKEEY  660
GMTKDVLSEG PVNERFESTF DELEMAASRD GLLQTELCNL DNFIPPTPSF QFLDHGNCAP  720
KSRFSTRRCL PSPESNLTLP KSLETSDNNH DLLLKARKDI QGIDYYCQEL DFSNAETVNE  780
LSPRYDGLSN MCDLSTLPNP PSTAMAYSAS SKISDWTTIS RAQLCARSGH LSSVSSFHWR  840
GSPITPVAQF GGTKLLGAVE SDKKHHDIPE DDTPEILKDT STPLKAVKVT SPNKKRVSPP  900
HRRLHELGSS SSAGLRSGRK FILRAVPSFP PLTPCIDSKD SSSQNINDPL DCSSKK
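
The coordinates reported in this record (the two TCR domain hits and the NLS) are 1-based and inclusive, while the sequence block above carries spacing and running position numbers. A minimal sketch of how the block could be turned into a plain string and sliced by those coordinates follows. The helper names parse_sequence and region are hypothetical and are not part of PlantTFDB or any library.

```python
import re

# Sequence block copied from this record, keeping the 10-residue spacing
# and the running position number at the end of each full line.
RAW_BLOCK = """
MDSPEKVKTS TSIAAATTLP SDSVAAPDST IFGYISNLSP IKPVKAAPVV PGFPSLNSPP  60
IVFTSPHLNP LCETTYLKRS QCPQLSCEEL CQQDISDNKF EIGSNELEKS KTQSSSTLIP  120
CVERECDTKG SVPDQAASPP ECIDKYLADT VEVNCENSAH SANLSLIQSD DDMPQSIKDF  180
TDSKLDDEND IRQDAEILMG AFPATPELAG QDFQGKSSFN DEPVETDAKQ GGSEMPSCEC  240
PKVESGLSVE QQCELSVAQP SQGYEDCSEI TGEASNKIAD NIVLHDPKGQ HYSGMRRRCL  300
RFEEARGNIV ENSPGSRGSS NVVTSSSLPA NLADMEVFES SSLEISATSS GRHLINLTQP  360
IISVIPPRNN GNPRSSISKP SGIGLHLNTI VNAKPMSCGV SGSMQSAEKG YLNVQDGKLV  420
SNMDCHQPES TKNSSILSNV REKLPVSSED CSHETQVSVA TSYLPSQYLQ IVKSSTDIAP  480
LKQIEYETTL CDKRKSISKH AHAVESSNLS SPKKKRKKAS ATNDGDGCKR CNCKKTKCLK  540
LYCDCFAAGI YCAEPCACQG CFNRPEYEDT VLDTRKQIEF RNPLAFAPKI VQGLTDSSAN  600
SIGEDGNYST PSSERHKRGC NCKKSMCVKK YCECYQANVG CSSGCRCEGC KNVYGKKEEY  660
GMTKDVLSEG PVNERFESTF DELEMAASRD GLLQTELCNL DNFIPPTPSF QFLDHGNCAP  720
KSRFSTRRCL PSPESNLTLP KSLETSDNNH DLLLKARKDI QGIDYYCQEL DFSNAETVNE  780
LSPRYDGLSN MCDLSTLPNP PSTAMAYSAS SKISDWTTIS RAQLCARSGH LSSVSSFHWR  840
GSPITPVAQF GGTKLLGAVE SDKKHHDIPE DDTPEILKDT STPLKAVKVT SPNKKRVSPP  900
HRRLHELGSS SSAGLRSGRK FILRAVPSFP PLTPCIDSKD SSSQNINDPL DCSSKK
"""


def parse_sequence(block: str) -> str:
    """Hypothetical helper: keep only residue letters.

    Whitespace and the running position numbers are the only non-letter
    characters in the block, so removing every non-letter leaves the sequence.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Hypothetical helper: residues start..end, 1-based and inclusive."""
    return seq[start - 1:end]


SEQ = parse_sequence(RAW_BLOCK)

if __name__ == "__main__":
    print("length:", len(SEQ))
    print("TCR hit 1 (528-566):", region(SEQ, 528, 566))
    print("TCR hit 2 (615-653):", region(SEQ, 615, 653))
    print("NLS (514-518):", region(SEQ, 514, 518))
```

Comparing the printed slices with the alignment rows and the NLS entry in this record is a quick way to confirm that the coordinates are being read as 1-based and inclusive.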
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    514    518  KKRKK
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G14770.1    2e-57      CPP family protein