PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028118350.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 978 aa    MW: 106251 Da    pI: 5.507
Description CPP family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_028118350.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.2   2.2e-15  550    588  3          41
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     +k+CnCkk+kClk+YC+Cfaag +C+e C C++C+N+ e
  XP_028118350.1 550 CKRCNCKKTKCLKLYCDCFAAGIYCAEPCACQGCFNRPE 588
                     79**********************************876 PP

2    TCR     48.1   2.2e-15  637    675  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C+kkYCeC++a++ Cs+ C+Ce+CkN 
  XP_028118350.1 637 RHKRGCNCKKSMCVKKYCECYQANVGCSSGCRCEGCKNV 675
                     589***********************************6 PP
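
The two alignments above can be summarised by counting the columns in which the protein carries the same residue as the TCR consensus. The sketch below is only an illustration: the function name `identity_to_consensus` is hypothetical, the comparison is case-insensitive, and the result is a plain identity count, not the HMM score or E-value reported in the table.

```python
def identity_to_consensus(consensus: str, target: str) -> tuple[int, int]:
    """Return (identical columns, compared columns) for two aligned strings.

    Columns holding a gap character ('.' or '-') on either side are skipped.
    Letters are compared case-insensitively.
    """
    if len(consensus) != len(target):
        raise ValueError("aligned strings must have the same length")
    gap_chars = ".-"
    compared = [
        (c, t)
        for c, t in zip(consensus, target)
        if c not in gap_chars and t not in gap_chars
    ]
    identical = sum(c.upper() == t.upper() for c, t in compared)
    return identical, len(compared)


# Consensus and protein rows copied from the two alignments above.
domain1 = identity_to_consensus(
    "kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee",
    "CKRCNCKKTKCLKLYCDCFAAGIYCAEPCACQGCFNRPE",
)
domain2 = identity_to_consensus(
    "kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk",
    "RHKRGCNCKKSMCVKKYCECYQANVGCSSGCRCEGCKNV",
)

for name, (identical, compared) in (("TCR 1", domain1), ("TCR 2", domain2)):
    print(f"{name}: {identical}/{compared} identical to consensus")
```

The identical columns correspond to the letters on the middle line of each alignment; the `+` signs there mark similar but non-identical residues and are not counted here.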

Sequence
Protein Sequence    Length: 978 aa
MDSPEKVKTS TSIAAATTLP SDSVAAPDST IFGYISNLSP IKPVKAAPVV PGFPSLNSPP  60
IVFTSPHLNP LCETTYLKRS QCPQLSCEEL CQQDISDNKF EIGSNELEKS KTQSSSTLIP  120
CVERECDTKG SVPDQAASPP ECIDKYLADT VEVNCENSAH SANLSLIQSD DDMPQSIKDF  180
TDSKLDDEND IRQDAEILMG AFPATPELAG QDFQGKSSFN DEPVETDAKQ GGSEMPSCEC  240
PKVESGLSVE QQCELSVAQH AGAVHEDELG CASQFLPESL QPSQGYEDCS EITGEASNKI  300
ADNIVLHDPK GQHYSGMRRR CLRFEEARGN IVENSPGSRG SSNVVTSSSL PANLADMEVF  360
ESSSLEISAT SSGRHLINLT QPIISVIPPR NNGNPRSSIS KPSGIGLHLN TIVNAKPMSC  420
GVSGSMQSAE KGYLNVQDGK LVSNMDCHQP ESTKNSSILS NVREKLPVSS EDCSHETQVS  480
VATSYLPSQY LQIVKSSTDI APLKQIEYET TLCDKRKSIS KHAHAVESSN LSSPKKKRKK  540
ASATNDGDGC KRCNCKKTKC LKLYCDCFAA GIYCAEPCAC QGCFNRPEYE DTVLDTRKQI  600
EFRNPLAFAP KIVQGLTDSS ANSIGEDGNY STPSSERHKR GCNCKKSMCV KKYCECYQAN  660
VGCSSGCRCE GCKNVYGKKE EYGMTKDVLS EGPVNERFES TFDELEMAAS RDGLLQTELC  720
NLDNFIPPTP SFQFLDHGNC APKSRFSTRR CLPSPESNLT LPKSLETSDN NHDLLLKARK  780
DIQGIDYYCQ ELDFSNAETV NELSPRYDGL SNMCDLSTLP NPPSTAMAYS ASSKISDWTT  840
ISRAQLCARS GHLSSVSSFH WRGSPITPVA QFGGTKLLGA VESDKKHHDI PEDDTPEILK  900
DTSTPLKAVK VTSPNKKRVS PPHRRLHELG SSSSAGLRSG RKFILRAVPS FPPLTPCIDS  960
KDSSSQNIND PLDCSSKK
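
The Start and End columns of the Signature Domain table are 1-based, inclusive positions in this sequence. As a minimal sketch of how such coordinates map onto the formatted block, the code below strips the spacing and running counts from three of the lines above (residues 541 to 720) and slices out both TCR domains. The helper names `clean_sequence` and `slice_region` are hypothetical and are not part of PlantTFDB.

```python
import re

# Three lines copied from the sequence block above (residues 541-720).
# The trailing number on each line is the running residue count.
BLOCK = """\
ASATNDGDGC KRCNCKKTKC LKLYCDCFAA GIYCAEPCAC QGCFNRPEYE DTVLDTRKQI  600
EFRNPLAFAP KIVQGLTDSS ANSIGEDGNY STPSSERHKR GCNCKKSMCV KKYCECYQAN  660
VGCSSGCRCE GCKNVYGKKE EYGMTKDVLS EGPVNERFES TFDELEMAAS RDGLLQTELC  720
"""
FIRST_RESIDUE = 541  # position of the first residue in BLOCK


def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks and running counts; keep only residue letters."""
    return "".join(re.findall(r"[A-Za-z]", block)).upper()


def slice_region(seq: str, start: int, end: int, first: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `first` is the sequence position of seq[0], so a fragment of a longer
    protein can be sliced with the coordinates of the full-length protein.
    """
    if not (first <= start <= end < first + len(seq)):
        raise ValueError("region lies outside the supplied sequence")
    return seq[start - first : end - first + 1]


fragment = clean_sequence(BLOCK)
tcr1 = slice_region(fragment, 550, 588, first=FIRST_RESIDUE)
tcr2 = slice_region(fragment, 637, 675, first=FIRST_RESIDUE)
print(tcr1)
print(tcr2)
```

With the full 978-residue sequence, `first` can be left at its default of 1. The two slices should reproduce the protein rows shown in the domain alignments.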
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    536    540  KKRKK
Best hit in Arabidopsis thaliana
Hit ID         E-value   Description
AT4G14770.1    2e-57     CPP family protein