PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_028118353.1
Organism Camellia sinensis
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 952 aa    MW: 103460 Da    pI: 5.4309
Description CPP family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_028118353.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.2   2.1e-15  524    562  3          41
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     +k+CnCkk+kClk+YC+Cfaag +C+e C C++C+N+ e
  XP_028118353.1 524 CKRCNCKKTKCLKLYCDCFAAGIYCAEPCACQGCFNRPE 562
                     79**********************************876 PP

2    TCR     48.2   2.2e-15  611    649  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C+kkYCeC++a++ Cs+ C+Ce+CkN 
  XP_028118353.1 611 RHKRGCNCKKSMCVKKYCECYQANVGCSSGCRCEGCKNV 649
                     589***********************************6 PP
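
The Start and End values listed for each domain are 1-based, inclusive positions on the protein, as the alignment rows show (residues 524 to 562 for the first TCR hit). A minimal Python sketch of how such coordinates map onto a string slice follows. The helper name `slice_region` and its `fragment_start` parameter are illustrative assumptions, not part of PlantTFDB; the input is residues 481 to 570 copied from the Sequence section.

```python
# Residues 481-570 of XP_028118353.1, copied from the Sequence section.
FRAGMENT_START = 481
FRAGMENT = (
    "EYETTLCDKRKSISKHAHAVESSNLSSPKKKRKKASATNDGDGCKRCNCKKTKCLKLYCD"
    "CFAAGIYCAEPCACQGCFNRPEYEDTVLDT"
)


def slice_region(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start (hypothetical helper)."""
    if start < fragment_start or end < start:
        raise ValueError("region is outside the fragment or inverted")
    lo = start - fragment_start        # 1-based position -> 0-based offset
    hi = end - fragment_start + 1      # inclusive end -> exclusive end
    if hi > len(fragment):
        raise ValueError("region extends past the fragment")
    return fragment[lo:hi]


# First TCR domain hit: Start 524, End 562.
tcr1 = slice_region(FRAGMENT, FRAGMENT_START, 524, 562)
print(tcr1, len(tcr1))
```

The slice should reproduce the query row of the first alignment, and its length equals End - Start + 1.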

Sequence
Protein Sequence    Length: 952 aa
MDSPEKVKTS TSIAAATTLP SDSVAAPDST IFGYISNLSP IKPVKAAPVV PGFPSLNSPP  60
IVFTSPHLNP LCETTYLKRS QCPQLSCEEL CQQDISDNKF EIGSNELEKS KTQSSSTLIP  120
CVERECDTKG SVPDQAASPP ECIDKYLADT VEVNCENSAH SANLSLIQSD DDMPQSIKDF  180
TDSKLDDEND IRQDAEILMG AFPATPELAG QDFQGKSSFN DEPVETDAKQ GGSEMPSCEC  240
PKVESGLSVE QQCELSVAQH AGAVHEDELG CASQFLPESL QPSQGYEDCS EITGEASNKI  300
ADNIVLHDPK GQHYSGMRRR CLRFEEARGN IVENSPGSRG SSNVVTSSSL PANLADMEVF  360
ESSSLEISAT SSGRHLINLT QPIISVIPPR NNGNPRSSIS KPSGIGLHLN TIVNAKPMSC  420
GVSGSMQSAE KGYLNVQDGK LVSNMDCHQP ESTKNSSILS NVREKLPVSS EDCSHETQQI  480
EYETTLCDKR KSISKHAHAV ESSNLSSPKK KRKKASATND GDGCKRCNCK KTKCLKLYCD  540
CFAAGIYCAE PCACQGCFNR PEYEDTVLDT RKQIEFRNPL AFAPKIVQGL TDSSANSIGE  600
DGNYSTPSSE RHKRGCNCKK SMCVKKYCEC YQANVGCSSG CRCEGCKNVY GKKEEYGMTK  660
DVLSEGPVNE RFESTFDELE MAASRDGLLQ TELCNLDNFI PPTPSFQFLD HGNCAPKSRF  720
STRRCLPSPE SNLTLPKSLE TSDNNHDLLL KARKDIQGID YYCQELDFSN AETVNELSPR  780
YDGLSNMCDL STLPNPPSTA MAYSASSKIS DWTTISRAQL CARSGHLSSV SSFHWRGSPI  840
TPVAQFGGTK LLGAVESDKK HHDIPEDDTP EILKDTSTPL KAVKVTSPNK KRVSPPHRRL  900
HELGSSSSAG LRSGRKFILR AVPSFPPLTP CIDSKDSSSQ NINDPLDCSS KK
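
The listing above groups residues in blocks of ten and ends each full line with a running residue count. A minimal Python sketch for turning such a listing back into one contiguous string is given here, using only the last two lines of the listing as input. The function name `parse_sequence_block` and its `first_position` parameter are hypothetical, and the line format is inferred from this page, not from a documented PlantTFDB specification.

```python
def parse_sequence_block(text, first_position=1):
    """Join whitespace-separated residue groups into one string.

    Purely numeric tokens are treated as running residue counts and are
    checked against the number of residues read so far (assumed format).
    """
    residues = []
    total = first_position - 1  # residues that precede this block
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                if int(token) != total:
                    raise ValueError(
                        f"running count {token} does not match {total}"
                    )
            else:
                residues.append(token)
                total += len(token)
    return "".join(residues)


# Last two lines of the listing (residues 841 onward).
SAMPLE = """\
TPVAQFGGTK LLGAVESDKK HHDIPEDDTP EILKDTSTPL KAVKVTSPNK KRVSPPHRRL  900
HELGSSSSAG LRSGRKFILR AVPSFPPLTP CIDSKDSSSQ NINDPLDCSS KK
"""

tail = parse_sequence_block(SAMPLE, first_position=841)
print(len(tail), tail[-6:])
```

Checking the running counts while parsing is a cheap way to catch a dropped or duplicated residue group when the listing is copied by hand.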
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    510    514  KKRKK
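
As a consistency check on the coordinates listed above, the motif can be located in the protein sequence and its offset converted to 1-based numbering. The Python sketch below uses residues 501 to 520 copied from the Sequence section. `find_motif` is an illustrative helper; a plain substring search only confirms the listed position and is not claimed to be the method PlantTFDB uses to predict the signal.

```python
# Residues 501-520 of XP_028118353.1, copied from the Sequence section.
FRAGMENT_START = 501
FRAGMENT = "ESSNLSSPKKKRKKASATND"


def find_motif(fragment, fragment_start, motif):
    """Return the (start, end) of the first motif match, 1-based inclusive,
    or None if the motif is absent (hypothetical helper)."""
    idx = fragment.find(motif)
    if idx == -1:
        return None
    start = fragment_start + idx
    return start, start + len(motif) - 1


nls_span = find_motif(FRAGMENT, FRAGMENT_START, "KKRKK")
print(nls_span)
```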
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT4G14770.1   2e-57    CPP family protein