PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028124821.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 542aa    MW: 58790.4 Da    PI: 7.9025
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028124821.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR45.81.2e-1497136242
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42 
                     ++k+C+C++s+Clk+YCeCfaag +C+  C+C++C+N+ e+
  XP_028124821.1  97 KQKQCHCRNSRCLKLYCECFAAGIYCDA-CNCTNCHNNVEH 136
                     89*************************9.********9876 PP

2TCR52.97.2e-17182221241
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                     ++kgCnCkks ClkkYCeCf+a++ Cse+CkC dCkN ee
  XP_028124821.1 182 HNKGCNCKKSGCLKKYCECFQANVLCSENCKCMDCKNFEE 221
                     789**********************************887 PP

Sequence ? help Back to Top
Protein Sequence    Length: 542 aa     Download sequence    
MEQSATVSDF PQRKLAKQLE STAMCRASAN AILPAHPQAQ LQAKLLALAK PQSPPPPPQS  60
KRQRRLRAAH PLPVATMESP KSQQQNNIEV KDGTPKKQKQ CHCRNSRCLK LYCECFAAGI  120
YCDACNCTNC HNNVEHDAAR QEAVGATLER NPNAFQPKIA NSPHKSQDGR EEAREVTAMA  180
IHNKGCNCKK SGCLKKYCEC FQANVLCSEN CKCMDCKNFE ENEERKALFH GDHANFMSYI  240
QQAANAAING AIGTSHLGTL PALKKRKNQK LFFGGTASDQ SIHRITQFQQ ENRLRAPATS  300
SSLLSAPGSC TISASSMGSS KLIYRSPLAD VLQSQDVKEL CSLLVIVSAE ATKALAEKNG  360
KMDKKGDQLE SSLVSSDQEK HSKVEDNVQT SIPDDCLSGD QVDTAGNSGS DGSDAQTGRP  420
PSPSTLALMC DEQDTMVMAT GSPDAAASNG SLTNQSSQGQ IFTELYAEQE RIVLTKFWNF  480
LNRLISCGSI KEAMCSPSPK IKLGIQQEPE NCVTTTKTET SHKELDGTSI VKSVDQVVAK  540
MS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16199KRQRRLRAAHPLPVATMESPKSQQQNNIEVKDGTPKKQK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29000.11e-135CPP family protein