PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028053539.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 933aa    MW: 101590 Da    PI: 5.1713
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028053539.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR43.37.4e-14564601340
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     + +CnCkkskClk+YC+C aag +C+e+C+C++C+N+ 
  XP_028053539.1 564 CIHCNCKKSKCLKLYCDCLAAGIYCDETCTCQECFNRL 601
                     679*********************************85 PP

2TCR46.95.5e-15651689139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks C kkYCeC++a++ Cs  C+Ce+CkN 
  XP_028053539.1 651 RHKRGCNCKKSMCSKKYCECYQANVGCSTGCRCEGCKNV 689
                     589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 933 aa     Download sequence    
MGSPEIDKTT TPTTTTTTTT TTNTNTSPSD SVTIQDSAIF SYISNLSPIK PVKAAPMAAG  60
FSGLGSPPSV FTSPHLNRHQ ETSYLKRPRC RQLCSAELSQ QDVRGKKIAI CSNEIEKSET  120
QGSSVLVPCT EKECENKGSV QGQATSPSGC VDDYLADSVE VDCANSVHSG SLSLIQSGDV  180
PKSLNDCNDL KEMIPKLDEK NDIGQYAEKA LGAFPATSEL AGQNFQEKSS FDNKPVETDT  240
KQGSSEMTPN ICPIVESDLS VDKALEEHYD LPVAQHVVAA HKEKLDCAIQ FLQESLQPIQ  300
GYGDSSMTAI QASDRHVENI ILHDPKHSGM HRRCPWFEEA HQNIMTNSPG FGSPSNIVTN  360
SRLSTSHADL EVFESSCLEV SAASSGRQLI NLTQPMISHR NSGSKSAVLK PSGIGLHLNS  420
IINAMPIACG AIGSMKSAEK GYMNVEGRKL ICQQPANTKS GSILRNVREK VSASSEDGSY  480
ETQALAAITS LASQSSQIEK PSNDPALLKQ GEYQTTPCDK GKSISKRADA VEELNRSSPK  540
KKRKKAWTTN DDDDDDDDDD DYSCIHCNCK KSKCLKLYCD CLAAGIYCDE TCTCQECFNR  600
LDYEDTVQET RQQIESRNPL AFAPKIVQHV TDSPANNNGE NGNNSTPSSA RHKRGCNCKK  660
SMCSKKYCEC YQANVGCSTG CRCEGCKNVY GRKEEYDMTK DALSKGPIHE SFENTFDEKL  720
EMVSNQEGLL QTELCNPQNL MPLTPSFQFS NHWKDVSKSQ FSTRRSLPSP ASNVTFLPPH  780
GKSQRSPENS DSHGMLLKAR KHDEDVVSCY QGLDYSNAET VDGFSLRCDE LAIGNDLSTL  840
TNPPSTTIAS PLSSKLSDWT TISRSQSCPV SGHLSSIGSD EKLPDTMEDN MTEILKDTST  900
PLDALKVMSP PCIDSKDSTS QNVNDPEDCS SKK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1540573KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
2541574KKKRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
3541545KKRKK
4542573KRKKAWTTNDDDDDDDDDDDYSCIHCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.19e-56CPP family protein