PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028108166.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family CPP
Protein Properties Length: 813aa    MW: 88231 Da    PI: 5.4798
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028108166.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.77.1e-16520557340
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     +k+CnCkkskClk+YCeCfaag++C e C+C+dC Nk 
  XP_028108166.1 520 CKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQDCYNKP 557
                     89**********************************96 PP

2TCR512.8e-16604642139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks ClkkYCeC++ g+ Cs +C+Ce+CkN 
  XP_028108166.1 604 RHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNA 642
                     589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 813 aa     Download sequence    
MGEEGEEGKE EKEGEKLERE SESEECGRGG GGGGGPPMDT PDRSRIATPL YKFEDSPVFN  60
YINNLSPFET VKPIRISQPF NSVNFASLPS IFTSPHVSSL KGSRFLRRHQ FSDPSKPEFS  120
SDNRNKVGSN DGVIDDAQDS DELQESFDPG SFIGEGPVES TYECSKLANV ASRTLNYDYG  180
SPDCRTMPCH GIDTNCVSKL AGTSGSVPFV QEVSQTGFVG SGVHLDGIRN IDHYKDWATM  240
ISDANDLLIF HSPDDADAFK GTLQKPLDSE TRLCASLGSE FTHDEVSDLQ KTQAVGPVGV  300
EQHESENPPP QVGEEDHLKE IETTRDALDG TSLNKCVPVE PGEQLDNEPI SNLHRGMRRR  360
CLVFDMAGAR RKHLDDSSSS GSSMLPQPDV NIESNDKQLV PTKPGNDSSH CILPGIGLHL  420
NALARTSKEY SVVSHETLSS GRQLINDPSS TASIHSLNTG QDQYLAVTSM EGNMGLTESG  480
VPGMEDASQA SGYIASEELN QSSPKKKRRR LENAGESEGC KRCNCKKSKC LKLYCECFAA  540
GVYCVEPCSC QDCYNKPIHE DTVLATRKQI ESRNPLAFAP KVIRSSDSVP EIGDESSKTP  600
ASARHKRGCN CKKSGCLKKY CECYQGGVGC SINCRCEGCK NAFGRKDGSG TIGAEAELEE  660
EPEVSEKSVI HKSLHTNLIQ NSSEQNPDST LPATPLHSGR ASVQLPFSKN KPPRSFLSIG  720
SSSGFFGNQR FGKPSFLPPQ PKFEKPIQTV QEDEMPEILQ GNCSPIGAIK SASPNSKRVS  780
PPHCDFGSSP GRRSSRKLIL QSIPSFPSLN PKD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1504511PKKKRRRL
2505510KKKRRR
3507511KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-158CPP family protein