PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023914139.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family CPP
Protein Properties Length: 617aa    MW: 67521.8 Da    PI: 8.3546
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023914139.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR444.4e-14152191242
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42 
                     ++k+CnC+ s+Clk+YCeCfaag +C++ C+C +C+N+ e+
  XP_023914139.1 152 KQKQCNCRSSRCLKLYCECFAAGIYCDN-CNCLNCQNNVEN 191
                     89*************************9.********9875 PP

2TCR50.15.4e-16236275140
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     k++kgC+Ckks ClkkYCeCf+a++ Cse+CkC +CkN e
  XP_023914139.1 236 KHSKGCQCKKSGCLKKYCECFQANVLCSENCKCMECKNFE 275
                     589***********************************65 PP

Sequence ? help Back to Top
Protein Sequence    Length: 617 aa     Download sequence    
MEKDDETNSD SAPKKSARQL DFSPQKPQPQ LQMRSQSLSQ PQPQQQPLSP VQLQLQLQLQ  60
LQSQSQLQLR PHAPMLPQLQ PRPPPPPPMS PQWHVQLPLP PLSPQLQPRP PQQPVLRSHP  120
VHKLPIPAVT KQESPRPKPR ANAEAKDSTP KKQKQCNCRS SRCLKLYCEC FAAGIYCDNC  180
NCLNCQNNVE NEAARQEAVG VTLERNPNAF RPKIASSPHE PRDSREDVAE VQVVGKHSKG  240
CQCKKSGCLK KYCECFQANV LCSENCKCME CKNFEGSQER RDLIHEDHNA MANIQQAKLA  300
NAAISGAIGS FGYGTPLVPR KRKSHELCQR ENHLRASSIP DSCIANAAVS GSSNFKYRST  360
LANILQPQDV KDLCSHLVEL SAEATMTRTG NSGKMDRETD RKNIEISTAL STQESEVAEN  420
GHGSQKAIPG NHLSRNQVDR DGSTDSGSDG VDMDNGRPMS PGTLALMCDE QDTMFMDGGS  480
PKGVLSDGQN TRQKSSNGHG SELYAEQERL VLARLRDFLN RIIACGSIRE TMPSPVAKNG  540
TGSQQKPVEN GNVKSGSEAR SHKEAYGNGI TKSPVTASAE ASSRAHPVTS DNKDLSSKVG  600
LPNGNGEKIK PNTDREL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1321325KRKSH
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29000.11e-115CPP family protein