PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_020686454.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family CPP
Protein Properties Length: 774aa    MW: 84292.7 Da    PI: 6.5327
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_020686454.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.54.1e-16482519340
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     +++CnCkkskClk+YCeCfaag++C e C+C++C Nk 
  XP_020686454.1 482 CRRCNCKKSKCLKLYCECFAAGVYCVEPCSCQGCLNKP 519
                     789*********************************96 PP

2TCR50.63.9e-16566604139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  XP_020686454.1 566 RHKRGCNCKKSNCLKKYCECYQGGVGCSFSCRCEGCKNA 604
                     589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 774 aa     Download sequence    
MDTPERSKVG GTPISKFEDS PVFNFINSLS PIQPVKSVHS IHTFHSPGSA AISSVFSSPH  60
VSFHKESRFF IRHPFPDSVK QDASSDGLDE SNVHSGVSNG AVLSASIVLS QENCNITCSL  120
NEASVDLADD SSTQPDFSQS VQVDNGSPNH NTSPFYGVKL DFKLDISCTP SERVKLVQNM  180
SENTKNPHTT KIGLEVQFPA DQMKEDASES DWDNLISDNT DDLLIFETAT GSETSKSEEK  240
DYGISLNSHP NASGLHNVSQ DSFMDDDGET GEHDGMSPTR SSACQEHENF VDLNLKTEAE  300
LNNGVPLGCK VDSQQHRGVR RRCLVFEAAG VSKKNIHNNS NAPPSISLPI NAKPNSENRQ  360
LVSSKTSKTP VPYVLPGIGL HLNALARTSK DRPISQKTKT PGREFISRPC SMRPFSPAIA  420
GENQSFKLSD PHQSPDPNGS ELHDSQAVHS DVSLIPALGI EESTSPKKKR RKTENSGENE  480
GCRRCNCKKS KCLKLYCECF AAGVYCVEPC SCQGCLNKPV HEETVLTTRK QIESRNPLAF  540
APKVIRTSEH DQEMGEESNK TPASARHKRG CNCKKSNCLK KYCECYQGGV GCSFSCRCEG  600
CKNAFGRKDG FEEIEFEEDL DACEKEGVKS DDGKANANVP KLEHNHLSES ILPVTPFPSC  660
RSSVKLPFLS SGKPPRSSTL SLMNRSENQQ AECKSDQIFH DDDTSEDLRV NSSPCTAVKT  720
ASPNGKRVSL PLNGFGLSPN LKGGRKLILK SIPSFPSLTT DAPSSTQINS SVFS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1466472PKKKRRK
2467471KKKRR
3467472KKKRRK
4468472KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-125CPP family protein