PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022955434.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita
Family CPP
Protein Properties Length: 748 aa    MW: 81439.3 Da    pI: 5.181
Description CPP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_022955434.1  genome  NCBI    View Nucleic Acid
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     51.5   2e-16    462    500  2          40
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  XP_022955434.1 462 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 500
                     589**********************************96 PP

2    TCR     50.6   3.7e-16  547    585  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+Clk+YCeC++ g+ Cs +C+Ce+CkN 
  XP_022955434.1 547 RHKRGCNCKKSNCLKRYCECYQGGVGCSISCRCEGCKNA 585
                     589***********************************7 PP
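
The coordinates in the two alignment blocks above can be cross-checked against the aligned residues. The following is a minimal sketch, not part of PlantTFDB: the helper names `parse_alignment_row` and `span_is_consistent` are hypothetical, and it assumes each alignment row has exactly four whitespace-separated fields (name, start, aligned residues, end) with 1-based inclusive coordinates.

```python
def parse_alignment_row(line):
    """Split a row such as 'NAME 462 SEQ 500' into its four fields."""
    name, start, aligned, end = line.split()
    return name, int(start), aligned, int(end)


def span_is_consistent(start, aligned, end):
    """True if the 1-based inclusive span matches the number of residues."""
    # Gap characters ('-' or '.') do not consume sequence coordinates.
    residues = sum(1 for ch in aligned if ch not in "-.")
    return end - start + 1 == residues


# Target rows copied from the two TCR domain alignments in this record.
rows = [
    "  XP_022955434.1 462 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 500",
    "  XP_022955434.1 547 RHKRGCNCKKSNCLKRYCECYQGGVGCSISCRCEGCKNA 585",
]

for row in rows:
    name, start, aligned, end = parse_alignment_row(row)
    print(name, start, end, span_is_consistent(start, aligned, end))
```

Both rows span 39 residues, which agrees with the Start and End values of the domain table.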

Sequence
Protein Sequence    Length: 748 aa
MDTPERNQIG TTAAPKFEDS PVFNYINNLS PIQPVKSIHT AQTFNSLSFS SLPSVFTSPH  60
TSSYKESRFL RRRCYSDLSK PDFSSENGNK GVTGNVDELQ VNSSAPDDKS ESSTKLSSEP  120
QALKSHSSRP DCDRPLPDSD DRAVSDPCVE NNSGSGSSEA EGNRLVVSQT EEKEGTSYDW  180
ESLITEGADI LIFSSPNGSE AIRLVQKPSS TLAQMMENDD ANLSRMRIAD PIESSGGQHE  240
IEFSSSRAGE GCELKDMDQA IDDGISFPIS SDSMSREITD EEVARYIADD CMPVSNLHRG  300
MRRRCLDFEA VVSRRKNMED GSNGCSVSTH PEEKTTSMDK QLVPYKSGGV VTRYVLTGIG  360
LHLNALATTS KDAKNLNHER FSSERQLNLP NSGGSCHSSS TGLDPLSTSI VTEQDMDPSG  420
SGVQSEEDAA MASAYVLADD FNQNSPKRKR RRLEHAGETE LSCKRCNCKK SKCLKLYCEC  480
FAAGVYCIEP CSCQDCFNKP IHEDTVLATR KQIESRNPLA FAPKVIRNSD SLPEPGDESN  540
KTPASARHKR GCNCKKSNCL KRYCECYQGG VGCSISCRCE GCKNAFGRKD GSSLLGLETE  600
LEEEETEANQ MSVMDKALQG PEIQNNEEQN PGSAIPSTPL QLPRQLVPLP FSARSNPLRS  660
SFLNVGSSSG FYAGYKHEKP NTAPVSDDAI PNILRGDGSP GAGVKTDSPN SMRISPPQSD  720
FGLSPLPRTG RKLILQSIPS FPSLTPQH
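
The sequence above is laid out in blocks of 10 residues, with a running residue count at the end of each full line. A minimal sketch of how this layout could be turned back into a contiguous string follows; the function name `parse_blocked_sequence` is hypothetical, and the example uses only the first two lines of the record for brevity.

```python
def parse_blocked_sequence(text):
    """Join 10-residue blocks into one string, checking the running counters."""
    residues = ""
    for line in text.splitlines():
        tokens = line.split()
        # A trailing all-digit token is the running residue count, not sequence.
        if tokens and tokens[-1].isdigit():
            counter = int(tokens.pop())
        else:
            counter = None
        residues += "".join(tokens)
        if counter is not None and counter != len(residues):
            raise ValueError(f"counter {counter} != {len(residues)} residues")
    return residues


# First two lines of the protein sequence shown in this record.
first_two_lines = """\
MDTPERNQIG TTAAPKFEDS PVFNYINNLS PIQPVKSIHT AQTFNSLSFS SLPSVFTSPH  60
TSSYKESRFL RRRCYSDLSK PDFSSENGNK GVTGNVDELQ VNSSAPDDKS ESSTKLSSEP  120
"""

seq = parse_blocked_sequence(first_two_lines)
print(len(seq))
```

Applied to all lines of the record, the same function should give a string whose length can be compared with the stated 748 aa.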
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    446    453  PKRKRRRL
2    447    452  KRKRRR
3    448    452  RKRRR
4    449    453  KRRRL
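
The four NLS entries overlap one another. As a hedged illustration (the helper `merge_intervals` is hypothetical and not something PlantTFDB provides), the sketch below merges overlapping or directly adjacent 1-based inclusive intervals to show that all four hits fall within a single stretch of the protein.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # 'start <= previous end + 1' also joins intervals that merely touch.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


# Start/End columns of the NLS table in this record.
nls_hits = [(446, 453), (447, 452), (448, 452), (449, 453)]

print(merge_intervals(nls_hits))  # [(446, 453)]
```

The merged region 446-453 corresponds to the first entry, PKRKRRRL, which contains the other three hits.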
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-123   CPP family protein