PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022138009.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Momordiceae; Momordica
Family CPP
Protein Properties Length: 788 aa    MW: 85763.4 Da    pI: 5.8024
Description CPP family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_022138009.1    genome   NCBI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.7   3.4e-16  491    529  2          40
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  XP_022138009.1 491 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 529
                     689**********************************96 PP

2    TCR     50.5   4e-16    576    614  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  XP_022138009.1 576 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 614
                     589***********************************7 PP
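
The alignment rows above can be summarised numerically. The sketch below is only an illustration, not part of the PlantTFDB record: it takes the consensus and query rows of the first TCR hit, counts the columns where the residues are identical (the letters on the match line, as opposed to "+" for similar residues), and checks how many consensus cysteines are retained in the query. The function names are hypothetical, and the code assumes an ungapped alignment, which holds for both hits shown here.

```python
# Alignment rows for TCR hit 1, copied from the record (39 aligned columns).
consensus = "ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke"  # HMM positions 2-40
query = "SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP"      # residues 491-529


def identical_columns(cons: str, qry: str) -> int:
    """Count columns where consensus and query carry the same residue (case-insensitive)."""
    if len(cons) != len(qry):
        raise ValueError("alignment rows must have the same length")
    return sum(c.upper() == q.upper() for c, q in zip(cons, qry))


def conserved_cysteines(cons: str, qry: str) -> tuple[int, int]:
    """Return (cysteines kept in the query, cysteines in the consensus)."""
    cys_columns = [i for i, c in enumerate(cons) if c.upper() == "C"]
    kept = sum(qry[i].upper() == "C" for i in cys_columns)
    return kept, len(cys_columns)


n_identical = identical_columns(consensus, query)
kept, total = conserved_cysteines(consensus, query)
print(f"{n_identical}/{len(query)} identical columns")
print(f"{kept}/{total} consensus cysteines conserved")
```

For this hit the count of identical columns agrees with the number of letters on the match line, and every cysteine in the consensus is present in the query.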

Sequence
Protein Sequence    Length: 788 aa
MDTPERNQIG TTSVSKFEDS PVFNYINNLS PIQPVKSIHT TQTFNSLSFA SLPSVFTSPH  60
VSSYKESRFL RRHSDADLSK PDFSSENGNK VGTDGGVTGN VAQLPDNSSV PHGNSDSLAN  120
PSTELPSEHL LSTKEEPQAT KCDSPRPDCD QKLPDCVIKL GEDPPLSNPC VQNKSGNGSS  180
EAEGNKQVIS HTEEREGTGY DWESLITEGA DLLIFSSPNG SEAIRLVQKP LDLVTGFTSS  240
TLSQIMENDN TNLSKMRIAD PIESSGVQHE IEFPSSQAGE ACEIKDMDQA VEGLSYPNSN  300
NSMSREITDD EVAKYITDDC KPVSNLYRGM RRRCLDFEAA VSRRKNLEDG SNSGSVSTHA  360
EEKMTSMDKQ LIPYKSGGVP ARCMLTGIGL HLNALATTSK DTKNLNHEKF SSERQLSLPN  420
SSASCHSPST GLDPLLTSVV TERDVDPSEN GVQNEEDASR ASAYVLAEDF NQNSPKKKRR  480
RLEHAGETES SCKRCNCKKS KCLKLYCECF AAGVYCIEPC SCQDCFNKPI HEDTVLATRK  540
QIESRNPLAF APKVIRNSDS LPEPGDESNK TPASARHKRG CNCKKSSCLK KYCECYQGGV  600
GCSISCRCEG CKNAFGRKDG SSLLGIETEQ EEEETESNQK SVVDKALQGP EIQNNEEQNP  660
GSAIPSTPLQ LCRQLVSLPF SSRGKPLRSS FLNVGSSSGF YAGHKLEKPN ILRPQPDLGK  720
NTKPVSEDEM PDILRGDCSP GAGVKTGSPN SKRISPPQSD FGLSPLPRTG RKLILQSIPS  780
FPSLTPQH
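
The sequence is displayed in groups of ten residues with a running position count at the end of each full line, so it needs cleaning before the coordinates in the Signature Domain table can be applied to it. The following is a minimal sketch under that assumption: it strips everything that is not a letter, then slices with 1-based, inclusive coordinates as used in this record. The helper names `parse_sequence` and `region` are hypothetical.

```python
import re

# Sequence block exactly as displayed in the record.
RAW = """
MDTPERNQIG TTSVSKFEDS PVFNYINNLS PIQPVKSIHT TQTFNSLSFA SLPSVFTSPH  60
VSSYKESRFL RRHSDADLSK PDFSSENGNK VGTDGGVTGN VAQLPDNSSV PHGNSDSLAN  120
PSTELPSEHL LSTKEEPQAT KCDSPRPDCD QKLPDCVIKL GEDPPLSNPC VQNKSGNGSS  180
EAEGNKQVIS HTEEREGTGY DWESLITEGA DLLIFSSPNG SEAIRLVQKP LDLVTGFTSS  240
TLSQIMENDN TNLSKMRIAD PIESSGVQHE IEFPSSQAGE ACEIKDMDQA VEGLSYPNSN  300
NSMSREITDD EVAKYITDDC KPVSNLYRGM RRRCLDFEAA VSRRKNLEDG SNSGSVSTHA  360
EEKMTSMDKQ LIPYKSGGVP ARCMLTGIGL HLNALATTSK DTKNLNHEKF SSERQLSLPN  420
SSASCHSPST GLDPLLTSVV TERDVDPSEN GVQNEEDASR ASAYVLAEDF NQNSPKKKRR  480
RLEHAGETES SCKRCNCKKS KCLKLYCECF AAGVYCIEPC SCQDCFNKPI HEDTVLATRK  540
QIESRNPLAF APKVIRNSDS LPEPGDESNK TPASARHKRG CNCKKSSCLK KYCECYQGGV  600
GCSISCRCEG CKNAFGRKDG SSLLGIETEQ EEEETESNQK SVVDKALQGP EIQNNEEQNP  660
GSAIPSTPLQ LCRQLVSLPF SSRGKPLRSS FLNVGSSSGF YAGHKLEKPN ILRPQPDLGK  720
NTKPVSEDEM PDILRGDCSP GAGVKTGSPN SKRISPPQSD FGLSPLPRTG RKLILQSIPS  780
FPSLTPQH
"""


def parse_sequence(raw: str) -> str:
    """Remove spaces, newlines and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_sequence(RAW)
tcr1 = region(seq, 491, 529)  # first TCR hit in the Signature Domain table
tcr2 = region(seq, 576, 614)  # second TCR hit

print(len(seq))
print(tcr1)
print(tcr2)
```

The extracted regions can be compared against the query rows of the two domain alignments as a check that the coordinates and the sequence agree.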
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    475    482  PKKKRRRL
2    476    481  KKKRRR
3    478    482  KRRRL
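
The three NLS hits overlap, so they describe one basic stretch of the protein rather than three separate signals. As a rough illustration (not something the record itself computes), the sketch below merges the reported intervals and checks that the shorter hits are consistent with the longest one. The function name `merge_intervals` is hypothetical; coordinates are 1-based and inclusive, as in the table.

```python
# NLS hits from the table: (start, end, sequence).
hits = [
    (475, 482, "PKKKRRRL"),
    (476, 481, "KKKRRR"),
    (478, 482, "KRRRL"),
]


def merge_intervals(intervals):
    """Merge overlapping inclusive intervals into a sorted list of disjoint ones."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


merged = merge_intervals([(start, end) for start, end, _ in hits])

# Consistency check: each hit should equal the matching slice of the longest hit.
first_start, _, longest = hits[0]
consistent = all(
    longest[start - first_start:end - first_start + 1] == motif
    for start, end, motif in hits
)

print(merged)
print(consistent)
```

Here the hits collapse to the single interval 475-482, which covers the PKKKRRRL stretch.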
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT3G22780.1   1e-143   CPP family protein