PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID evm.model.AsparagusV1_03.530
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Asparagaceae; Asparagoideae; Asparagus
Family CPP
Protein Properties Length: 469aa    MW: 50739.5 Da    PI: 8.7393
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
evm.model.AsparagusV1_03.530genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR51.32.3e-16165202340
                           TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                   +k+CnCkkskClk+YCeCfaag +C e C+C++C+Nk 
  evm.model.AsparagusV1_03.530 165 CKRCNCKKSKCLKLYCECFAAGIYCVEPCSCQGCFNKP 202
                                   89**********************************96 PP

2TCR53.26e-17249287139
                           TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                                   ++k+gCnCkks+ClkkYCeCf+ g+ Cs +C+Ce+CkN 
  evm.model.AsparagusV1_03.530 249 RHKRGCNCKKSSCLKKYCECFQGGVGCSASCRCEGCKNA 287
                                   589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 469 aa     Download sequence    
MRRRCLTFEL AGGSTKNSNN NSKLHPSVSS LSSRKFTLDD KLSILSKPES SSSQRVLPGI  60
GLHLNTLATT PSDRMVTKET LAFGKRFISM PCSTSPFPPI TAVQKSVNKS LDVIKDEHPT  120
GSGVPGLEVT HDDASEAPAD GENLMQGSPK KKKRKSENGG ESEGCKRCNC KKSKCLKLYC  180
ECFAAGIYCV EPCSCQGCFN KPIHEETVLA TRKQIESRNP LAFAPKVIRA SKPVQDMAEE  240
TNKTPASARH KRGCNCKKSS CLKKYCECFQ GGVGCSASCR CEGCKNAFGR KDGAEAVHAE  300
DIMDVSEKQQ DEQQDGQQND IAQKNEHYSP ESILPITPSF ETCRPFVKPP FPSSGKPPRS  360
SALSACYPTS QTLRRCEFLS KPKTVDHSNS LANDDTPEIL RCDASPNNSV KIISPNGKRV  420
SPPHGALGIS PNRKGGRKLI LKSIPSFPSL NGDVSNEHPA SYSSDPLS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1149155PKKKKRK
2150155KKKKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.11e-104CPP family protein