PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID evm.model.AsparagusV1_06.903
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Asparagaceae; Asparagoideae; Asparagus
Family CPP
Protein Properties Length: 1041 aa    MW: 114606 Da    pI: 6.6241
Description CPP family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
evm.model.AsparagusV1_06.903    genome    Phytozome    View CDS
Signature Domain
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    TCR    49.1    1.2e-15    640    678    4    42
                           TCR   4 kgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42 
                                   k+CnCk+skClk+YC+Cfa g++Cse C C dC+N++e+
  evm.model.AsparagusV1_06.903 640 KRCNCKRSKCLKLYCDCFAVGSFCSEACACIDCSNRTEH 678
                                   89*********************************9886 PP

2    TCR    52.5    9.7e-17    724    763    1    40
                           TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                   ++k+gCnCkks ClkkYCeCf+ag+ Cs+ C+Ce+CkN+ 
  evm.model.AsparagusV1_06.903 724 RHKRGCNCKKSLCLKKYCECFQAGVGCSSGCRCEGCKNTF 763
                                   589***********************************86 PP

Sequence
Protein Sequence    Length: 1041 aa
MTSSESDKPD PKSPPVQDSP IFGYVSSLSP INPVNARSIV QVYENVDIPS PQLVFKTPNF  60
SRSNLLGRSQ LLALNNVENN SPGYRNNTRE YSENVDVNGT SQPTREFVSH RDNECPSGDP  120
SPTQSFTSPS NCVDDFLADP SENGNCSSSS PEFCVKRACK DIEYDNASCC ENQTESLKED  180
LLRPHPCTSA ISTSKQTDDS LHSKRDSHTS DTLVNDSINH KESYLELHAA DMKKHTNKTA  240
VISFRGLYQT EEDVESMSMA AETMEIDKGL IAAQMGQVPK DVRDYTNAHS SDFIIWDRDN  300
SVFQGAIGSS SVSRANICDL SHDEKGNSTT SAFLCQGERE PKLQTTETCQ PNELDCTSQL  360
PRDFHEKIQT GNNNIKITVP PISTSAGNDL EDSIQHQRGK RKRLEFEAAD LSNRNSFNYT  420
TEASNLDLPI APLELESVGA SHAESNAVSS HKQVVHYTST NAPCRVSVRV DKSGQCKESL  480
VAAPRPAGIG LHLNSIGNAA STNYNSDMQL AERNAGLHGK SLSLDIEQAS KNSKSCLVSV  540
SLMHPQKFTF SAEKPLHYSG NERNQERQQG DQGFQQHHSS VNYHSPLAAK PLNSSAPLKH  600
MDHYMTPCNI KGLEEAKKSE ESTQTSPSKK RKRTPANGPK RCNCKRSKCL KLYCDCFAVG  660
SFCSEACACI DCSNRTEHEE IVREARQQIE SRNPLAFAPK VLLRVSDTPK GIGDDEPIPP  720
TSARHKRGCN CKKSLCLKKY CECFQAGVGC SSGCRCEGCK NTFGKKDEYE ISEVTVEHKR  780
PKEASLERDL SGELESIEVK GQITNAKRRH SRFSPLTPLL QSSMGNDLPK HHLPSSLFPS  840
PDSVASIVRS YESSDADLDA MQNDHNTEKV DPFSPGWDVF SDIYNLSPLL KTSVSPSAGS  900
SSAASKTREH KIFKFPQGST KISLGSLRWH NSPITPIPEF EESEFVRESD SNSGVPCNQE  960
DDTPDILKET RSPIKMVKSS SPNSKRVSPP HRILCSTSES PPPSGLRNVR KFILQSMPAF  1020
PPLTPYSKSN KEGTKEDEFT *
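
The sequence above is printed in groups of ten residues, with a running position counter at the end of each full row and a terminal `*` on the last row. A minimal Python sketch for joining such a listing into one continuous string follows; the function name `parse_blocked_sequence` is hypothetical, and the rule "all-digit tokens are counters, `*` is a stop marker" is an assumption about this layout rather than a documented PlantTFDB format. Note that 17 rows of 60 plus the final 20 residues give 1040 letters, so the stated length of 1041 appears to count the `*`.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a sequence printed in 10-residue groups into one string.

    Assumptions (not a documented format): tokens made only of digits
    are row-end position counters, and '*' marks the stop position.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the running position counter
            residues.append(token.rstrip("*"))  # drop the stop marker
    return "".join(residues)


# Last two rows of the listing above (positions 961 onward).
rows = """\
DDTPDILKET RSPIKMVKSS SPNSKRVSPP HRILCSTSES PPPSGLRNVR KFILQSMPAF  1020
PPLTPYSKSN KEGTKEDEFT *
"""

tail = parse_blocked_sequence(rows)
print(len(tail))  # 80
```

Applied to the full listing, the same function should return the complete protein without spaces, counters, or the stop marker.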
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1    629    634    KKRKRT
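
The Start and End columns in this record appear to be 1-based and inclusive: residues 629 to 634 of the protein sequence spell KKRKRT, and the first TCR domain spans 640 to 678 for 39 aligned residues. A short Python sketch of that coordinate convention is given below; `slice_region` and its `offset` parameter are hypothetical helpers introduced for illustration, not part of any PlantTFDB tooling.

```python
def slice_region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from seq.

    `offset` is the 1-based position of seq[0] within the full protein,
    so a fragment can stand in for the whole sequence.
    """
    if start < offset or end < start or end - offset + 1 > len(seq):
        raise ValueError("region lies outside the supplied sequence")
    return seq[start - offset : end - offset + 1]


# Residues 621-640 copied from the protein sequence in this record.
fragment = "ESTQTSPSKKRKRTPANGPK"

nls = slice_region(fragment, 629, 634, offset=621)
print(nls)  # KKRKRT
```

With the full sequence as input and the default `offset=1`, the same call with 640 and 678 should return the first TCR domain segment shown in the alignment.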
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT4G14770.1    1e-56    CPP family protein