PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021670958.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family CPP
Protein Properties Length: 920aa    MW: 100534 Da    PI: 6.7903
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021670958.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR47.34e-15501538340
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     +k+CnCk++kClk+YC+Cfaag +C+e C C++C+N+ 
  XP_021670958.1 501 CKRCNCKRTKCLKLYCDCFAAGIYCAEPCACQGCFNRP 538
                     89**********************************86 PP

2TCR50.34.6e-16584622139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCk+s ClkkYCeC++a++ Cs+eC+Ce+CkN 
  XP_021670958.1 584 RHKRGCNCKRSMCLKKYCECYQANVGCSSECRCEGCKNV 622
                     589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 920 aa     Download sequence    
MDSPQPANTT TTAAVTAASL SDSPPVQESP FSNYISNLSP IKPVKTAHVP QGFLGVSSPP  60
LVFTSPRTIP HRETSFFQRS QFTQIASAEI PENDDGRKNF AGLSNDIGES DNYSSKLIAD  120
VRQNNDGENS ARDQPGSSSG CVDEYLYDPV DVDCASSANL VNPNAKQSND VLQSSVSSLT  180
DSNKGILKSD GKNHLRAEVD KSQALSKQAE EDVQGQSRFE IKPVQVEEDQ SGNKKSSIKC  240
PNVQSDHASE KNQCDVLETQ VVQAHEDYNE NVAASLQGAM HNMVQLEQEA SQLQRGLSRR  300
CLQFEEAQWK TIVNSTCSPN LTNYVTGSGS PGSATELESL NSSLVDLTDS SNKKEMVNLS  360
RPATSMFPLR CNEKSPIVVS KPSGIGLHLN SIVNTLPVGH TAAASIKSSN CLKLSNLVEK  420
VPITPKDRML ETKASLAASD TTAESFHNAE PLNMLQSLGH QLTPSNKRKF NPEHEDNFEE  480
VGQESPTKKK RKKSSLDGEG CKRCNCKRTK CLKLYCDCFA AGIYCAEPCA CQGCFNRPDY  540
EDTVLETRQQ IESRNPLAFA PKIVPHVTGF AAEDGNQLMP SLARHKRGCN CKRSMCLKKY  600
CECYQANVGC SSECRCEGCK NVYGRKEEYG RTGEIASNIV GEERLDGRIH DKLEMMATNK  660
DLLHAELYDL RNLTPSTPSF QHSDHGKDAQ KSRFNSSRYV PSPQSDFSIL PSYAKSTRSP  720
RNSHNNNMIP ETSEEILDID TCGQGMDYNV ADMMNQISPR HNALENICDL TPFQNPSMTG  780
ASSASSKARD WAGGSGLQLC PGSGCFSSGR SLRWRSSPIT PMTRLVESKN QEHGTDSGLY  840
DIQEDDTPEI LKEASTPVTS VKASSPNKKR VSPPHKHIQD LRSSSSGGLK SGRKFILKAV  900
PSFPPLTPCI DSKGSKNEKQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1489493KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.16e-61CPP family protein