PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PON76947.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Trema
Family CPP
Protein Properties Length: 964aa    MW: 105300 Da    PI: 6.0881
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PON76947.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.22.2e-15536574341
         TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                 +k+CnC+kskClk+YC+Cfaag +Cs++C C+dC+Nk+e
  PON76947.1 536 CKRCNCRKSKCLKLYCDCFAAGIYCSDSCACQDCFNKSE 574
                 89**********************************987 PP

2TCR505.8e-16622660139
         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                 ++k+gCnCkks ClkkYCeC++a++ Cs+ C+Ce+C+N 
  PON76947.1 622 RHKRGCNCKKSMCLKKYCECYQAHVGCSSGCRCEGCQNI 660
                 589***********************************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 964 aa     Download sequence    
MDSPETAQVT ISTTTSAIDS PPVQDSPFFN YASNLSPIKP VKTTHVAQIS GFSSPPIVFK  60
SPRTKQHSET NVLKRSQCLQ SSFTEASQNV DRGNKCIVDS GQSTTLCQQG LIIETQKDEV  120
DEFFETHPCS SSGCINEFLA DLEDEDCENS YSATPSLKHS NDALGSSQSP SNNSKETTLK  180
FGDKNGMGAD AGTELEVLLV SLEQAKKDIQ GEPKSDAEPI KIEEEKNDSE WPSNECPNIR  240
IGVHVGHASK KHECLDSLAQ SIESGCQTDA DGTSKPLPLP SQIAQTYKDH SEDVGAVSDG  300
IVENIVRQGY KAFENFGVRR RCLQYEEAHL NTIGDSDSSL NLANTVDSLE LSSSATELET  360
SELCHVDSKA TSSEQHIVNL PQSTATLLHP RYCDKLSVLS KPLGIGLHLN SIVNASPMNY  420
TATASKKSAD QNVGPQGMSS SMTDYNLLEN TKSCSMSVTT VKQVSACDGE TNVIKALIVA  480
SSAISELPPI VQEHYAAPDD NRKSNSQNAD SLEEHDPPSP RKKRKKTANS VGFEGCKRCN  540
CRKSKCLKLY CDCFAAGIYC SDSCACQDCF NKSEYEDTIR ETREQIESRN PLAFAPKIVQ  600
RIPKLPPNNG EDGNQSTPSS ARHKRGCNCK KSMCLKKYCE CYQAHVGCSS GCRCEGCQNI  660
FGKKEEYVAM EHGVSREMVS NRACKELLES PSEPDIEETN RDSLCTELFN PHYLTPVTPS  720
FRHSDHGKHA PKSRLLSIRN LSSPDDLSIL PSNEKSTKSS PRDLEESHIL LETSELLNEC  780
SHDWQADYDT GAVGSVSSRC DVVSHMRDLT QLSDTPPISM ISSTLFRRKD NRNVPQNQLS  840
HGIDRLSSSG TLQWHGSPLT PMTKLSGTKS HQGHDLDSGS GHNDILENEM PVKISSPNKK  900
RVSPPRGHSH VRELGSGSLG GLRSGRKFIL KSVPSFPPLT PCTDSKGSTG TQTTYKLEEN  960
SDKK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1522526KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.13e-58CPP family protein