PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG004770.4
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family CPP
Protein Properties Length: 641aa    MW: 69984.7 Da    PI: 6.2049
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG004770.4genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.73.5e-16453491240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+C+CkkskClk+YCeCfaag++Cse C+C++C Nk 
  TRIDC4BG004770.4 453 SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKP 491
                       689**********************************96 PP

2TCR49.67.9e-16538577140
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+gCnCkks+ClkkYCeC++ g+ Cs++C+Ce CkN+ 
  TRIDC4BG004770.4 538 RHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTF 577
                       589***********************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 641 aa     Download sequence    
DSLSPIEPLK SAYSGNGLQA YHQSLNAASV SSIFTSPHHN AHKEGSKLSK SSFGDYTENE  60
ISMEDGTDKN KSPTSSTAVR LFACTSTITR ESHTMITCSV NEGIVDPPKE PNDLPQPGRF  120
DSGSPDHNTA PCHGVSVRSD LKQDKCPKLE AVQATNNTVE KRKCLFSSDM QPQDGCQPAK  180
ENNEVMGCEW EDLVSVTSGE LLAFDSSMDQ HHTGVQLAVN NAESCGYLLS KLAGGADIPD  240
RTHPTTSSQA YYHEMVVGED KTENGQLFPE DKKTILSEEI QDNINEENAC IPLGCKVETQ  300
QRGVRRRCLV FEASGYSHRT VQKEYVGDLS FSTSKGKSSA QNHRNPGKTP SPHVFRGIGL  360
HLNALALTSK DKMACQDPLA TALVPSLKTE QDVHGNLLSA GGNFVHSGSG SLDLQMDNDD  420
CSVGGFLGND HNSSQSSSPP KKRRKSDNGD DDSCKRCSCK KSKCLKLYCE CFAAGVYCSE  480
PCSCQGCLNK PIHEEIVLST RKQIEFRNPL AFAPKVIRMS EAGQETQEDP KNTPASARHK  540
RGCNCKKSSC LKKYCECYQG GVGCSNNCRC ETCKNTFGTR DVAVSAENEE MKQEGDQTEN  600
REQEKENDQQ KANVHSEDHK LVELVVPITP PLDVSRFVGT F
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1440446PKKRRKS
2441445KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.12e-65CPP family protein