In MRCToxBioinformatics/Proteomics_data_analysis: Basic proteomics data analysis tutorials

Visualisation of proteomics data is a very broad topic, with lots of potential bespoke visualisations one may want to perform. This notebook is intended to provide a brief overview of how you can extract your proteomics data and manipulate it to prepare it for visualisation

Load dependencies

Load the required libraries.

library(camprotR)
library(MSnbase)
library(dplyr)
library(tidyr)
library(ggplot2)
library(pheatmap)

Here, we'll use the protein-level quantification processed in Processing and QC of TMT data. Please see the previous notebook for details of the data processing.

tmt_protein <- readRDS('./results/tmt_protein.rds')

As a reminder, this data comes from a published benchmark experiment where yeast peptides were spiked into human peptides at 3 known amounts to provide ground truth fold changes (see below). Two versions of Orbitrap control software were tested, with v3 shown to improve fold-change estimation for low intensity peptides. Here, we will use the data obtained with v3. For more details, see: [@http://zotero.org/users/5634351/items/LG3W8G4T]

Single gene plots

single_protein <- tmt_protein[1,] %>% # taking the first protein as an example
  biobroom::tidy.MSnSet(addPheno=TRUE)

print(single_protein)

single_protein %>%
  ggplot() +
  aes(spike, value) +
  geom_point() +
  stat_summary(geom='errorbar', width=0.2) +
  theme_camprot() +
  labs(x='', y='Protein abundance (log2)')

Heatmap

set.seed(0)
random_proteins <- sample(rownames(tmt_protein), 500, replace=FALSE)

# update with actual species annotations!
ann_row <- data.frame(species=rep(c('yeast', 'human'), 250), row.names=random_proteins)

ann_col <- list('species'=c('yeast'=get_cat_palette(1),
                            'human'=get_cat_palette(2)[2]))

tmt_protein[random_proteins,] %>% exprs()%>% t() %>% scale() %>% t() %>%
  pheatmap(show_rownames=FALSE, cluster_cols=FALSE, border_color=NA)

tmt_protein[random_proteins,] %>% exprs()%>% t() %>% scale() %>% t() %>%
  pheatmap(show_rownames=FALSE, cluster_cols=FALSE, border_color=NA,
           annotation_row=ann_row, annotation_colors=ann_col)

MRCToxBioinformatics/Proteomics_data_analysis documentation built on July 7, 2024, 6:44 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

MRCToxBioinformatics/Proteomics_data_analysis
Basic proteomics data analysis tutorials

In MRCToxBioinformatics/Proteomics_data_analysis: Basic proteomics data analysis tutorials

Load dependencies

Single gene plots

Heatmap

R Package Documentation

Browse R Packages

We want your feedback!

MRCToxBioinformatics/Proteomics_data_analysis Basic proteomics data analysis tutorials

In MRCToxBioinformatics/Proteomics_data_analysis: Basic proteomics data analysis tutorials

Load dependencies

Single gene plots

Heatmap

R Package Documentation

Browse R Packages

We want your feedback!

MRCToxBioinformatics/Proteomics_data_analysis
Basic proteomics data analysis tutorials