Home

/

GitHub

/

stemangiola/rpharma2020_tidytranscriptomics

/

README.md

README.md
In stemangiola/rpharma2020_tidytranscriptomics: Introduction to Tidy Transcriptomics

Introduction to Tidy Transcriptomics

rpharma2020 tidybulk

Maria Doyle
Stefano Mangiola

Material web page.

This material was created for the R/Pharma2020 conference workshop but it can also be used for self-learning.

More details on the workshop are below.

This is necessary in order to reproduce the code shown in the workshop. The workshop is designed for R 4.0 and can be installed using one of the two ways below.

If you're familiar with Docker you could use the Docker image which has all the software pre-configured to the correct versions.

docker run -e PASSWORD=abc -p 8787:8787 stemangiola/rpharma2020_tidytranscriptomics:rpharma2020

Once running, navigate to http://localhost:8787/ and then login with Username:rstudio and Password:abc.

You should see the Rmarkdown file with all the workshop code which you can run.

Alternatively, you could install the workshop using the commands below in R 4.0.

# Install same versions used in the workshop
#install.packages('remotes')
remotes::install_github(c("stemangiola/tidybulk@v1.1.8", "stemangiola/tidygate@v0.2.8", "stemangiola/tidyseurat@v0.1.8", "stemangiola/tidyHeatmap@v1.1.5"))

# Install workshop package
remotes::install_github("stemangiola/rpharma2020_tidytranscriptomics", build_vignettes = TRUE)

# To view vignettes
library(rpharma2020tidytranscriptomics)
browseVignettes("rpharma2020tidytranscriptomics")

To run the code, you could then copy and paste the code from the workshop vignette or R markdown file into a new R Markdown file on your computer.

This workshop will present how to perform analysis of RNA sequencing data following the tidy data paradigm. The tidy data paradigm provides a standard way to organise data values within a dataset, where each variable is a column, each observation is a row, and data is manipulated using an easy-to-understand vocabulary. Most importantly, the data structure remains consistent across manipulation and analysis functions.

This can be achieved for RNA sequencing data with the tidybulk, tidyseurat, tidyHeatmap and tidyverse packages. The tidybulk package provides a tidy data structure and a modular framework for bulk transcriptional analyses, the tidyseurat package provides similar for single-cell transcriptional analyses, and tidyHeatmap provides a tidy implementation of ComplexHeatmap. These packages are part of the tidytranscriptomics suite that introduces a tidy approach to RNA sequencing data.

Basic knowledge of RStudio
Familiarity with tidyverse syntax

Recommended Background Reading Introduction to R for Biologists

The workshop format is a 3 hour session consisting of hands-on demos, exercises and Q&A.

tidyverse
tidybulk
tidyseurat
tidyHeatmap
tidygate
limma
edgeR
DESeq2
airway
org.Hs.eg.db
org.Dm.eg.db
dittoSeq
ggrepel
GGally
plotly

Rough guide

| Activity | Time | |-------------------------------------------------------|------| | Part 1 Bulk RNA-seq Core | | | Hands-on Demos | 50m | | Introduction to RNA-seq | | | Data preprocessing | | | Data dimensionality reduction and clustering | | | Differential gene expression | | | Data visualisation | | | Exercises and Q&A | 10m | | Break | 5m | | Part 2 Bulk RNA-seq Extended | | | Hands-on Demos | 40m | | Comparison of differential testing tools | | | Cell type composition analysis | | | Q&A | 10m | | Break | 5m | | Part 3 Single-cell RNA-seq | | | Hands-on Demos | 30m | | Introduction to single-cell | | | Additional Q&A | 30m | | Total | 180m |

In exploring and analysing RNA sequencing data, there are a number of key concepts, such as filtering, scaling, dimensionality reduction, hypothesis testing, clustering and visualisation, that need to be understood. These concepts can be intuitively explained to new users, however, (i) the use of a heterogeneous vocabulary and jargon by methodologies/algorithms/packages, (ii) the complexity of data wrangling, and (iii) the coding burden, impede effective learning of the statistics and biology underlying an informed RNA sequencing analysis.

The tidytranscriptomics approach to RNA sequencing data analysis abstracts out the coding-related complexity and provides tools that use an intuitive and jargon-free vocabulary, enabling focus on the statistical and biological challenges.

Learning goals

To understand the key concepts and steps of RNA sequencing data analysis
To approach data representation and analysis though a tidy data paradigm, integrating tidyverse with tidybulk, tidyseurat and tidyHeatmap.

Learning objectives

Recall the key concepts of RNA sequencing data analysis
Apply the concepts to publicly available data
Create plots that summarise the information content of the data and analysis results

stemangiola/rpharma2020_tidytranscriptomics documentation built on Oct. 9, 2020, 9:41 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

stemangiola/rpharma2020_tidytranscriptomics
Introduction to Tidy Transcriptomics

README.md
In stemangiola/rpharma2020_tidytranscriptomics: Introduction to Tidy Transcriptomics

Introduction to Tidy Transcriptomics

Instructor names and contact information

Syllabus

Workshop package installation

Via Docker image

Via GitHub

Workshop Description

Pre-requisites

Workshop Participation

R / Bioconductor packages used

Time outline

Workshop goals and objectives

Learning goals

Learning objectives

R Package Documentation

Browse R Packages

We want your feedback!

stemangiola/rpharma2020_tidytranscriptomics Introduction to Tidy Transcriptomics

README.md In stemangiola/rpharma2020_tidytranscriptomics: Introduction to Tidy Transcriptomics

Introduction to Tidy Transcriptomics

Instructor names and contact information

Syllabus

Workshop package installation

Via Docker image

Via GitHub

Workshop Description

Pre-requisites

Workshop Participation

R / Bioconductor packages used

Time outline

Workshop goals and objectives

Learning goals

Learning objectives

R Package Documentation

Browse R Packages

We want your feedback!

stemangiola/rpharma2020_tidytranscriptomics
Introduction to Tidy Transcriptomics

README.md
In stemangiola/rpharma2020_tidytranscriptomics: Introduction to Tidy Transcriptomics