In davemcg/eyeIntegration_app: eyeIntegration Shiny App Code

tximport guide

https://bioconductor.org/packages/release/bioc/vignettes/tximport/inst/doc/tximport.html#import_transcript-level_estimates

Load libraries, import count data and metadata

The tx_counts import will take some time, as this file is over 300mb

You can also swap out for transcript counts if you want (use instead)

library(tidyverse)
library(tximport)
#https://eyeintegration.nei.nih.gov -> Data -> Data Download for links
metadata <- read_tsv('https://hpc.nih.gov/~mcgaugheyd/eyeIntegration/2019_metadata_04.tsv.gz')
gene_counts <- read_csv('https://hpc.nih.gov/~mcgaugheyd/eyeIntegration/2019_12_gene_counts_04.csv.gz')

Load annotation data matched with `EiaD 2019` (`gencode.v29.annotation.gtf.gz`)

gtf <- rtracklayer::readGFF('ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz') %>% 
  dplyr::filter(type=='transcript') %>% 
  dplyr::mutate(gene_type = 'protein_coding')
anno <- gtf[,c("gene_id", "gene_name", "transcript_id", "gene_type")]

Import YOUR personal RNA-seq data

Use Salmon or kallisto with:
gencode.v29.annotation.gtf.gz
gencode.v29.transcripts.fa.gz
Use tximport to load your Salmon or kallisto data into R

Using SRA `ERR2303761` for this example

Salmon version 1.0 used here

This below chunk is NOT run in R, but on the command line. ```{bash, eval = FALSE}

sratoolkit

fasterq-dump --include-technical --split-files ERR2303761

get ref files

wget ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz wget ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.transcripts.fa.gz

salmon index

salmon index -t gencode.v29.transcripts.fa.gz --gencode -i salmon_index -k 31

salmon quant

salmon quant --libType A -i salmon_index --validateMappings --gcBias --seqBias -p 8 -1 ERR2303761_1.fastq -2 ERR2303761_2.fastq -o ERR2303761_quant

```r
# import with tximport
txi <- tximport(c('ERR2303761_quant/quant.sf'), type = 'salmon', countsFromAbundance = 'no', txOut = FALSE, tx2gene = anno[,c(3,2)])

Glue EiaD data into the `txi` object

We'll take 5 random adult cornea samples

EiaD_samples <- metadata %>% filter(Sub_Tissue == 'Cornea - Adult Tissue', Kept == 'Kept') %>% sample_n(5) %>% pull(sample_accession)

# match up row names
EiaD_data <- data.frame(gene_counts[, EiaD_samples])
row.names(EiaD_data) <- gene_counts$ID
EiaD_data_subset <- EiaD_data[row.names(txi$counts), EiaD_samples]

# update txi$counts colnames to something informative
colnames(txi$counts) <- c('ERR2303761')
txi$counts <- cbind(txi$counts, EiaD_data_subset) %>% as.matrix()

# fudge for now - copy length data  of first to match number of samples you add
# will add length data to website soon (ask if you don't see it!!!!)
txi$length <- cbind(txi$length, replicate(ncol(EiaD_data_subset), txi$length[,1]))

# confirm it work by looking at a few rows (if you see NA a problem has happened)
txi$counts %>% as_tibble(rownames = 'ID') %>% sample_n(10)

Now you can go to DESeq2, or edgeR, or limma, or etc.

https://bioconductor.org/packages/release/bioc/vignettes/tximport/inst/doc/tximport.html#deseq2

Build DESeq2 object

After this, you can use the DESeq2 tutorial: http://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html

library(DESeq2)
sampleTable <- data.frame(condition = factor(c("ERR2303761", rep(c("Adult Cornea"), 5))))
rownames(sampleTable) <- colnames(txi$counts)
dds <- DESeqDataSetFromTximport(txi, sampleTable, ~condition)

devtools::session_info()

davemcg/eyeIntegration_app documentation built on May 18, 2024, 1:37 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

davemcg/eyeIntegration_app
eyeIntegration Shiny App Code

In davemcg/eyeIntegration_app: eyeIntegration Shiny App Code

tximport guide

Load libraries, import count data and metadata

Load annotation data matched with `EiaD 2019` (`gencode.v29.annotation.gtf.gz`)

Import YOUR personal RNA-seq data

Using SRA `ERR2303761` for this example

sratoolkit

get ref files

salmon index

salmon quant

Glue EiaD data into the `txi` object

Now you can go to DESeq2, or edgeR, or limma, or etc.

Build DESeq2 object

R Package Documentation

Browse R Packages

We want your feedback!

davemcg/eyeIntegration_app eyeIntegration Shiny App Code

In davemcg/eyeIntegration_app: eyeIntegration Shiny App Code

tximport guide

Load libraries, import count data and metadata

Load annotation data matched with EiaD 2019 (gencode.v29.annotation.gtf.gz)

Import YOUR personal RNA-seq data

Using SRA ERR2303761 for this example

sratoolkit

get ref files

salmon index

salmon quant

Glue EiaD data into the txi object

Now you can go to DESeq2, or edgeR, or limma, or etc.

Build DESeq2 object

R Package Documentation

Browse R Packages

We want your feedback!

davemcg/eyeIntegration_app
eyeIntegration Shiny App Code

Load annotation data matched with `EiaD 2019` (`gencode.v29.annotation.gtf.gz`)

Using SRA `ERR2303761` for this example

Glue EiaD data into the `txi` object