knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "README-"
)

Introduction

MSIQ is developed for accurate and robust isoform quantification, by integrating multiple RNA-seq samples under a Bayesian framework. Our method aims to

Any suggestions on the package are welcome! For technical problems, please report to Issues. For suggestions and comments on the method, please contact Wei (liw@ucla.edu) or Dr. Jessica Li (jli@stat.ucla.edu).

Installation

The package is not on CRAN yet. For installation please use the following codes in R

install.packages("devtools")
library(devtools)

install_github("Vivianstats/MSIQ")

Usage

The package needs two types of input: GTF file and BAM files.

GTF file

MSIQ is an annotation-based method, so the users need to supply a GTF file from which MSIQ extracts the gene structures.

BAM files

MSIQ starts from alignment file in bam format. Each bam file represents one RNA-seq sample. The bam files need to be sorted and indexed, and the index files should be in the same folder as their corresponding bam files. Currently, the package only supports paired-end reads.

Example

Suppose the genes of interest are specified in the GTF file named as "example.gtf". We would like to estimate the transcript expression of these genes based on 6 RNA-seq samples. The bam files are named as "sample1.bam", ... , "sample4.bam". Suppose all the files are in the working directory, then we can use the following example codes to run the program

library(MSIQ)
D = 4 # number of samples
gtf_path = "example.gtf"
bam_path = c("sample1.bam", "sample2.bam", "sample3.bam", "sample4.bam")
ncores = 5
result = msiq(D, gtf_path, bam_path, ncores)

MSIQ summarizes the estimation results in MSIQ_result/transcript_summary.txt. The text file includes the following the columns:

Below is an example output for one gene:

transcriptID geneID frac FPKM_1 FPKM_2 FPKM_3 FPKM_4 sample1 sample2 sample3 sample4
ENST00000464356 ENSG00000116604 0.0996 0.0365 0.0549 0.016 0.0124 1 1 1 0
ENST00000348159 ENSG00000116604 0.2601 0.0892 0.1342 0.0391 0.0304 1 1 1 0
ENST00000360595 ENSG00000116604 0.629 0.4154 0.6249 0.1819 0.1417 1 1 1 0
ENST00000368240 ENSG00000116604 0.0018 0.0021 0.0031 9e-04 7e-04 1 1 1 0
ENST00000475587 ENSG00000116604 0.0032 0.0044 0.0067 0.0019 0.0015 1 1 1 0
ENST00000493077 ENSG00000116604 4e-04 0.0024 0.0036 0.0011 8e-04 1 1 1 0
ENST00000489057 ENSG00000116604 0.0059 0.0297 0.0447 0.013 0.0101 1 1 1 0


Vivianstats/MSIQ documentation built on May 10, 2019, 5:17 a.m.