Modeling and visualization of dynamics of alternative polyadenylation across biological samples
Alternative polyadenylation (APA) has been widely recognized as a widespread mechanism modulated dynamically. Studies based on 3′ end sequencing and/or RNA-seq have profiled poly(A) sites in various species with diverse pipelines, yet no unified and easy-to-use toolkit is available for comprehensive APA analyses. We developed an R package called movAPA for modeling and visualization of dynamics of alternative polyadenylation across biological samples. movAPA incorporates rich functions for preprocessing, annotation, and statistical analyses of poly(A) sites, identification of poly(A) signals, profiling of APA dynamics, and visualization. Particularly, seven metrics are provided for measuring the tissue-specificity or usages of APA sites across samples. Three methods are used for identifying 3′ UTR shortening/lengthening events between conditions. APA site switching involving non-3′ UTR polyadenylation can also be explored. Using poly(A) site data from rice and mouse sperm cells, we demonstrated the high scalability and flexibility of movAPA in profiling APA dynamics across tissues and single cells.
A. Poly(A) sites of biological samples obtained from 3’ seq or RNA-seq are stored in the PACdataset object and further preprocessed for the removal of internal priming artifacts and normalization.
B. The genome annotation file in GFF3/GTF format is processed, then poly(A) sites are annotated with rich information such as gene id, gene type, and genomic regions.
C. Statistical analyses can be conducted to profile the global landscape of poly(A) site distributions and count the overlap with other poly(A) site datasets.
D. Sequences surrounding poly(A) sites can be extracted and poly(A) signals of specific regions can be identified.
E. Three metrics can be adopted for the quantification of the usage of each poly(A) site across samples and four metrics are used for the quantification of dynamic APA site usage of a gene.
F. APA dynamics across biological samples can be profiled, including the detection of differentially expressed poly(A) sites and genes, 3′ UTR lengthening/shortening events, and canonical or non-canonical APA site switching events.
G. Rich functions are provided for the visualization of poly(A) site distributions and dynamic APA site usages across selected biological samples or single cells.
install.packages("devtools")
require(devtools)
install_github("BMILAB/movAPA")
library(movAPA)
browseVignettes('movAPA')
##or you can download ZIP, and then unzip
devtools::install_local("your_path_of_movAPA-master.zip", build_vignettes = TRUE)
This documentation describes how to read an external file of poly(A) sites and analyze it with movAPA. We used the model species – Arabidopsis for demonstration. Please refer to the vignette (PDF, HTML) for full details.
In this case study, we investigated the application of movAPA on a poly(A) site dataset of multiple tissues in Oryza sativa japonica from 3’ end sequencing. Please refer to the vignette (PDF, HTML) for full details. The data in the vignette can be downloaded from the refer branch-"demo_data".
## You can also browse the vignette using the following command on the R console
vignette("movAPA_on_rice_tissues", package = "movAPA")
movAPA is highly scalable and flexible, which can be used for profiling APA dynamics in single cells. Here we investigated the application of movAPA on poly(A) sites from mouse sperm cells. Poly(A) sites from three stages of differentiation process were obtained from the previous study (Shulman and Elkon, 2019), including early stage (spermatocytes, SC), intermediate stage (round spermatids, RS), and late stage (elongating spermatids, ES). Please refer to the vignette (PDF, HTML) for full details.
## You can also browse the vignette using the following command on the R console
vignette("movAPA_on_mouse_sperm_cells", package = "movAPA")
This documentation describes how to read an external file of single-cell poly(A) sites generated by scAPAtrap and analyze it with movAPA. Please refer to the vignette (PDF, HTML) for full details.
## You can also browse the vignette using the following command on the R console
vignette("movAPA_on_scAPAtrap_results", package = "movAPA")
If you are using movAPA, please cite: Wenbin Ye#, Tao Liu#, Hongjuan Fu, Congting Ye, Guoli Ji, and Xiaohui Wu: movAPA: Modeling and visualization of dynamics of alternative polyadenylation across biological samples, Bioinformatics, 2020.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.