In wuxi-nextcode/topR: Create custom plots for viewing genetic association results using ggplot

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "120%"
)

topr

See full documentation at https://wuxi-nextcode.github.io/topr/

Installing from github using devtools

devtools::install_github("wuxi-nextcode/topr")

Example

In this example we demonstrate the basic usage of the topr library.

Load packages

First load the topr package, the tidyverse package is recommended in general, but not required for this example

library(topr)
library(tidyverse)
library(ggrepel)

Loading and exploring prebuilt datasets

Load the gwas_CD dataset, which is a subset of association results (SNPs with P<1e-03) for Crohn´s disease from the UK biobank.

It is highly recommended to theck the number of datapoints in your dataset before you plot, since a very large dataset will take a long time to plot.

paste("Number of SNPs in the dataset: [", length(CD_UKBB$POS),"]", sep = "")

Manhattan plots

Get an overview of association results for crohn's disease (CD) in a Manhattan plot

manhattan(CD_UKBB)

QQ plots

qqtopr(CD_UKBB,n_variants=length(CD_UKBB$POS))

Label the top SNPs with the name of their nearest gene

Use the annotate argument in the manhattan function to label the top SNPs with p-values below the annotate threshold with their nearest gene

manhattan(CD_UKBB, annotate = 1e-09)

Highlight genes of interest

manhattan(CD_UKBB, annotate = 1e-09, highlight_genes = c("NOD2","IL23R","JAK2"))

View one chromsome only

Take a closer look at the results by chromosome. Here we plot the results on chromosome 7 only.

manhattan(CD_UKBB, annotate = 1e-09, chr = "7")

Regionplot

Zoom in further on the chromosome plot with the regionplot function.

Zoom in on a gene of interest, e.g IKZF1:

regionplot(CD_UKBB, gene="IKZF1", annotate= 1e-09)

Zoom in on the top hit on a chromosome

CHR <- "chr1"
top_hit <- get_top_hit(CD_UKBB,chr=CHR)
regionplot(CD_UKBB, chr = CHR, xmin=top_hit$POS-250000 ,xmax= top_hit$POS+250000)

Display multiple phenotypes/datasets on the same plot

Display the output from more than one GWAS on the same plot

Manhattan multiple phenotypes

manhattan(list(CD_UKBB,CD_FINNGEN,UC_UKBB),legend_labels = c("CD UKBB","CD Finngen","UC UKBB"),title="IBD")

regionplot(list(CD_UKBB,CD_FINNGEN),gene="NOD2", annotate=c(1e-15, 1e-09), legend_labels = c("UKBB", "Finngen"), title="Crohn's disease (CD)")

manhattan(list(CD_UKBB,CD_FINNGEN,UC_UKBB),legend_labels = c("CD UKBB","CD Finngen","UC UKBB"),title="IBD")

The ntop argument

Use the ntop argument to set the number of datasets displayed at the top (default value is 3)

manhattan(list(CD_UKBB,CD_FINNGEN,UC_UKBB),ntop=2,legend_labels = c("CD UKBB","CD Finngen","UC UKBB"),title="IBD")

Useful functions

get_top_hit(CD_UKBB, chr="chr16")
dat1 <- get_best_snp_per_MB(CD_UKBB,thresh = 1e-07, region=1000000)
#get overlapping SNPS overlapping in two datasets
overlapping_snps <- dat1 %>% get_overlapping_snps_by_pos(CD_FINNGEN)

overlapping_snps_matched <- overlapping_snps %>% match_alleles()
overl_snps_matched_pos_allele_dat1 <- overlapping_snps_matched %>% flip_to_positive_allele_for_dat1()
snpset1 <- overl_snps_matched_pos_allele_dat1 %>% annotate_with_nearest_gene(protein_coding_only = T)

#or do all this in one go, by calling the create_snpset functon
snpset1 <- create_snpset(CD_FINNGEN, CD_UKBB, thresh = 1e-06)
snpset2 <- create_snpset(CD_UKBB, CD_FINNGEN, thresh= 1e-06)

e1 <- effect_plot(snpset1, pheno_x="CD Finngen", pheno_y="CD UKBB",color=get_topr_colors()[1], gene_label_thresh = 1)
e2 <- effect_plot(snpset2, pheno_x="CD UKBB", pheno_y="CD Finngen", color=get_topr_colors()[2], gene_label_thresh=1,annotate_with = "ID")
grid.arrange(e1,e2)

Plot apperance: setting text sizes

manhattan(list(CD_UKBB,CD_FINNGEN), annotate=1e-09,axis_title_size = 20,axis_text_size = 16,label_size = 5, title_text_size = 16, legend_text_size = 20)

regionplot(list(CD_UKBB,CD_FINNGEN), gene="IKZF1", vline=50274703,title="CD UKBB", title_text_size = 16,axis_title_size = 20,axis_text_size = 20,legend_text_size = 20)

Setting alpha, size and shape

wuxi-nextcode/topR documentation built on Dec. 23, 2021, 6:13 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

wuxi-nextcode/topR
Create custom plots for viewing genetic association results using ggplot

In wuxi-nextcode/topR: Create custom plots for viewing genetic association results using ggplot

topr

Installing from github using devtools

Example

Load packages

Loading and exploring prebuilt datasets

Manhattan plots

QQ plots

Label the top SNPs with the name of their nearest gene

Highlight genes of interest

View one chromsome only

Regionplot

Display multiple phenotypes/datasets on the same plot

Manhattan multiple phenotypes

The ntop argument

Useful functions

Plot apperance: setting text sizes

R Package Documentation

Browse R Packages

We want your feedback!

wuxi-nextcode/topR Create custom plots for viewing genetic association results using ggplot

In wuxi-nextcode/topR: Create custom plots for viewing genetic association results using ggplot

topr

Installing from github using devtools

Example

Load packages

Loading and exploring prebuilt datasets

Manhattan plots

QQ plots

Label the top SNPs with the name of their nearest gene

Highlight genes of interest

View one chromsome only

Regionplot

Display multiple phenotypes/datasets on the same plot

Manhattan multiple phenotypes

The ntop argument

Useful functions

Plot apperance: setting text sizes

R Package Documentation

Browse R Packages

We want your feedback!

wuxi-nextcode/topR
Create custom plots for viewing genetic association results using ggplot