View source: R/plot_signif_distribs.r
Plot the distributions of case and control variant residues across multiple genes
plot_signif_distribs(
  pvals,
  cases,
  controls,
  case_coverage,
  control_coverage,
  test = "BIN-test",
  cov_threshold = 0.5,
  n_genes = 20,
  SCALE = 1
)
pvals: data.table of p-values from a ClusterBurden WES analysis, with at least a symbol column and a p-value column for the test of interest
cases: case data in the format data.table(aff, symbol, protein_position, ac)
controls: control data in the format data.table(aff, symbol, protein_position, ac)
case_coverage: optional coverage data for cases in the format data.table(symbol, protein_position, over_10)
control_coverage: optional coverage data for controls in the format data.table(symbol, protein_position, over_10)
test: one of bin-test, burden or clusterburden (not case sensitive)
cov_threshold: coverage threshold below which a residue position is excluded from the analysis (choose 0 to keep all residues)
n_genes: number of genes to plot, e.g. n_genes = 5 plots the 5 most significant genes
SCALE: scale of the plot
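As a rough illustration of the argument formats above, the sketch below builds toy inputs with the documented column layouts and shows what n_genes and cov_threshold are described as doing. All values are invented. The p-value column name (pvalue_placeholder) is hypothetical, and the meanings given to aff, ac and over_10 are assumptions based on the column names, not taken from the package. The final call is left commented out because it needs the package itself.

```r
library(data.table)

# Toy inputs in the documented layouts (all values invented).
# Assumed meanings: aff = 1 for cases and 0 for controls, ac = allele count,
# over_10 = fraction of samples covered at more than 10x.
cases <- data.table(
  aff = 1L,
  symbol = c("GENE1", "GENE1", "GENE2"),
  protein_position = c(120L, 125L, 40L),
  ac = c(1L, 2L, 1L)
)
controls <- data.table(
  aff = 0L,
  symbol = c("GENE1", "GENE2", "GENE2"),
  protein_position = c(300L, 45L, 410L),
  ac = c(1L, 1L, 3L)
)
case_coverage <- data.table(
  symbol = c("GENE1", "GENE1", "GENE2"),
  protein_position = c(120L, 125L, 40L),
  over_10 = c(0.95, 0.30, 0.88)
)
control_coverage <- data.table(
  symbol = c("GENE1", "GENE2", "GENE2"),
  protein_position = c(300L, 45L, 410L),
  over_10 = c(0.91, 0.97, 0.85)
)

# The p-value column name is a placeholder, not the package's real column name.
pvals <- data.table(
  symbol = c("GENE1", "GENE2"),
  pvalue_placeholder = c(1e-6, 0.03)
)

# n_genes: keep the most significant genes (smallest p-values first).
n_genes <- 1
top_genes <- head(pvals[order(pvalue_placeholder)], n_genes)$symbol

# cov_threshold: drop residue positions whose coverage is below the threshold,
# so a threshold of 0 keeps every position.
cov_threshold <- 0.5
well_covered <- case_coverage[over_10 >= cov_threshold, .(symbol, protein_position)]
cases_kept <- merge(cases, well_covered, by = c("symbol", "protein_position"))

# With the package loaded, the call would then look something like:
# p <- plot_signif_distribs(pvals, cases, controls,
#                           case_coverage, control_coverage,
#                           test = "BIN-test", cov_threshold = 0.5, n_genes = 1)
```

Here the case variant at position 125 of GENE1 is dropped because its coverage (0.30) is below the 0.5 threshold. The function may apply these steps differently internally.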
For each of the n_genes most significant genes, the function produces a strip chart (using geom_jitter) for both cases and controls, plus a Gaussian density line for each cohort. If coverage files are provided, they are drawn as rugs underneath and above the case and control variants, colored by coverage level (< cov_threshold, < 80
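The kind of plot described above can be approximated for a single gene with plain ggplot2. This is a minimal sketch on simulated positions, not the function's actual implementation: the data, the vertical offsets of the two jitter bands and the labels are all assumptions chosen for illustration, and the coverage rugs are omitted.

```r
library(ggplot2)

set.seed(1)
# Simulated variant positions for one gene (not data from the package):
# cases cluster around residue 150, controls are spread along the protein.
variants <- data.frame(
  cohort = rep(c("case", "control"), times = c(30, 40)),
  protein_position = c(
    round(rnorm(30, mean = 150, sd = 15)),
    round(runif(40, min = 1, max = 400))
  )
)

p <- ggplot(variants, aes(x = protein_position, colour = cohort)) +
  # One density line per cohort; geom_density uses a Gaussian kernel by default.
  geom_density() +
  # Strip chart: each cohort gets its own band below the density curves,
  # with vertical jitter only so positions stay exact on the x axis.
  geom_jitter(
    aes(y = ifelse(cohort == "case", -0.002, -0.006)),
    height = 0.001, width = 0
  ) +
  labs(x = "Protein position", y = "Density", title = "GENE1 (toy data)")

# print(p) draws the figure in an interactive session.
```

Coverage rugs could be layered on in the same way with geom_rug and a separate coverage table.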
Returns a ggplot object
Adam Waring - adam.waring@msdtc.ox.ac.uk