carbocation/aberrant: A robust clustering algorithm for outlier identification

High-throughput genotyping arrays provide an efficient way to survey single nucleotide polymorphisms (SNPs) across the genome in large numbers of individuals. Downstream analysis of the data, for example in genome-wide association studies (GWAS), often involves statistical models of genotype frequencies across individuals. The complexities of the sample collection process and the potential for errors in the experimental assay can lead to biases and artefacts in an individual's inferred genotypes. Rather than attempting to model these complications, it has become standard practice to remove individuals whose genome-wide data differs from the sample at large. This package contains a simple, but robust, clustering algorithm to identify samples with atypical summaries of genome-wide variation.

Getting started

Package details

AuthorCeline Bellenguez and Chris CA Spencer
MaintainerChris CA Spencer <chris.spencer@well.ox.ac.uk>
LicenseGPL (>= 2)
Version1.0
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("carbocation/aberrant")
carbocation/aberrant documentation built on May 15, 2020, 6:04 p.m.