doppelgangR: Identify likely duplicate samples from genomic or meta-data

The main function is doppelgangR(), which takes as its minimal input a list of ExpressionSet objects and searches all pairs of list elements for duplicated samples. The search is based on the genomic data (exprs(eset)), the phenotype/clinical data (pData(eset)), and "smoking guns": supposedly unique identifiers found in pData(eset).
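As an illustration of the minimal input described above, the following sketch simulates two small datasets in which one sample of the second is a noisy copy of a sample in the first, wraps them in ExpressionSet objects, and passes the list to doppelgangR(). The simulated data and the names expr_a, expr_b, esets, and res are made up for this example. The arguments phenoFinder.args = NULL (skip the phenotype comparison, since these objects carry no pData) and cache.dir = NULL (do not cache intermediate results) are assumptions based on the package's documented usage and have not been checked against a specific version; consult ?doppelgangR before relying on them.

```r
library(Biobase)      # provides ExpressionSet() and exprs()
library(doppelgangR)

set.seed(1)
n_genes <- 100
gene_ids <- paste0("gene", seq_len(n_genes))

# Dataset "a": 10 simulated samples (columns) measured on 100 genes (rows).
expr_a <- matrix(rnorm(n_genes * 10), nrow = n_genes,
                 dimnames = list(gene_ids, paste0("a", 1:10)))

# Dataset "b": 9 independent samples plus "b10", a noisy copy of sample "a10".
expr_b <- matrix(rnorm(n_genes * 9), nrow = n_genes,
                 dimnames = list(gene_ids, paste0("b", 1:9)))
expr_b <- cbind(expr_b, b10 = expr_a[, "a10"] + rnorm(n_genes, sd = 0.25))

# Minimal input: a named list of ExpressionSet objects.
esets <- list(a = ExpressionSet(assayData = expr_a),
              b = ExpressionSet(assayData = expr_b))

# Assumed arguments (see the note above): no phenotype data to compare,
# and no on-disk cache.
res <- doppelgangR(esets, phenoFinder.args = NULL, cache.dir = NULL)

# Summarize the sample pairs flagged as likely duplicates.
summary(res)
```

If the simulation behaves as intended, the pair a10/b10 should stand out from the other pairs, because the two profiles are almost perfectly correlated while all other samples are independent.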

Package details

Author: Levi Waldron [aut, cre], Markus Reister [aut, ctb], Marcel Ramos [ctb]
Bioconductor views: GeneExpression, ImmunoOncology, Microarray, QualityControl, RNASeq
Maintainer: Levi Waldron <lwaldron.research@gmail.com>
License: GPL (>=2.0)
Version: 1.18.0
URL: https://github.com/lwaldron/doppelgangR
Package repository: Bioconductor

Installation

Install the latest version of this package by entering the following in R:
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("doppelgangR")

doppelgangR documentation built on Nov. 8, 2020, 6:36 p.m.