doppelgangR: Identify likely duplicate samples from genomic or meta-data

The main function is doppelgangR(), which takes as minimal input a list of ExpressionSet objects and searches all pairs of list elements for duplicated samples. The search is based on the genomic data (exprs(eset)), phenotype/clinical data (pData(eset)), and "smoking guns": supposedly unique identifiers found in pData(eset).
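As a rough illustration of the call described above, the following sketch runs doppelgangR() on two ExpressionSet objects. It assumes the curatedOvarianData package is installed and uses two of its datasets purely as example input; the list names (JapaneseB, Yoshihara2010) are illustrative labels, and any list of ExpressionSet objects could be substituted.

```r
# Sketch only: assumes doppelgangR and curatedOvarianData are both installed.
library(doppelgangR)
library(curatedOvarianData)

# Two ExpressionSet objects from curatedOvarianData (illustrative choice of data).
data(GSE32063_eset)
data(GSE17260_eset)

# doppelgangR() takes a list of ExpressionSet objects; names label the datasets.
esets <- list(JapaneseB = GSE32063_eset,
              Yoshihara2010 = GSE17260_eset)

# cache.dir = NULL is used here so no intermediate results are cached on disk.
results <- doppelgangR(esets, cache.dir = NULL)

summary(results)  # summary method for the returned DoppelGang object
plot(results)     # plot method for the returned DoppelGang object
```

The remaining arguments are left at their defaults here, so all three kinds of evidence (expression, phenotype, and "smoking gun" identifiers) are considered; see the doppelgangR man page for how to adjust or disable each search.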

Install the latest version of this package by entering the following in R:
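The command originally shown at this point is not preserved. As a hedged stand-in, the usual way to install a released Bioconductor package is through BiocManager; this installs the current Bioconductor release of doppelgangR, which may differ from a development version hosted elsewhere.

```r
# Assumption: installing the released version from Bioconductor via BiocManager.
if (!requireNamespace("BiocManager", quietly = TRUE)) {
    install.packages("BiocManager")
}
BiocManager::install("doppelgangR")
```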
Author: Levi Waldron, Markus Riester, Marcel Ramos
Bioconductor views: GeneExpression, Microarray, QualityControl, RNASeq
Date of publication: None
Maintainer: Levi Waldron
License: GPL (>=2.0)

corFinder Man page
DoppelGang Man page
DoppelGang-class Man page
doppelgangR Man page
doppelgangR-package Man page
dst Man page
mst.mle Man page
outlierFinder Man page
phenoDist Man page
phenoFinder Man page
plot,DoppelGang Man page
plot.DoppelGang Man page
plot,DoppelGang,ANY-method Man page
plot,DoppelGang-method Man page
plot.doppelgangR Man page
plot-methods Man page
print,DoppelGang-method Man page
print-methods Man page
pst Man page
qst Man page
rst Man page
show,DoppelGang-method Man page
show-methods Man page
smokingGunFinder Man page
st.mle Man page
summary,DoppelGang-method Man page
summary-methods Man page
vectorHammingDist Man page
vectorWeightedDist Man page
