Meta-analysis has become a popular approach for high-throughput genomic data analysis because it often can significantly increase power to detect biological signals or patterns in datasets. However, when using public-available databases for meta-analysis, duplication of samples is an often encountered problem, especially for gene expression data. Not removing duplicates would make study results questionable. We developed a Bioconductor package DupChecker that efficiently identifies duplicated samples by generating MD5 fingerprints for raw data.
Package details |
|
---|---|
Author | Quanhu Sheng, Yu Shyr, Xi Chen |
Bioconductor views | Preprocessing |
Maintainer | "Quanhu SHENG" <shengqh@gmail.com> |
License | GPL (>= 2) |
Version | 1.25.0 |
Package repository | View on Bioconductor |
Installation |
Install the latest version of this package by entering the following in R:
|
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.