DENTIST (Detecting Errors iN summary staTISTics) was designed to evaluate and quality control summary statistics from association studies. Leveraging the LD information from an independent reference sample, DENTIST can distinguish the SNPs with inaccurate statistics in summary data, which are caused by poor genotyping/imputation as well as other artefacts occurred during an association study. As is assessed in our paper, a few of the frequently-used summary-based analysis, such as the meta-analysis, GCTA-COJO, SMR and LD score regression [PubMed ID: 25642630], can be biased to different degrees and improved with DENTIST-based QC.
Latest release v0.6.0 (22 Mar 2019)
The method was designed by Wenhan Chen, Zhihong Zhu and Jian Yang. The software was implemented and has been maintained by Wenhan Chen. The idea of the LD consistency test was originated from a previous study [PubMed ID: 24990607]. Webpage supports are from Zhili Zheng.
If you have any bug reports or questions please send an email to Wenhan Chen or Jian Yang.
DENTIST In preparation.
Last update: 22 Mar 2019
The executable file below is compiled with "-static", and tested on 64-bit Linux distributions on the x86_64 CPU platform.
Linux DENTIST_0.5.0.zip
The executable file is released under the MIT license.
Note: The source code of DENTIST is still under frequent changes. It is now only obtainable by an email of request.
--bfile test Reads individual-level genotype data in PLINK bed format, e.g. test.fam, test.bim and test.bed.
--gwas-summary summary.txt Reads GWAS summary data in in GCTA-COJO format.
Format of the GCTA-COJO file, summary.txt,
SNP A1 A2 freq beta se p N
rs131538 A G 0.05 0.007 0.02 0.7 6000
rs140378 C G 0.05 0.007 0.02 0.7 6000
...
--out tmp Specifies output file prefix. In this case, it outputs "tmp.qc.DENTIST.txt" in the format of "rsID zscore1 zscore2 mRsq ifDup" as follows,
rs131538 0.07 -0.01 0.70 0
rs140378 0.07 -0.01 0.70 0
...
--target rs101 Is trailed by an rsID to specify a region of 20Mb of interest centered at position specified by rsID. The rsID should exist in the bfile. A warning is raised if the target rsID is not found. Notably, the identification of this rsID in the bfile is preformed before --maf flag.
--wind 4000 Is trailed by the number of markers for the size of a sliding window. The default value is 4000 markers. This is the default option unless overruled by --wind-dist.
--wind-dist 2000000 Is trailed by sliding window size measured in the number of BP. The default value is 2000000 BP. This flag overrules --wind.
--thread-num 4 Specifies the number of threads for parallel computing, given the tools is powered by OpenMP. The default value is 1.
--num-iterations 10 Specifies the number of iterations for LD consistency test (Method). The default value is 10.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.