PPCDT: An Optimal Subset Selection for Distributed Hypothesis Testing

In the era of big data, redundant and distributed data pose new challenges for data analysis. This package provides a method for estimating optimal subsets of redundant distributed data, based on PPCDT (Conjunction of Power and P-value in Distributed Settings). Using the PPC approach, the method extracts the informative part of redundant distributed data and determines the optimal subset. According to the authors' experimental results, the method improves data quality and utilization efficiency, and its performance can be assessed effectively. The philosophy of the package is described in Guo G. (2020) <doi:10.1007/s00180-020-00974-4>.

Package details

Author: Guangbao Guo [aut, cre, cph], Jiarui Li [ctb]
Maintainer: Guangbao Guo <ggb11111111@163.com>
License: Apache License (== 2.0)
Version: 0.2.0
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("PPCDT")

PPCDT documentation built on Sept. 11, 2024, 9:24 p.m.