subsampling: Optimal Subsampling Methods for Statistical Models

Balancing computational and statistical efficiency, subsampling techniques offer a practical solution for handling large-scale data analysis. Subsampling methods enhance statistical modeling for massive datasets by efficiently drawing representative subsamples from full dataset based on tailored sampling probabilities. These probabilities are optimized for specific goals, such as minimizing the variance of coefficient estimates or reducing prediction error. Based on specified modeling assumptions and subsampling techniques, the package provides functions to draw subsamples from the full data, fit the model on the subsamples, and perform statistical inference.

Package details

AuthorQingkai Dong [aut, cre, cph], Yaqiong Yao [aut], Haiying Wang [aut], Qiang Zhang [ctb], Jun Yan [ctb]
MaintainerQingkai Dong <qingkai.dong@uconn.edu>
LicenseGPL-3
Version0.3.0
URL https://github.com/dqksnow/subsampling https://dqksnow.github.io/subsampling/
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("subsampling")

Try the subsampling package in your browser

Any scripts or data that you put into this service are public.

subsampling documentation built on March 11, 2026, 1:06 a.m.