A collection of utilities for reading subsamples of flat text files by line in a reasonably efficient manner. We do so by sampling as the input file is scanned and randomly choosing whether or not to dump the current line to an external temporary file. This temporary file is then read back into R. For (aggressive) 'downsampling', this is a very effective strategy; for resampling, you are much better off reading the full dataset into memory.
Package details |
|
---|---|
Maintainer | Drew Schmidt <wrathematics@gmail.com> |
License | BSD 2-clause License + file LICENSE |
Version | 0.4-0 |
URL | https://github.com/wrathematics/filesampler |
Package repository | View on GitHub |
Installation |
Install the latest version of this package by entering the following in R:
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.