Description Usage Arguments Details Value
View source: R/file_sample_prob.r
Randomly sample lines from an input text file.
1 2 3 4 5 6 7 8 9 | file_sample_prop(
p,
infile,
outfile = tempfile(),
header = TRUE,
nskip = 0,
nmax = 0,
verbose = FALSE
)
|
p |
Proportion to retain; should be a numeric value between 0 and 1. |
infile |
Location of the file (as a string) to be subsampled. |
outfile |
Output file location (as a string). |
header |
Is a header (line of column names) on the first line of the csv file? |
nskip |
Number of lines to skip. If |
nmax |
Max number of lines to read. If |
verbose |
Should linecounts of the input file and the number of lines sampled be printed? |
The sampling is done in one pass of the input file, dumping lines to a temporary file as the input is read.
If the output file (the one pointed to by the return of this function) is "large" and to be read into memory (which isn't really appropriate for text files in the first place!), then this strategy is probably not appropriate.
NULL
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.