datadr: Divide and Recombine for Large, Complex Data

Methods for dividing data into subsets, applying analytical methods to the subsets, and recombining the results. The package also provides a generic MapReduce interface. It works with key-value pairs stored in memory, on local disk, or on HDFS; HDFS storage is accessed through the R and Hadoop Integrated Programming Environment (RHIPE).
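The divide, apply, recombine pattern described above can be illustrated with a minimal sketch in base R. This sketch deliberately does not use datadr's own functions; it only shows the general idea on the built-in `iris` data set, and the variable names (`subsets`, `results`, `combined`) are illustrative assumptions.

```r
# Divide: split the built-in iris data frame into one subset per species.
subsets <- split(iris, iris$Species)

# Apply: compute a summary for each subset independently
# (here, the mean sepal length).
results <- lapply(subsets, function(d) {
  data.frame(
    Species = as.character(d$Species[1]),
    mean_sepal_length = mean(d$Sepal.Length)
  )
})

# Recombine: bind the per-subset results into a single data frame.
combined <- do.call(rbind, results)
print(combined)
```

Because each subset is processed independently, the apply step could in principle be distributed across workers, which is the motivation for running this pattern on larger back ends such as local disk or HDFS.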

Package details

Author: Ryan Hafen [aut, cre], Landon Sego [ctb]
License: BSD_3_clause + file LICENSE
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
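The install command did not survive on this page. Assuming the package is still available from CRAN, as the repository entry above suggests, the standard R call would be:

```r
# Assumes datadr is currently available on CRAN.
install.packages("datadr")
```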


datadr documentation built on Aug. 19, 2018, 9:03 a.m.