RBigData/pbdSHAQ: Tools for Tall Distributed Matrices

Many data science problems reduce to operations on very tall, skinny matrices. These matrices can be so tall that they are difficult to work with, or do not fit into main memory at all. One strategy for dealing with such objects is to distribute their rows across several processors. To this end, we offer an 'S4' class for tall, skinny, distributed matrices, called the 'shaq'. We also provide many useful numerical methods and statistical operations for working with these distributed objects. The naming is a bit tongue-in-cheek: the class name is a play on the fact that Shaquille O'Neal ('Shaq') is very tall, and he starred in the film 'Kazaam'.
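To illustrate why distributing rows works well for tall, skinny matrices, here is a minimal conceptual sketch in base R. It does not use this package's API or MPI. The "ranks" are simulated with a list inside a single R session, and the names `nranks`, `local_blocks`, `local_cp`, and `global_cp` are hypothetical, chosen only for this example. The idea is that each process can compute a small cross product from its own rows, and summing those small results (the role a sum-reduction would play across real processes) gives the cross product of the full matrix.

```r
# Conceptual sketch only: base R, single process, no package API or MPI.
set.seed(1234)

m <- 1000    # number of rows (tall)
n <- 3       # number of columns (skinny)
nranks <- 4  # hypothetical number of processes; m is divisible by nranks

x <- matrix(rnorm(m * n), nrow = m, ncol = n)

# Assign contiguous blocks of rows to each simulated rank.
row_owner <- rep(seq_len(nranks), each = m / nranks)
local_blocks <- lapply(split(seq_len(m), row_owner),
                       function(idx) x[idx, , drop = FALSE])

# Each simulated rank computes an n x n cross product from its own rows only.
local_cp <- lapply(local_blocks, crossprod)

# Summing the local results stands in for a sum-reduction across ranks.
global_cp <- Reduce(`+`, local_cp)

# The combined result agrees with the cross product of the full matrix.
stopifnot(isTRUE(all.equal(global_cp, crossprod(x))))
```

Only the small n x n matrices need to be combined, so the amount of data exchanged depends on the number of columns rather than the number of rows.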

Package details

Maintainer: Drew Schmidt <wrathematics@gmail.com>
License: BSD 2-clause License + file LICENSE
Version: 0.2-0
URL: https://pbdr.org/
Package repository: RBigData/pbdSHAQ on GitHub

Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("RBigData/pbdSHAQ")
RBigData/pbdSHAQ documentation built on Nov. 9, 2021, 9:10 a.m.