rstudio/sparklyr: R Interface to Apache Spark

sparklyr is an R interface to Apache Spark (<https://spark.apache.org/>), a fast and general engine for big data processing. The package supports connecting to local and remote Apache Spark clusters, provides a 'dplyr'-compatible back-end, and exposes Spark's built-in machine learning algorithms.
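
The three capabilities named above (connecting, the 'dplyr' back-end, and machine learning) can be sketched in a few lines. This is a minimal illustration, not an excerpt from the package documentation. It assumes a local Spark installation is available (running `spark_install()` once can download one), and the choice of the built-in `mtcars` data set, the table name, and the model formula are illustrative assumptions only.

```r
library(sparklyr)
library(dplyr)

# Connect to a local Spark instance (assumes Spark is installed locally;
# a remote cluster would use a different `master` value).
sc <- spark_connect(master = "local")

# Copy a built-in R data frame into Spark (illustrative data choice).
mtcars_tbl <- copy_to(sc, mtcars, "mtcars", overwrite = TRUE)

# dplyr verbs are translated to Spark SQL and executed in Spark;
# collect() brings the small summarised result back into R.
mpg_by_cyl <- mtcars_tbl %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg, na.rm = TRUE)) %>%
  collect()

# Fit a model with one of Spark's built-in ML algorithms.
fit <- ml_linear_regression(mtcars_tbl, mpg ~ wt + cyl)

# Release the connection when finished.
spark_disconnect(sc)
```

Here `mpg_by_cyl` is an ordinary local tibble, while `mtcars_tbl` is only a reference to data held in Spark, so the grouping and model fitting run in Spark rather than in the R session.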


Package details

Maintainer: Edgar Ruiz <edgar@rstudio.com>
License: Apache License 2.0 | file LICENSE
Version: 1.8.5
URL: https://spark.posit.co/
Package repository: rstudio/sparklyr on GitHub
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("rstudio/sparklyr")
rstudio/sparklyr documentation built on March 29, 2024, 3:30 p.m.