rstudio/sparklyr: R Interface to Apache Spark

R interface to Apache Spark, a fast and general engine for big data processing, see <>. This package supports connecting to local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end, and provides an interface to Spark's built-in machine learning algorithms.

Getting started

Package details

MaintainerEdgar Ruiz <>
LicenseApache License 2.0 | file LICENSE
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
rstudio/sparklyr documentation built on Sept. 18, 2023, 10:31 p.m.