sparklyr: R Interface to Apache Spark

sparklyr is an R interface to Apache Spark, a fast and general engine for big data processing. The package supports connecting to local and remote Apache Spark clusters, provides a 'dplyr'-compatible back end, and provides an interface to Spark's built-in machine learning algorithms.

Getting started
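
The three capabilities described above (connecting to a cluster, the 'dplyr' back end, and the machine learning interface) can be sketched roughly as follows. This is a minimal illustration, not taken from the package documentation: it assumes sparklyr and dplyr are installed, that a local Spark build is available (spark_install() can download one), and it uses the built-in mtcars data set and the table name "mtcars" purely as examples.

```r
# Sketch only: assumes sparklyr, dplyr, and a local Spark installation.
library(sparklyr)
library(dplyr)

# spark_install()  # uncomment on first use to download a local Spark build

# Connect to a local Spark instance; a remote cluster would need a
# different master URL and configuration.
sc <- spark_connect(master = "local")

# Copy an example R data frame into Spark (table name is illustrative).
mtcars_tbl <- copy_to(sc, mtcars, "mtcars", overwrite = TRUE)

# Use dplyr verbs; they are translated to Spark SQL and run in Spark.
mtcars_tbl %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg, na.rm = TRUE)) %>%
  collect()

# Fit a model with one of Spark's built-in machine learning algorithms.
fit <- ml_linear_regression(mtcars_tbl, mpg ~ wt + cyl)
summary(fit)

# Close the connection when finished.
spark_disconnect(sc)
```

Calling collect() brings the aggregated result back into the R session; without it the query stays lazy and is only evaluated in Spark when printed or collected.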

Package details

Author: Javier Luraschi [aut], Kevin Kuo [aut], Kevin Ushey [aut], JJ Allaire [aut], Samuel Macedo [ctb], Hossein Falaki [aut], Lu Wang [aut], Andy Zhang [aut], Yitao Li [aut], Jozef Hajnala [ctb], Maciej Szymkiewicz [ctb], Wil Davis [ctb], Edgar Ruiz [aut, cre], RStudio [cph], The Apache Software Foundation [aut, cph]
Maintainer: Edgar Ruiz
License: Apache License 2.0 | file LICENSE
Package repository: CRAN
Installation: Install the latest version of this package from CRAN with R's install.packages() function.
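
A minimal sketch of the installation step, assuming a CRAN mirror is reachable from the R session:

```r
# Install the latest released version of sparklyr from CRAN.
install.packages("sparklyr")

# Load the package to confirm the installation succeeded.
library(sparklyr)
```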


sparklyr documentation built on Nov. 2, 2023, 5:09 p.m.