knitr::opts_chunk$set( collapse = TRUE, comment = "#>" )
sparklyr.sedona is a sparklyr-based R interface for Apache Sedona. It presents what Apache Sedona has to offer through idiomatic frameworks and constructs in R (e.g., one can build spatial Spark SQL queries using Sedona UDFs in conjunction with a wide range of dplyr expressions), hence making Apache Sedona highly friendly for R users.
Generally speaking, when working with Apache Sedona, one choose between the following two modes:
While the former option enables more fine-grained control over low-level implementation details (e.g., which index to build for spatial queries, which data structure to use for spatial partitioning, etc), the latter is simpler and leads to a straightforward integration with dplyr
, sparklyr
, and other sparklyr
extensions (e.g., one can build ML feature extractors with Sedona UDFs and connect them with ML pipelines using ml_*()
family of functions in sparklyr
, hence creating ML workflows capable of understanding spatial data).
Because data from spatial RDDs can be imported into Spark dataframes as geometry columns and vice versa, one can switch between the abovementioned two modes fairly easily.
At the moment sparklyr.sedona
consists of the following components:
dplyr
-integration for Sedona spatial UDTs and UDFsAdd the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.