r-spark/sparkhail: A 'Sparklyr' Extension for 'Hail'

'Hail' is an open-source, general-purpose, 'python' based data analysis tool with additional data types and methods for working with genomic data, see <https://hail.is/>. 'Hail' is built to scale and has first-class support for multi-dimensional structured data, like the genomic data in a genome-wide association study (GWAS). 'Hail' is exposed as a 'python' library, using primitives for distributed queries and linear algebra implemented in 'scala', 'spark', and increasingly 'C++'. The 'sparkhail' is an R extension using 'sparklyr' package. The idea is to help R users to use 'hail' functionalities with the well-know 'tidyverse' syntax, see <https://www.tidyverse.org/>.

Getting started

Package details

MaintainerSamuel MacĂȘdo <samuelmacedo@recife.ifpe.edu.br>
LicenseApache License 2.0 | file LICENSE
Version0.1.1
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("r-spark/sparkhail")
r-spark/sparkhail documentation built on Jan. 23, 2020, 2:28 p.m.