gdsfmt: R Interface to CoreArray Genomic Data Structure (GDS) files

This package provides a high-level R interface to CoreArray Genomic Data Structure (GDS) data files. GDS files are portable across platforms and use a hierarchical structure to store multiple scalable, array-oriented data sets together with their metadata. The format is suited to large-scale datasets, especially data much larger than the available random-access memory. The gdsfmt package offers efficient operations specifically designed for integers of fewer than 8 bits, since a single genetic/genomic variant, such as a single-nucleotide polymorphism, usually occupies less than a byte. Data compression and decompression are also supported, with relatively efficient random access. A GDS file can be read in parallel by multiple R processes, as supported by the parallel package.
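As a rough illustration of the sub-byte storage and compressed random access described above, the following sketch writes a small genotype-like matrix using 2-bit integers and reads back one block of it. The file name, node name, attribute text, and matrix dimensions are assumptions chosen for the example, and the calls reflect the gdsfmt API as commonly documented, so argument names may differ between versions.

```r
library(gdsfmt)

# Hypothetical file in a temporary directory (name chosen for illustration)
gds_path <- file.path(tempdir(), "example.gds")

# Toy data: 1000 x 50 matrix with values 0..3, so each value fits in 2 bits
set.seed(1)
geno <- matrix(sample(0:3, 1000 * 50, replace = TRUE), nrow = 1000)

# Create the file and store the matrix as 2-bit integers with zlib compression
f <- createfn.gds(gds_path)
node <- add.gdsn(f, "genotype", val = geno, storage = "bit2",
                 compress = "ZIP", closezip = TRUE)

# Attach a piece of metadata to the node
put.attr.gdsn(node, "description", "toy genotype matrix")
closefn.gds(f)

# Reopen read-only and read a 10 x 5 block without loading the whole array
f <- openfn.gds(gds_path)
node <- index.gdsn(f, "genotype")
block <- read.gdsn(node, start = c(1, 1), count = c(10, 5))
node_attrs <- get.attr.gdsn(node)
closefn.gds(f)
```

Here `storage = "bit2"` packs four values into each byte before compression, and the `start` and `count` arguments of `read.gdsn` select a sub-array, which is how a dataset larger than memory would typically be processed piece by piece.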

Package details

Author: Xiuwen Zheng [aut, cre], Jean-loup Gailly [ctb] (zlib library), Mark Adler [ctb] (zlib library), Yann Collet [ctb] (LZ4 library)
Maintainer: Xiuwen Zheng <zhengx@u.washington.edu>
License: LGPL-3
Version: 1.1.3
URL: http://corearray.sourceforge.net/ and http://github.com/zhengxwen/gdsfmt
Package repository: R-Forge
Installation: Install the latest version of this package by entering the following in R:
install.packages("gdsfmt", repos="http://R-Forge.R-project.org")


gdsfmt documentation built on May 2, 2019, 4:41 p.m.