This is a 975-sample subset of the Earth Microbiome Project (EMP) dataset containing 150 bp sequences. Samples were picked randomly and evenly across 17 habitats, and then evenly across 97 studies within each sample type.
This package provides the following data frames:
biom_sub2k_150bp
: observation-sequence counts per sample
observation_metadata_sub2k150
: observation-sequence taxonomy
sample_metadata_sub2k
: sample metadata

# install.packages("devtools")
devtools::install_github("nikopech/empdata")
library(empdata)
str(biom_sub2k_150bp)
# str(observation_metadata_sub2k150)
# str(sample_metadata_sub2k)
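
As a quick sanity check after loading, it can help to compare the sizes of the three data frames before looking at their full structure. The following is a minimal sketch, not part of the package: `table_dims` is a hypothetical helper written for this example, and the `requireNamespace` guard assumes only that the package installs under the name `empdata`, as in the install command above.

```r
# Hypothetical helper (not provided by empdata): given a named list of
# data frames, return a matrix with one row per table and its row/column counts.
table_dims <- function(tables) {
  out <- t(vapply(tables, dim, integer(2)))
  colnames(out) <- c("rows", "cols")
  out
}

# Run against the packaged data only if empdata is installed.
if (requireNamespace("empdata", quietly = TRUE)) {
  library(empdata)
  print(table_dims(list(
    counts   = biom_sub2k_150bp,
    taxonomy = observation_metadata_sub2k150,
    samples  = sample_metadata_sub2k
  )))
}
```

The helper uses only base R, so it can also be applied to any other named list of data frames.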
All data sets were collected from the FTP site of the Earth Microbiome Project.
Sample processing, sequencing, and core amplicon data analysis were performed by the Earth Microbiome Project (www.earthmicrobiome.org), and all amplicon sequence data and metadata have been made public through the EMP data portal (qiita.microbio.me/emp).
Please cite the following publication if you use any of them:
Thompson, L., Sanders, J., McDonald, D. et al. A communal catalogue reveals Earth’s multiscale microbial diversity. Nature 551, 457–463 (2017). https://doi.org/10.1038/nature24621
Please see the LICENSE file for details.