This R package has code and data for papers by Jeffrey M. Dick. Plots from the papers are reproduced in the vignettes, which are installed with the package and can be viewed at https://chnosz.net/JMDplots/vignettes/.
Click on the paper titles for a list of files. Published papers are indicated by the year with a DOI link. Links to preprints, if available, are at the end of each list. See the manual page associated with each paper for additional details about scripts, data files, and plotting functions.
genoGOE
: Earth's oxygenation recorded in genomes (in-preparation manuscript)
inst/extdata/genoGOE: scripts and processed data files
GTDB: Processing data from GTDB with additional features
methanogen: sequences and amino acid compositions generated using process_GTDB.R
MCK+23: Data for genomes with sulfur-cycling genes from Mateos et al. (2023)
GMKK20: Data for extant and reconstructed ancestral nitrogenase sequences taken from Garcia et al. (2020)
inst/extdata/evdevH2O/LMM16: scripts and processed data files for consensus gene ages from Liebeskind et al. (2016), modified from the files used by Dick (2022)
../canprot/inst/extdata/fasta/KHAB17.fasta: reconstructed ancestral Rubisco sequences taken from Kaçar et al. (2017)
microhum
: Adaptations of microbial genomes to human body chemistry (submitted manuscript)
inst/extdata/microhum: scripts and processed data files
ARAST: analysis of metagenomes
KWL22: analysis of metagenome-assembled genomes (MAGs) from Ke et al. (2022)
metaproteome: analysis of metaproteomes
16S: analysis of 16S rRNA gene sequences
MR18_Table_S1_modified.csv: data: List of Prokaryotes according to their Aerotolerant or Obligate Anaerobic Metabolism, modified from Million and Raoult (2018)
Figure_5_genera.txt: data: List of genera in Figure 5, created from the value invisibly returned by microhum_5()
.
R/microhum.R: code for plots
vignettes/microhum.Rmd: vignette including Figures 1–6 and figure supplements.
microhum.html: compiled HTML version of the vignette (external link)
bioRxiv: preprint (external link)
chem16S
: Community-level chemical metrics for exploring genomic adaptation to environments (2023)
vignettes/chem16S.Rmd: vignette including Figure 1
../chem16S/inst/extdata: scripts and processed data files (NOTE: these files are in the chem16S package; see chem16S-package.Rd for details)
orp16S
: Community- and genome-based evidence for a shaping influence of redox potential on bacterial protein evolution (2023)
inst/extdata/orp16S: scripts and processed data files
metaproteome: analysis of metaproteomes
R/orp16S.R: code for plots
vignettes/orp16S.Rmd: vignette including Figures 1–6, S1–S2, and Table 1
bioRxiv: preprint (external link)
utogig
: Using thermodynamics to obtain geochemical information from genomes (2023)
vignettes/utogig.Rmd: vignette including Figures 1–4, S1–S4, Table S6, and conversions between redox scales
Amino acid compositions and taxonomic information have been obtained from the Saccharomyces Genome Database (SGD), UniProt, RefSeq, GTDB, and MGnify. See man/JMDplots-package.Rd for further details.
Reference databasesinst/extdata/RefDB/organisms: Data for particular organisms, downloaded from SGD or UniProt.
Sce.csv.xz: Saccharomyces cerevisiae (used in the scsc and aoscp papers)
UP000001570_224308.csv.xz: Bacillus subtilis strain 168 (used in the evdevH2O paper)
inst/extdata/RefDB/RefSeq: Data files processed from RefSeq and used in the geo16S and orp16S papers
Scripts to produce these files are in chem16S
inst/extdata/RefDB/GTDB: Data files processed from GTDB and used in the microhum manuscript
Scripts to produce these files are in chem16S
inst/extdata/RefDB/UHGG: Data files processed from MGnify's UHGG and used in the microhum manuscript
taxonomy.csv.xz
, genome_AA.csv.xz
, and taxon_AA.csv.xz
for the full set of 4744 genomesFirst install the remotes packages from CRAN.
install.packages("remotes")
Then install other required packages: canprot and chem16S.
remotes::install_github("jedick/canprot")
remotes::install_github("jedick/chem16S")
Note
Currently (as of 2023-07-31), JMDplots depends on the development versions of canprot and chem16S from GitHub, not the released versions on CRAN.
Finally, install JMDplots. This command will install prebuilt vignettes; they might not be up-to-date with the source code.
remotes::install_github("jedick/JMDplots")
To view the plots, use the R help browser or this command to open the vignettes page:
browseVignettes("JMDplots")
remotes::install_github("jedick/JMDplots", dependencies = TRUE, build_vignettes = TRUE)
Note It might be possible to build the vignettes without pandoc, but having pandoc available will make them look better.
This package except for the file inst/extdata/orp16S/metadata/PCL+18.csv
is licensed under the GNU General Public License v3 (GPLv3).
The ORP (mV), DO (mg/L) and Feature (Stream, Spring, Lake, Terrace, or Geyser) data for New Zealand hot springs (Power et al., 2018) in PCL+18.csv
were obtained from the 1000 Springs Project and are licensed under CC-BY-NC-SA.
This package contains a copy of the dunnTest()
function by Derek H. Ogle from CRAN package FSA, version 0.9.3 (License: GPL (>= 2)), which itself is a wrapper for dunn.test()
from CRAN package dunn.test by Alexis Dinno.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.