calc_mass_fractions: calc_mass_fractions

Description Usage Arguments Value

View source: R/calculate_mass_fractions.R

Description

A mass balance approach to ruling out sources of contamination. This function uses published source PAH concentrations, and calculates the mass fraction (as a percent) of the source that would be required to account for the reported concentration in the samples. For sources with multiple observations, the mean value is used. Mass fraction can be calculated for each sample, or for summary statistics across all samples (min, quartiles, mean, max).

Usage

1
2
3
calc_mass_fractions(compound_info, sample_column, conc_column,
  compound_column, conc_unit = "ppb", calc_type = "summary",
  plot = FALSE, sample_order = "norm_pah_conc")

Arguments

compound_info

The output dataframe from 'get_compound_info', which contains sample concentrations as well as compound-specific information, including whether the compound is one of 16 EPA priority compounds and compound-specific toxicity.

sample_column

string, column that contains unique sample identifier

conc_column

string, column that contains sample concentrations

compound_column

string, column that contains compound names.

conc_unit

string, the units of PAH concentrations, either "ppb" (ug/kg) or "ppm" (mg/kg).

calc_type

how to calculate mass fractions, either for each individual sample ('by_sample'), or by summary statistics across all samples 'summary'. Summary calculates mass fractions for all quartiles, minimum, mean, and maximum of sample concentrations.

plot

logical, whether 'by_sample' should be summarized as a tile plot rather than table.

sample_order

string, how the samples should be ordered, either by 'pah_conc', which is the sum of the EPA 16 priority compounds, or 'norm_pah_conc' which is the TOC-normalized PAH 16 concentration. Sources are considered "unlikely" when the percent of source to sample is greater than the percent TOC in the sample, given that PAHs are limited to the organic fraction. Ordering by TOC-normalized PAH concentration gives a smoother look to the figure, but is less intuitive in terms of sample ordering.

Value

If calc_type is "summary", each row represents a source, and source mean concentrations, number of PAHs used, and references are reported alongside percent mass fractions calculated for all quartiles, minimum, and maximum of all sample concentrations. If calc_type is 'by_sample', a data frame of n samples x j sources is given, where each cell represents the mass fraction for that sample-source combination. Sample IDs are given in a column, and columns are named by source ID.


limnoliver/pah documentation built on April 30, 2020, 2:45 p.m.