View source: R/estimate_lineages.R
estimate_lineages | R Documentation |
Create summary tables containing data on lineages identified in samples, including estimates of relative proportions of lineages and identities of associated characteristic mutations.
estimate_lineages( muts.df, min.alt.freq = 0.01, dates = NULL, lineage.muts = NULL, read.muts.from = NULL, scale = TRUE, use.median = FALSE, outfile.name = NULL, presence.thresh = 0.5, samps.to.inc = NULL, locs.to.inc = NULL, lineages.to.inc = NULL, report.all = FALSE, depths.from = "all" )
muts.df |
A data frame (produced by |
min.alt.freq |
Minimum frequency (0-1) for mutation to be counted. Default = 0.01. |
dates |
Path to optional csv file with cols "SAMP_NAME", "LOCATION", and "DATE". Sample names need to match those in samp_mutations data frame created by |
lineage.muts |
(Required) Path to csv file with cols "Gene", "Mutation", and "Lineage" defining mutations associated with lineages of interest. See example file at "https://github.com/mikesovic/MixviR/blob/main/mutation_files/outbreak_20220217.csv". Additional columns will be ignored. |
read.muts.from |
An alternative to muts.df for providing input. If a data frame generated by |
scale |
Logical to indicate whether estimated proportions of lineages within a sample should be scaled down to sum to 1 if the sum of the initial estimates is > 1. Default = TRUE. |
use.median |
Logical to define the metric used to estimate frequencies of lineages in samples. Default = FALSE (mean is used). |
outfile.name |
If writing output to file, a character string giving the name/path of the file (csv) to be written. |
presence.thresh |
Numeric (0-1) defining a proportion of characteristic mutations that must be present in the sample for a lineage to be considered present. This threshold is applied if report.all = FALSE (the default). |
samps.to.inc |
Character vector of one or more sample names to include. If NULL (default), all samples are included. |
locs.to.inc |
Character vector of one or more locations to include. If NULL (default), all locations are included. Applies only if a dates file is provided, and these locations must match those in the 'LOCATION' column of that file. |
lineages.to.inc |
Character vector of one or more lineages to test for and report in results. If NULL (default), all lineages listed in the lineage.muts file are evaluated and reported. |
report.all |
Logical indicating whether to report results for all lineages (TRUE), or just those with a proportion of mutations present that exceeds presence.thresh. Default FALSE. |
depths.from |
Character, one of "all" (default) or "characteristic". If "all", average sequencing depths are calculated based on all mutations in a sample. If "characteristic", mean depths are calculated from the set of mutations that occur in only one analyzed lineage (mutations shared by two or more lineages are filtered out prior to calculating depths). |
Data frame containing estimates of proportions of each lineage in the sample.
estimate_lineages(lineage.muts = system.file("extdata", "example_lineage_muts.csv", package = "MixviR"), read.muts.from = system.file("extdata", "sample_mutations.csv", package = "MixviR"))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.