variable_dist_per_site: Calculated distribution of variable per site
In surh/HMVAR: Human Microbiome Variant Analysis in R

Description Usage Arguments Value Examples

Calculates the distribution of values of a categorical variable per site from a table that contains one row per site per sample.

1	variable_dist_per_site(dat, variable, group = NULL)

`dat`	A data frame or tibble containing columns "site_id", "ref_id" and "ref_pos". Each row must correspond to a site per sample.
`variable`	Column name of variable to evaluate. It must be a categorical variable.
`group`	If passed, it must correspond to a column name in dat. That column must be a grouping factor and the distribution will be calculated independently for each group.

A tibble with columns "site_id", "ref_id", and "ref_pos". There will also be one column per level in 'variable', and, optionally, one column for 'group'.

library(magrittr)
map <- readr::read_tsv(system.file("toy_example/map.txt",
                                   package = "HMVAR"),
                       col_types = readr::cols(ID = readr::col_character(),
                                               Group = readr::col_character())) %>%
  dplyr::select(sample = ID,
                tidyselect::everything())
Dat <- read_midas_data(midas_dir = system.file("toy_example/merged.snps/",
                                               package = "HMVAR"),
                       map = map,
                       cds_only = FALSE)

dat <- match_freq_and_depth(freq = Dat$freq,
                            depth = Dat$depth,
                            info = Dat$info,
                            map = map) %>%
  determine_sample_dist()
dat