Description Usage Arguments Details
The Chao estimate of the total number of unseen variants based on training gene mutation frequencies
1 |
counts |
vector of counts or frequencies of the observed variants. |
r |
unique frequencies. |
N_r |
frequency of frequency r. |
m |
training cohort size. |
Calculates Chao (1987) estimate of the total number of unseen variants. Also provides an approximate standard error ("se") of the estimate as an attribute, computed using the formula provided in Chao (1987).
## Not run: # load tcga data data("tcga") tcga <- data.table::setDT(tcga) # calculate variant frequencies var_freq <- tcga[, .(v_f = length(unique(patient_id))), by = .(Hugo_Symbol, Variant) ] # calculate cohort size m <- length(unique(tcga$patient_id)) # SGT estimate for t = 0.5, 1, 10 chao_N0(counts = var_freq$v_f, m = m, t = 0.5) ## End(Not run)Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.