popkin_A | R Documentation |
A
and M
matrices from genotypesThis function returns lower-level, intermediate calculations for the main popkin
function.
These are not intended for most users, but rather for researchers studying the estimator.
popkin_A( X, n = NA, loci_on_cols = FALSE, mean_of_ratios = FALSE, mem_factor = 0.7, mem_lim = NA, m_chunk_max = 1000 )
X |
Genotype matrix, |
n |
Number of individuals (required only when |
loci_on_cols |
If |
mean_of_ratios |
Chose how to weigh loci.
If |
mem_factor |
Proportion of available memory to use loading and processing data.
Ignored if |
mem_lim |
Memory limit in GB, used to break up data into chunks for very large datasets.
Note memory usage is somewhat underestimated and is not controlled strictly.
Default in Linux is |
m_chunk_max |
Sets the maximum number of loci to process at the time. Actual number of loci loaded may be lower if memory is limiting. |
A named list containing:
A
: n-by-n matrix, for individuals j
and k
, of average w_i * ( ( x_ij - 1 ) * ( x_ik - 1 ) - 1)
values across all loci i
in X
; if mean_of_ratios = FALSE
, w_i = 1
, otherwise w_i = 1 / (p_est_i * (1 - p_est_i) )
where p_est_i
is the reference allele frequency.
M
: n-by-n matrix of sample sizes (number of loci with non-missing individual j
and k
pairs, used to normalize A
)
The main popkin()
function (a wrapper of this popkin_A
function and popkin_A_min_subpops()
to estimate the minimum A
value).
# Construct toy data X <- matrix(c(0,1,2,1,0,1,1,0,2), nrow = 3, byrow = TRUE) # genotype matrix # NOTE: for BED-formatted input, use BEDMatrix! # "file" is path to BED file (excluding .bed extension) # library(BEDMatrix) # X <- BEDMatrix(file) # load genotype matrix object obj <- popkin_A(X) # calculate A and M from genotypes A <- obj$A M <- obj$M
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.