# R/random_gene.R In eriqande/SNPcontam: Detecting contaminated samples from SNP genotypes

#### Defines functions random_gene

#' Random generation of genotypes given genotype probabilities
#'
#' Generates genotypes (using 0, 1, and 2) under the assumption of non-contamination
#' (DNA of only one individual is represented) and contamination (DNA of two randomly
#' sampled individuals is present) given the allele frequencies at each locus.
#' @param n An integer representing the desired number of genotypes to be generated
#' @param af A vector of the allele frequencies for loci giving the frequency of the 1
#' allele in a population.
#' @return Returns a list of two named components:
#' \describe{
#'  \item{rclean}{Matrix with number rows equal to the number of loci (
#'  the length of af) and n columns representing n simulated genotypes.
#'  The given genotypes are generated under the assumption of non-contamination.}
#'  \item{rcontam}{Same as above but given genotypes are generated under the assumption
#'  of contamination.}
#' }
#' @export
#' @examples
#' # generates 5 different genotypes
#' random_gene(5,af = c(SNP1 = .1, SNP2 = .2))
random_gene <- function(n,af) {
L <- likelihood(af) # calls likelihood function which contans genotype probability
l = length(af) # number of loci
# for g matrices: columns are individuals and rows are loci
g1 <- matrix(0, l, n) # matrix for non contamined
g2 <- g1 # matrix for contamined individuals
# loop cycles through each locus
for (i in 1:l){
# stores genotype at locus in all individuals
#L$clean_prob is matrix fo genotype probabilities or each locus g1[i,] = sample(0:2, n, TRUE, prob = L$clean_prob[,i])
# same as above but for contaminated
g2[i,] = sample(0:2, n, TRUE, prob = L\$contam_prob[,i])
}
list(rclean = g1, rcontam = g2) # output the two random groups of genotypes as list
}

eriqande/SNPcontam documentation built on May 13, 2017, 8:20 p.m.