R/computeHeterozygotesPerGenome.R

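library(data.table)  # fread
library(dplyr)       # %>%, mutate, filter
library(stringr)     # str_detect
library(tibble)      # as_tibble

# Count the heterozygous sites of sample msp_0 in an msprime-generated VCF.
# windowSize and totalGenomeLength are accepted but not used yet.
computeHeterozygotesPerGenome<-function(msprimeVCFPath, windowSize=500000, totalGenomeLength=3000000){
  df<-as_tibble(fread(msprimeVCFPath))
  # A genotype is homozygous when both alleles carry the same index
  # (0|0, 1|1, ...). Matching only "1|1" would count 0|0 as heterozygous.
  df<-df %>%
    mutate(Homozygous=str_detect(msp_0, "^(\\d+)[|/]\\1(:|$)")) %>%
    mutate(Heterozygous=!Homozygous)

  numberOfHeterozygotes<-df %>%
    filter(Heterozygous) %>%
    nrow()

  return(numberOfHeterozygotes)
}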
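
# A minimal sketch of the genotype classification that the count above relies
# on, written in base R so it can be tried without reading a VCF.
# ASSUMPTIONS: isHeterozygousGenotype is a hypothetical helper and is not part
# of psmc2msprime; it assumes diploid GT strings such as "0|1" or "0/1",
# optionally followed by other FORMAT fields after a ':'.
isHeterozygousGenotype <- function(genotypes) {
  # Keep only the GT subfield; anything after the first ':' is dropped.
  gt <- sub(":.*$", "", as.character(genotypes))
  # Split on the phased '|' or unphased '/' separator.
  alleles <- strsplit(gt, "[|/]")
  # Heterozygous means exactly two called alleles that differ.
  vapply(
    alleles,
    function(a) {
      length(a) == 2 && !anyNA(a) && !any(a == ".") && a[1] != a[2]
    },
    logical(1)
  )
}

# Usage: sum(isHeterozygousGenotype(df$msp_0)) gives a heterozygote count
# that treats missing calls ("./.") as not heterozygous.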
JesseGarcia562/psmc2msprime documentation built on May 29, 2019, 8:20 p.m.