README.md

PHCPImpute

An R package for imputation of pseudo-haplotypes data, based on the PHCP algorithm. \ The main function takes a pseudo-haplotype target and a set of reference genomes and returns the posterior probabilities for the possible genotypes in each variant. Full details described in Waldman et al., Genome-wide data from medieval German Jews show that the Ashkenazi founder event pre-dated the 14th century, biorxiv, 2022.05.13.491805 (2002).

Installation

The package was built under R version 4.1.0 \ To install the current packages, the devtools package should be installed. \ Additional required packages for usage: dplyr.

# If devtools is not installed
install.packages("devtools") 
devtools::install_github("ShamamW/PHCPImpute")
library(PHCPImpute)
#if dplyr is not installed
install.packages("dplyr")
library(dplyr)

Usage

phcp_impute(haps, ancient_tped, genetic_map, chr, choose_donors = F, variants)

Input

Required arguments

Optional arguments

Output

The output is a data frame with the posterior probabilities of the genotypes in the imputed variants. I.e., each variant is assigned with the probabilities for Ref/Ref, Ref/Alt and Alt/Alt alleles. \ If a data frame with variants to be imputed was supplied to the function ("variants" argument), the output will include only the posterior probabilities of the chosen variants. Otherwise, the output will include the posterior probabilities for all variants in the reference panel.



ShamamW/PHCPImpute documentation built on Oct. 26, 2022, 6:34 a.m.