View source: R/resolve_duplicate_samples.R
If a sample was sequenced on multiple sequencing runs (i.e. it's present with multiple 'source' variables) this function identifies the sequencing run with more missing data and removes that sample-by-source combination. This function does not test if the duplicates are genotypically identical.If you want to confirm that they are truly duplicates with identical genotype data go have a crack at 'CKMRsim::find_close_matching_genotypes'.
1 | resolve_duplicate_samples(long_genos)
|
long_genos |
genetic data in long format where each sample has 2 rows per locus. This dataframe could be the output from 'filter_raw_microhap_data' |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.