View source: R/non_redundant_pangenomes.R
non_redundant_pangenomes | R Documentation |
Remove redundant sequences from a file list of sequences (nucleotide or protein).
non_redundant_pangenomes(
file_list,
distance,
type = "prot",
n_cores,
sketch = 1000,
kmer = 21
)
file_list |
A data.frame with the path list of files or a gff_list object. |
distance |
Sequence distance (<1) |
type |
"nucl" or "prot" |
n_cores |
number of cores to use (default: total cores -1) |
sketch |
sketch size (mash option) |
kmer |
kmer size (mash option) |
This function has been designed to remove redundant sequences from a dataset. Unlike other non redundant functions, this function only accept a distance threshold and has been designed to remove very similar sequences (>99.9%)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.