semantic_copynumber_filter | R Documentation |
In our parasite data, there are a few gene types which are consistently obnoxious. Multi-gene families primarily where the coding sequences are divergent, but the UTRs nearly identical. For these genes, our sequence based removal methods fail and so this just excludes them by name.
semantic_copynumber_filter(
input,
max_copies = 2,
use_files = FALSE,
invert = TRUE,
semantic = c("mucin", "sialidase", "RHS", "MASP", "DGF", "GP63"),
semantic_column = "product"
)
input |
List of sets of genes deemed significantly up/down with a column expressing approximate count numbers. |
max_copies |
Keep only those genes with <= n putative copies. |
use_files |
Use a set of sequence alignments to define the copy numbers? |
invert |
Keep these genes rather than drop them? |
semantic |
Set of strings with gene names to exclude. |
semantic_column |
Column in the DE table used to find the semantic strings for removal. |
Currently untested, used for Trypanosome analyses primarily, thus the default strings.
Smaller list of up/down genes.
[semantic_copynumber_extract()]
## Not run:
pruned <- semantic_copynumber_filter(table, semantic = c("ribosomal"))
## Get rid of all genes with 'ribosomal' in the annotations.
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.