HGNC.EnforceUnique | R Documentation |
Ensures that gene symbols are unique after being updated with HGNC symbols. This function
applies a suffix to duplicate gene symbols to enforce uniqueness. While using make.unique
might not
be the ideal solution due to potential mismatches, it significantly reduces the number of mismatching
genes in certain scenarios, making it a practical approach for data integration tasks.
HGNC.EnforceUnique(updatedSymbols)
updatedSymbols |
A data frame or matrix containing gene symbols updated via |
The function specifically targets the issue of duplicate gene symbols which can occur after updating gene symbols to their latest HGNC-approved versions. Duplicate symbols can introduce ambiguity in gene expression datasets, affecting downstream analyses like differential expression or data integration. By ensuring each gene symbol is unique, this function helps maintain the integrity of the dataset.
A modified version of the input data frame or matrix with unique gene symbols in the third column.
If duplicates were found, they are made unique by appending .1
, .2
, etc., to the repeated symbols.
This function is a workaround for ensuring unique gene symbols and might not be suitable for all datasets or analyses. It's important to review the results and ensure that the gene symbols accurately represent your data.
## Not run:
if (interactive()) {
# Assuming `SymUpd` is your data frame of updated symbols from HGNChelper::checkGeneSymbols()
uniqueSymbols <- HGNC.EnforceUnique(updatedSymbols = SymUpd)
# `uniqueSymbols` now contains unique gene symbols in its third column
}
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.