Description Usage Arguments Details Value Note Author(s) Examples
It is also possible to either enrich the screening dataset file with additional information provided by the biomaRt interface. For example, gene identifiers can be changed from EnsemblIDs to official gene symbols are Gene Ontology terms can be added to the dataset. This can be done using 'get.gene.info', which serves as a wrapper for the **biomaRt** package with its load of options and possibilities (more information see '?biomaRt').
You can convert any gene identifier which is included in your sgRNA identifer to e.g. EnsemblID or HGNC Gene Symbol using caRpools. **Please note that Internet Access is required for biomaRt.** For further information about biomaRt conversion, please see the [biomaRt Manual](www.bioconductor.org/packages/release/bioc/vignettes/biomaRt/inst/doc/biomaRt.pdf).
1 2 3 4 | get.gene.info(data, namecolumn=1, extractpattern=expression("^(.+?)(_.+)"),
host="www.ensembl.org", database="ENSEMBL_MART_ENSEMBL", dataset="hsapiens_gene_ensembl",
filters="ensembl_gene_id", attributes = c("hgnc_symbol"),
return.val = "dataset", controls=FALSE)
|
data |
Data frame that contains read-count data. *Default* none *Values* data.frame containing read-count data (data.frame) |
namecolumn |
In which column are the sgRNA identifiers? *Default* 1 *Values* column number (numeric) |
extractpattern |
PERL regular expression that is used to retrieve the gene identifier from the overall sgRNA identifier. e.g. in **AAK1_107_0** it will extract **AAK1**, since this is the gene identifier beloning to this sgRNA identifier. **Please see: Read-Count Data Files** *Default* expression("^(.+?)(_.+)"), will work for most available libraries. *Values* PERL regular expression with parenthesis indicating the gene identifier (expression) |
host |
Host used to retrieve biomaRt information. By default, host is set to www.ensembl.org. |
database |
BiomaRt database to be used. See '?listMarts()' or biomaRt documentation. *Default* "ENSEMBL_MART_ENSEMBL", is using the ensembl database *Values* Any biomaRt database (character) |
dataset |
The biomaRt dataset to be used. For *homo sapiens*, *hsapiens_gene_ensembl* is recommended. See '?listDatasets' or biomaRt documentation. *Default* "hsapiens_gene_ensembl" *Values* Any biomaRt dataset (character) |
filters |
The input filter information to retrieve biomaRt annotation, usually is the type of gene identifier used in the read-count files, e.g. "ensemble_gene_id". see '?listFilters' *Default* "ensembl_gene_id" *Values* Any biomaRt filter (character) |
attributes |
The output attribute to retrieve from biomaRt, usually the annotations that need to be fetched, e.g. "hgnc_symbol". see '?listAttributes' *Default* "hgnc_symbol" *Values* Any biomaRt attribute (character) |
return.val |
The type of object that is returned. For whole dataset, e.g. conversion of gene identifiers, use "dataset". *Default* "dataset" *Values* "dataset" (will give back the same data frame, but with exchanged gene identifiers), "info" (will return a data frame with all attributes fetched for genes, is used to annotate gene with additional information) |
controls |
Is set to TRUE if 'data' is not a data frame, but a vector. *Default* FALSE *Values* TRUE, FALSE (boolean) |
none
Return either a data.frame with converted gene identifier or a data frame with annotations.
none
Jan Winter
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 | data(caRpools)
#CONTROL1.replaced = get.gene.info(CONTROL1, namecolumn=1,
#extractpattern=expression("^(.+?)(_.+)"), host="www.ensembl.org",
#database="ensembl", #dataset="hsapiens_gene_ensembl",
#filters="hgnc_symbol",attributes = c("ensembl_gene_id"),
#return.val = "dataset")
#knitr::kable(CONTROL1.replaced[1:10,])
#CONTROL1.replaced.info = get.gene.info(CONTROL1, namecolumn=1,
#extractpattern=expression("^(.+?)(_.+)"), database="ensembl",
#dataset="hsapiens_gene_ensembl", filters="hgnc_symbol",
#attributes = c("ensembl_gene_id","description"), return.val = "info")
#knitr::kable(CONTROL1.replaced.info[1:10,])
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.