map_identifiers: Documentation

Description Usage Arguments Details Value Examples

Description

Matches one or a group of given identifiers of gene, protein, transcript or gene symbol with unknown identifiers according to given filters.

Usage

1
2
mapIdentifiers(source_ids, source_bioelement, source_datasource, 
target_bioelement, target_datasource)

Arguments

source_ids

a single identifier or a character vector of identifiers. If some of the provided identifiers don't exist, they will be ignored.

source_bioelement

character string: the type of source element. Possible values are "gene", "protein", "transcript" and "genesymbol".

source_datasource

character string: source database name. Possible values are "ncbigene", "ensembl", "uniprot", "refseq", or "-". The value of "-" corresponds to any database currently known. See Examples for the details.

target_bioelement

character string: the type of target elements. Possible values are "gene", "protein", "transcript" and "genesymbol".

target_datasource

character string: target database name. Possible values are "ncbigene", "ensembl", "uniprot", "refseq", or "-". The value of "-" corresponds to all databases currently known. See Examples for the details.

Details

Returns all mappings with a distance of 1 to 3 for a given (set of) input identifiers (source_ids) to identifiers of a given type: the target_bioelement (gene/protein/transcript/genesymbol).

Mapping is done within the range of distances:

Within this range the function finds and returns the min distance number and all corresponding shortest paths descriptions between two elements.

IMPORTANT for NCBI ids: Please make sure you convert the identifier properly from factor/numeric to character before passing it to the mapIdentifiers() function.

Value

An object of class "data.frame" with folowing columns will be returned:

The data.frame contains as many rows as many matching target ids are found.

An empty data.frame is returned if no mappings up to distance 3 to the target bioelement exist.

If some of the provided source ids don't exist, they are ignored. No error or warning is returned for these ids.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
### Mapping gene with the identifier "26" from any known database 
### to all related transcripts from all databases currently known.

mapIdentifiers("26","gene","-","transcript","-")


### Same query, but specifying the source database. 
### Mapping gene with the identifier "26" from NCBI gene to all 
### related transcripts from all databases currently known.

mapIdentifiers("26","gene","ncbigene","transcript","-")


### Same query, but specifying the target database. 
### Mapping gene with the identifier "26" from NCBI gene to all 
### related transcripts from RefSeq.

mapIdentifiers("26","gene","ncbigene","transcript","refseq")


### Map the gene symbol "TNF" to Ensembl genes

mapIdentifiers("TNF","genesymbol","-","gene","ensembl")


### Map the NCBI gene id "7124" to Uniprot proteins

mapIdentifiers("7124","gene","ncbigene","protein","uniprot")


### Map a list of NCBI gene ids to proteins
### from all databases currently known.

mapIdentifiers(c("26","81539"),"gene","ncbigene","protein","-")

icb-knowing/biomapr documentation built on May 25, 2019, 6:22 p.m.