geneid.map: Function to find the common genes between two datasets or a...

Description Usage Arguments Value Note Author(s) Examples

Description

This function allows for fast mapping between two datasets or a dataset and a gene list. The mapping process is performed using Entrez Gene id as reference. In case of ambiguities (several probes representing the same gene), the most variant probe is selected.

Usage

1
geneid.map(geneid1, data1, geneid2, data2, verbose = FALSE)

Arguments

geneid1

first vector of Entrez Gene ids. The name of the vector cells must be the name of the probes in the dataset data1.

data1

First dataset with samples in rows and probes in columns. The dimnames must be properly defined.

geneid2

Second vector of Entrez Gene ids. The name of the vector cells must be the name of the probes in the dataset data1 if it is not missing, proper names must be assigned otherwise.

data2

First dataset with samples in rows and probes in columns. The dimnames must be properly defined. It may be missing.

verbose

TRUE to print informative messages, FALSE otherwise.

Value

geneid1

Mapped gene list from geneid1.

data1

Mapped dataset from data1.

geneid2

Mapped gene list from geneid2.

data2

Mapped dataset from data2.

Note

It is mandatory that the names of geneid1 and geneid2 must be the probe names of the microarray platform.

Author(s)

Benjamin Haibe-Kains

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
## load NKI data
data(nkis)
nkis.gid <- annot.nkis[ ,"EntrezGene.ID"]
names(nkis.gid) <- dimnames(annot.nkis)[[1]]
## load GGI signature
data(sig.ggi)
ggi.gid <- sig.ggi[ ,"EntrezGene.ID"]
names(ggi.gid) <- as.character(sig.ggi[ ,"probe"])
## mapping through Entrez Gene ids of NKI and GGI signature
res <- geneid.map(geneid1=nkis.gid, data1=data.nkis,
  geneid2=ggi.gid, verbose=FALSE)
str(res)

Example output

Loading required package: survcomp
Loading required package: survival
Loading required package: prodlim
Loading required package: mclust
Package 'mclust' version 5.4.7
Type 'citation("mclust")' for citing this R package in publications.
Loading required package: limma
Loading required package: biomaRt
Loading required package: iC10
Loading required package: pamr
Loading required package: cluster
Loading required package: impute
Loading required package: iC10TrainingData
Loading required package: AIMS
Loading required package: e1071
Loading required package: Biobase
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package:BiocGenericsThe following objects are masked frompackage:parallel:

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
    clusterExport, clusterMap, parApply, parCapply, parLapply,
    parLapplyLB, parRapply, parSapply, parSapplyLB

The following object is masked frompackage:limma:

    plotMA

The following objects are masked frompackage:stats:

    IQR, mad, sd, var, xtabs

The following objects are masked frompackage:base:

    anyDuplicated, append, as.data.frame, basename, cbind, colnames,
    dirname, do.call, duplicated, eval, evalq, Filter, Find, get, grep,
    grepl, intersect, is.unsorted, lapply, Map, mapply, match, mget,
    order, paste, pmax, pmax.int, pmin, pmin.int, Position, rank,
    rbind, Reduce, rownames, sapply, setdiff, sort, table, tapply,
    union, unique, unsplit, which.max, which.min

Welcome to Bioconductor

    Vignettes contain introductory material; view with
    'browseVignettes()'. To cite Bioconductor, see
    'citation("Biobase")', and for packages 'citation("pkgname")'.

List of 4
 $ geneid1: Named chr [1:54] "10212" "4605" "332" "4171" ...
  ..- attr(*, "names")= chr [1:54] "NM_005804" "NM_002466" "NM_001168" "NM_004526" ...
 $ data1  : num [1:150, 1:54] -0.078 0.321 -0.068 -0.282 -0.178 -0.157 -0.017 0.263 -0.07 -0.156 ...
  ..- attr(*, "dimnames")=List of 2
  .. ..$ : chr [1:150] "NKI_123" "NKI_327" "NKI_291" "NKI_370" ...
  .. ..$ : chr [1:54] "NM_005804" "NM_002466" "NM_001168" "NM_004526" ...
 $ geneid2: Named chr [1:54] "10212" "4605" "332" "4171" ...
  ..- attr(*, "names")= chr [1:54] "201584_s_at" "201710_at" "202094_at" "202107_s_at" ...
 $ data2  : NULL

genefu documentation built on Jan. 28, 2021, 2:01 a.m.