querycluster: Cluster One ncRNA Gene Based On Its Co-Expression in Multiple...

Description Usage Arguments Details Value Author(s) References See Also Examples

View source: R/LINC_FUNCTIONS.R

Description

querycluster takes a set of 'LINCcluster' objects, extracts the respective co-expressed protein-coding genes and plots a dendrogram with the distance matrix attached. This function is intended to be applied in a case where a particular ncRNA occurrs in datasets which represent different tissues, batches, statistical corrections, reduced gene sets, controls and so on. The output will show the clustering of the groups and therefore the information under which condition is the co-expression to the query most similar.

Usage

1
2
3
4
5
6
7
querycluster(query = NULL,
             queryTitle = NULL,
             traits = NULL, 
             method = "spearman",
             returnDat = FALSE,
             mo_promise,
             ...)

Arguments

query

the query name, i.e. the gene id of a ncRNA present in the supplied input

queryTitle

a character string used as the title of the plot

traits

NULL or a single integer. For NULL all co-expressed genes will be used. A number instead will be considered as maximal number of traits.

method

a character string, has to be one of c("spearman", "dicedist")

returnDat

whether to return the data used to create the plot TRUE or not FALSE

mo_promise

mo_promise = 'list', a list of 'LINCcluster' objects (see example)

...

the 'LINCcluster' objects itself, but not a combination of both, mo_promise = 'list' and supplying the objects itself (see example)

Details

This function will search for co-expressed protein-coding genes which belong to a the defined query. Based on the co-expression in the input 'LINCcluster' objects a distance matrix is computed. The method "spearman" finds the union of all interaction partners for the query und calculates the correlation between the 'LINCcluster' objects. For this method the distance measure is (1 - correlation). Alternatively, method = "dicedist" takes the Czekanovski dice distance [1] as distance mesaure of the traits = n genes. This method, however, will not work with traits = NULL. Choosing a low number for n will limit the number of different values in the distance matrix.

Apart from queryTitle the command 'LINCcluster' + feature(customID = ..., customCol = ...) enables a customized plot as output. For this to work the supplied 'LINCcluster' objects in ... have to be modified by the function feature(...) in advance.

Value

an object of the class 'gg' or a 'list' if returnDat = TRUE

Author(s)

Manuel Goepferich

References

[1] Christine Brun, Francois Chevenet, David Martin, Jerome Wojcik, Alain Guenoche and Bernard Jacq" Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network" (2003) Genome Biology, 5:R6.

See Also

feature ; clusterlinc

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
data(BRAIN_EXPR)

# add custom names and colors
gbm_cluster <- gbm_cluster + feature(customID = "CANCER_GBM", customCol = "red")
ctx_cluster <- ctx_cluster + feature(customID = "HEALTHY_CTX", customCol = "blue")
hpc_cluster <- hpc_cluster + feature(customID = "HEALTHY_HPC", customCol = "blue")
crbl_cluster <- crbl_cluster + feature(customID = "HEALTHY_CRBL", customCol = "blue")

# plot the dendrogram
querycluster('647979', queryTitle = 'NORAD',
             gbm_cluster,  # Glioblastoma
             ctx_cluster,  # Cortex
             hpc_cluster,  # Hippocampus
             crbl_cluster) # Cerebellum

# objects can also be supplied as a list
query_list <- list(gbm_cluster,
                   ctx_cluster,
                   hpc_cluster,
                   crbl_cluster)

# mo_promise is the (informal) argument for multiple objects
querycluster(query = '647979', queryTitle = 'NORAD', mo_promise = query_list)

# used the Czekanovski dice distance based on the 25 best 
# interaction partners in each tissue
querycluster(query = '647979', method = "dicedist", traits = 25, mo_promise = query_list)

# NOT RUN:
# querycluster(query = '647979', method = "dicedist", mo_promise = query_list)

ManuelGoepferich/LINC documentation built on May 7, 2019, 2:46 p.m.