In kg737/INDEED_Patch: Integrated Differential Expression and Differential Network Analysis (INDEED)

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)

The biomarker candidates selected by INDEED lead to more accurate survival time prediction compared with those selected by differential expression (DE) analysis and differential network (DN) analysis.

Introduction

Differential expression (DE) analysis is commonly used to identify biomarker candidates that have significant changes in their expression levels between distinct biological groups. One drawback of DE analysis is that it only considers the changes on single biomolecular level. In differential network (DN) analysis, network is typically built based on the correlation and biomarker candidates are selected by investigating the network topology. However, correlation tends to generate over-complicated networks and the selection of biomarker candidates purely based on network topology ignores the changes on single biomolecule level. Thus, we have proposed a novel method INDEED, which considers both the changes on single biomolecular and network levels by integrating DE and DN analysis. INDEED has been published in Methods journal (PMID: 27592383). This is the R package that implements the algorithm and provides a a tool to visualize the network.

This R package will generate a list of dataframes containing information such as p-values, node degree and activity score for each biomolecule. A higher activity score indicates that the corresponding biomolecule has more neighbors connceted in the differential network and their p-values are more statistically significant. It will also generate a csv file for the differential network created by INDEED.

The INDEED package doesn't get loaded automatically, so remember to load it first:

library(INDEED)

glasso function will also be loaded as INDEED depends on it to obtain the sparse differential network.

partial correlation data preprocessing function `pre_partial()`

To use the function sig_select(), users will need to have these data sets in hand:

(data) A data frame that contains the expression level of individual biomolecule from two biologically disparate groups (p * n)
(class_label) A binary array with group 1 labeled as 0 and group 2 as 1
(id) An array that includes the corresponding ID for each biomolecule

partial correlation result calculation function `partial_cor()`

(data_list) A list of dataframes output from pre_partial step
(rho_group1) Rule to choose rho for first group of data, user can input "min" for minimum rule, "ste" as one standard error rule, or user can input a number of choice
(rho_group2) Rule to choose rho for second group of data, user can input "min" for minimum rule, "ste" as one standard error rule, or user can input a number of choice
(permutation) A positive integer number of permutation, default 1000
(p_val) A dataframe containing p-values obtained from DE analysis (optional)

The demo data is presented in the Demo Data section below. It will be automatically loaded with the package.

In partical correlation method, user will need to preprocess the data using pre_partial() function, followed by using partial_cor() and choosing desired rho number and complete the analysis. User can also choose to provide p-value table and number of permutations in partial_cor() function. Result will be saved in a list of two dataframes: activity_score and diff_network. Activity_score contains a dataframe with biomolecules ranked by activity score calculated from p-value and node degree. Diff_network contains a dataframe of connection weight between two nodes.

The following example demonstrates how to use pre_partial() and partial_cor()function:

pre_data <- pre_partial(data=Met_GU,class_label = Met_Group_GU,id=Met_name_GU)
partial_cor(data_list=pre_data,rho_group1='min',
              rho_group2="min",permutation = 1000,p_val=pvalue_M_GU)

In this case, the sparse differential network is based on partial correlation and p-value for each biomolecule is calculated for users. Also, rho is picked based on minimum rule and number of permutations is set to 1000.

non-partial correlation data preprocessing function `non_partial_cor()`

(data) A data frame that contains the expression level of individual biomolecule from two biologically disparate groups (p * n)
(class_label) A binary array with group 1 labeled as 0 and group 2 as 1
(id) An array that includes the corresponding ID for each biomolecule
(method) Type of correlation, user can choose between pearson and spearman correaltion, default pearson
(permutation) A positive integer number of permutation, default 1000
(p_val) An optional dataframe containing p-values obtained from DE analysis

In non partical correlation method, user only need to run non_partial_cor() function. No rho number will be needed.In this method, user input dataframe with expression level, class label, id, method, number of permutation and p-value similar to partical correaltion but in one step fashion. Result will be saved in a list of two dataframes: activity_score and diff_network. Activity_score contains a dataframe with biomolecules ranked by activity score calculated from p-value and node degree. Diff_network contains a dataframe of connection weight between two nodes.

The following example demonstrates how to use non_partial_cor() function:

non_partial_cor(data=Met_GU,class_label = Met_Group_GU,id=Met_name_GU,method="spearman")

Non-Interactive Network Visualization function `network_display()`

(results) This is the result from the calling either partial_cor() or non_partial_cor().
(nodesize) This parameter determines what the size of each node will represent. The options are 'Node_Degree', 'Activity_Score', 'Z_Score', and 'P_Value'. The title of the resulting network will identify which parameter was selected to represent the node size. The default is Node_Degree.
(nodecolor) This parameter determines what color each node will be, based on a yellow or red color gradient. The options are 'Node_Degree', 'Activity_Score', 'Z_Score', and 'P_Value'. A color bar will be created based on which parameter is chosen.
(edgewidth) This is a 'YES' or 'NO' option as to if the edgewidth should be representative of the weight value corresponding to the correlation between two nodes.
(layout) User can choose from a a handful of network visualization templates including:'nice', 'sphere', 'grid', 'star', and 'circle'.

A tool to assist in the visualization of the results from INDEED functions patial_cor() and non_partial_cor(). The size and the color of each node can be adjusted by users to represent either the Node_Degree, Activity_Score, Z_Score, or P_Value. The color of the edge is based on the binary value of either 1 corresonding to a positive correlation dipicted as green or a negative correlation of -1 dipicted as red. The user also has the option of having the width of each edge be proportional to its weight value. The layout of the network can also be customized by choosing from the options: 'nice', 'sphere', 'grid', 'star', and 'circle'.

The following example demonstrates how to use the network_display() function:

network_display(results = result1, layout = 'nice', 
                nodesize = 'Node_Degree', nodecolor = 'Activity_Score', edgewidth = 'NO')

Interactive Network Visualization function `networkVis()`

(results) This is the result from the calling either partial_cor() or non_partial_cor().
(nodesize) This parameter determines what the size of each node will represent. The options are 'Node_Degree', 'Activity_Score', 'Z_Score', and 'P_Value'. The title of the resulting network will identify which parameter was selected to represent the node size. The default is Node_Degree.
(nodecolor) This parameter determines what color each node will be, based on a yellow or red color gradient. The options are 'Node_Degree', 'Activity_Score', 'Z_Score', and 'P_Value'. A color bar will be created based on which parameter is chosen.
(edgewidth) This is a 'YES' or 'NO' option as to if the edgewidth should be representative of the weight value corresponding to the correlation between two nodes.
(layout) User can choose from a a handful of network visualization templates including:'nice', 'sphere', 'grid', 'star', and 'circle'.

An interactive tool to assist in the visualization of the results from INDEED functions patial_corr() or non_partial_corr(). The size and the color of each node can be adjusted by users to represent either the Node_Degree, Activity_Score, Z_Score, or P_Value. The color of the edge is based on the binary value of either 1 corresonding to a positive correlation dipicted as green or a negative correlation of -1 dipicted as red. The user also has the option of having the width of each edge be proportional to its weight value. The layout of the network can also be customized by choosing from the options: 'nice', 'sphere', 'grid', 'star', and circle'. Nodes can be moved and zoomed in on. Each node and edge will display extra information when clicked on. Secondary interactions will be highlighted as well when a node is clicked on.

The following example demonstrates how to use the networkVis() function:

networkVis(results = result1, layout= 'nice', nodesize= 'Node_Degree', 
                                       nodecolor= 'Activity_Score', edgewidth= 'NO')

More Examples

Example 1

pre_data <- pre_partial(data=Met_GU,class_label = Met_Group_GU,id=Met_name_GU)
partial_cor(data_list=pre_data,rho_group1=0.3,rho_group2=0.25,permutation = 500)

By calling the above function, rho for group 1 is 0.3 and rho for group 2 is 0.25, permutation is 500 and p-value will be calculated for users.

Example 2

result2 <- non_partial_cor(data=Met_GU,class_label = Met_Group_GU,
                            id=Met_name_GU,method="spearman",permutation_thres = 0.05)

By calling the above function, the differential network is obtained based on Pearson correlation coefficient. And the table below is part of the csv file output.

Example 3

network_display(results = result1, layout = 'nice', nodesize = 'Node_Degree',
                    nodecolor = 'Activity_Score', edgewidth = 'NO')

networkVis(results = result2, layout= 'nice', nodesize= 'Node_Degree',
                  nodecolor= 'Activity_Score', edgewidth= 'NO')

By calling one of the two functions shown in example 3 either an interactive or non interactive visual network will be displayed based on the outputs from non_parial_cor() or partial_cor() functions.

kg737/INDEED_Patch documentation built on May 22, 2019, 6:32 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

kg737/INDEED_Patch
Integrated Differential Expression and Differential Network Analysis (INDEED)

In kg737/INDEED_Patch: Integrated Differential Expression and Differential Network Analysis (INDEED)

Introduction

partial correlation data preprocessing function `pre_partial()`

partial correlation result calculation function `partial_cor()`

non-partial correlation data preprocessing function `non_partial_cor()`

Non-Interactive Network Visualization function `network_display()`

Interactive Network Visualization function `networkVis()`

More Examples

Example 1

Example 2

Example 3

R Package Documentation

Browse R Packages

We want your feedback!

kg737/INDEED_Patch Integrated Differential Expression and Differential Network Analysis (INDEED)

In kg737/INDEED_Patch: Integrated Differential Expression and Differential Network Analysis (INDEED)

Introduction

partial correlation data preprocessing function pre_partial()

partial correlation result calculation function partial_cor()

non-partial correlation data preprocessing function non_partial_cor()

Non-Interactive Network Visualization function network_display()

Interactive Network Visualization function networkVis()

More Examples

Example 1

Example 2

Example 3

R Package Documentation

Browse R Packages

We want your feedback!

kg737/INDEED_Patch
Integrated Differential Expression and Differential Network Analysis (INDEED)

partial correlation data preprocessing function `pre_partial()`

partial correlation result calculation function `partial_cor()`

non-partial correlation data preprocessing function `non_partial_cor()`

Non-Interactive Network Visualization function `network_display()`

Interactive Network Visualization function `networkVis()`