ExecuteWSNF: Execute the WSNF(Weighted Similarity Network Fusion)
In taoshengxu/CancerSubtypes: Cancer subtypes identification, validation and visualization based on multiple genomic data sets

Description Usage Arguments Value References See Also Examples

WSNF is a caner subtype identificaton method with the assistance of the gene regulatory network information. The basic idea of the WSNF is to set the different regulatory importance(ranking) for each feature. In the WSNF manuscript, WSNF makes use of the miRNA-TF-mRNA regulatory network to take the importance of the features into consideration.

ExecuteWSNF(
  datasets,
  feature_ranking,
  beta = 0.8,
  clusterNum,
  K = 20,
  alpha = 0.5,
  t = 20,
  plot = TRUE
)

`datasets`	A list containing data matrices. For each data matrix, the rows represent genomic features, and the columns represent samples.
`feature_ranking`	A list containing numeric vetors. The length of the feature_ranking list should equal to the length of datasets list. For each numeric vetor represents the ranking of each feature in the corresponding data matrix. The order of the ranking should also mathch the order of the features in the corresponding data matrix. We proive a ranking list for most mRNA, TF(transcription factor) and miRNA features. The ranking for features caculated based on the miRNA-TF-miRNA regulatory network which was promoted in our published work: Identifying Cancer Subtypes from miRNA-TF-mRNA Regulatory Networks and Expression Data(PLos One,2016).
`beta`	A tuning parameter for the feature_ranking contributes the weight of each feature. A linear model is applied to integrate feature_ranking and MAD(median absolute deviation) to generated the final weight for each feature using for the algorithm. The final weight is cauculated as the formula below: Weight(f_i)=beta * feature_ranking + (1-beta) MAD(f_i)
`clusterNum`	A integer representing the return cluster number
`K`	Number of nearest neighbors
`alpha`	Variance for local model
`t`	Number of iterations for the diffusion process
`plot`	Logical value. If true, draw the heatmap for the distance matrix with samples ordered to form clusters.

A list with the following elements.

group : A vector represent the group of cancer subtypes. The order is corresponding to the the samples in the data matrix.

This is the most important result for all clustering methods, so we place it as the first component. The format of group is consistent across different algorithms and therefore makes it convenient for downstream analyses. Moreover, the format of group is also compatible with the K-means result and the hclust (after using the cutree() function).
distanceMatrix : It is a sample similarity matrix. The more large value between samples in the matrix, the more similarity the samples are.

We extracted this matrix from the algorithmic procedure because it is useful for similarity analysis among the samples based on the clustering results.
originalResult : The clustering result of the original SNF algorithm.

Different clustering algorithms have different output formats. Although we have the group component which has consistent format for all of the algorithms (making it easy for downstream analyses), we still keep the output from the original algorithms.

Xu, T., Le, T. D., Liu, L., Wang, R., Sun, B., & Li, J. (2016). Identifying cancer subtypes from mirna-tf-mrna regulatory networks and expression data. PloS one, 11(4), e0152792.

ExecuteSNF

data(GeneExp)
data(miRNAExp)
GBM=list(GeneExp,miRNAExp)
###1. Use the defualt ranking in the package.
data(Ranking)
####Retrieve the feature ranking for genes
gene_Name=rownames(GeneExp)
index1=match(gene_Name,Ranking$mRNA_TF_miRNA.v21_SYMBOL)
gene_ranking=data.frame(gene_Name,Ranking[index1,],stringsAsFactors=FALSE)
index2=which(is.na(gene_ranking$ranking_default))
gene_ranking$ranking_default[index2]=min(gene_ranking$ranking_default,na.rm =TRUE)

####Retrieve the feature ranking for miRNAs
miRNA_ID=rownames(miRNAExp)
index3=match(miRNA_ID,Ranking$mRNA_TF_miRNA_ID)
miRNA_ranking=data.frame(miRNA_ID,Ranking[index3,],stringsAsFactors=FALSE)
index4=which(is.na(miRNA_ranking$ranking_default))
miRNA_ranking$ranking_default[index4]=min(miRNA_ranking$ranking_default,na.rm =TRUE)
###Clustering
ranking1=list(gene_ranking$ranking_default ,miRNA_ranking$ranking_default)
result1=ExecuteWSNF(datasets=GBM, feature_ranking=ranking1, beta = 0.8, clusterNum=3, 
                   K = 20,alpha = 0.5, t = 20, plot = TRUE)

###2. User input ranking
# Fabricate a ranking list for demonstrating the examples.
ranking2=list(runif(nrow(GeneExp), min=0, max=1),runif(nrow(miRNAExp), min=0, max=1))
result2=ExecuteWSNF(datasets=GBM, feature_ranking=ranking2, beta = 0.8, clusterNum=3, 
                   K = 20,alpha = 0.5, t = 20, plot = TRUE)

taoshengxu/CancerSubtypes documentation built on Dec. 23, 2021, 7:46 a.m.

taoshengxu/CancerSubtypes index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

taoshengxu/CancerSubtypes
Cancer subtypes identification, validation and visualization based on multiple genomic data sets

ExecuteWSNF: Execute the WSNF(Weighted Similarity Network Fusion)
In taoshengxu/CancerSubtypes: Cancer subtypes identification, validation and visualization based on multiple genomic data sets

Description

Usage

Arguments

Value

References

See Also

Examples

Related to ExecuteWSNF in taoshengxu/CancerSubtypes...

R Package Documentation

Browse R Packages

We want your feedback!

taoshengxu/CancerSubtypes Cancer subtypes identification, validation and visualization based on multiple genomic data sets

ExecuteWSNF: Execute the WSNF(Weighted Similarity Network Fusion) In taoshengxu/CancerSubtypes: Cancer subtypes identification, validation and visualization based on multiple genomic data sets

Description

Usage

Arguments

Value

References

See Also

Examples

Related to ExecuteWSNF in taoshengxu/CancerSubtypes...

R Package Documentation

Browse R Packages

We want your feedback!

taoshengxu/CancerSubtypes
Cancer subtypes identification, validation and visualization based on multiple genomic data sets

ExecuteWSNF: Execute the WSNF(Weighted Similarity Network Fusion)
In taoshengxu/CancerSubtypes: Cancer subtypes identification, validation and visualization based on multiple genomic data sets