\let\OldRule\rule \renewcommand{\rule}[2]{\OldRule{\linewidth}{#2}} \renewcommand{\contentsname}{R topics documented:}

Package: metaITH
Title: A Tool for Analyzing Intratumor Heterogeneity at Multiple Levels
Version: 0.0.0.9000
Author: Anchal Sharma, aanchalsharma833@gmail.com, Elise Merritt, em575@scarletmail.rutgers.edu, Subhajyoti De, sd948@cinj.rutgers.edu
Description: This package allows for analysis of intratumor heterogeneity at the DNA, RNA, and immunogenic levels.
Depends: R (>= 3.5.0), ggplot2, reshape2, ape, phylobase, dplyr, matrixStats
License: MIT
Encoding: UTF-8
LazyData: true
RoxygenNote: 6.1.1
Suggests: knitr, rmarkdown
VignetteBuilder: knitr

\tableofcontents \newpage

Installation

Must have R version 3.5 or higher before installation

install.packages(devtools)
library(devtools)
install_packages("sjdlabgroup/metaITH")
library(metaITH)

To run metaITH using example datasets:
1. Go to https://github.com/sjdlabgroup/metaITH
2. Click on the green "Clone or download" button at the top right and select "Download ZIP"
3. Unzip the downloaded folder. Example datasets are in the directory input_files
4. Before running any of the code snippets under header "Example" in this vignette, set the working directory to the folder the example file is in.*

*The folder each example file is in is specified under header "Arguments". For example, to run the first function "signature_analysis" using provided example dataset "RNA_matrix.txt" you would first set the working directory to metaITH/input_files/sig_analysis_inputs.

To run metaITH using user-provided datasets:
1. Set the working directory to the folder that contains the datasets before using any functions.

Functions

Signatures


signature_analysis


Description
Proliferation, hypoxia, EMT, apoptosis, anti-PD1 favor and drug resistance scores are calculated for each tumor sample using gene expression data [log2 (TPM+1)] for genes that comprise of respective signatures. This one function can calculate all scores in one go, or a specific analysis for only one signature.

Usage
signature_analysis(rna_expression_matrix_file, type)

Arguments

Details
Takes in RNA expression matrix containing the expression level of all genes in all tumor samples and normals. The expression data may be from bulk sequencing or single cell sequencing data. Creates a z-score matrix and uses this matrix to find the hypoxia, proliferation, apoptosis, drug-resistance, emt, and anti-PD1 favor scores for each tumor sample or region, depending on input string given. Outputs a table and heatmap for each of these scores.

Value
This function returns up to thirteen files:

  1. A table of hypoxia scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Hypoxia score" shows the hypoxia score for each tumor region or sample. Example file "Hypoxia_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  2. A .tiff image file visualizing the hypoxia scores as a heatmap. Example file "Hypoxia_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  3. A table of proliferation scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Proliferation score" shows the proliferation score for each tumor region or sample. Example file "Proliferation_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  4. A .tiff image file visualizing the proliferation scores as a heatmap. Example file "Proliferation_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  5. A table of apoptosis scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Apoptosis score" shows the apoptosis score for each tumor region or sample. Example file "Apoptosis_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  6. A .tiff image file visualizing the apoptosis scores as a heatmap. Example file "Apoptosis_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  7. A table of drug-resistance scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Pemetrexed resistance score" shows the drug-resistance score for each tumor region or sample. Example file "Pemetrexed_resistance_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  8. A .tiff image file visualizing the drug-resistance scores as a heatmap. Example file "Pemetrexed_resistance_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  9. A table of emt scores. First column is "Gene", followed by one column for each tumor region or sample. Three rows, one for sample names, one called "Epithelial score" shows the epithelial score for each tumor region or sample, one called "Mesenchymal score" shows the mesenchymal score for each tumor region or sample. Example file "Epithelial_Mesenchymal_scores.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  10. A table of mesenchymal-epithelial scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "M-E score" shows the mesenchymal-epithelial score for each tumor region or sample. Example file "Mesenchymal-Epithelial_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  11. A .tiff image file visualizing the mesenchymal-epithelial scores as a heatmap. Example file "Mesenchymal-Epithelial_score_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  12. A table of anti-PD1 favor scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "anti-PD1 favor score" shows the anti-PD1 favor score for each tumor region or sample. Example file "anti-PD1_favor_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  13. A .tiff image file visualizing the anti-PD1 scores as a heatmap. Example file "anti-PD1_favor_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

require("knitr")
opts_knit$set(root.dir = "~/proliferative_index_files/newid_PC_analysis/metaITH/R")
library(metaITH)
signature_analysis("RNA_matrix.txt", "all")

read.table("Proliferation_score.txt", sep="\t", header=T)[,1:4]

z_score_calculations


Description
Calculates z-scores for all genes in all tumor samples. This function must be run before running the function "specified_geneset.

Usage
z_score_calculations(rna_expression_matrix_file)

Arguments

Details
Takes in RNA expression matrix containing the expression level (log2(TPM+1) or other measure) of all genes in all tumor and normal samples. The data may be from bulk sequencing or single cell sequencing data. Output is a matrix containing z-scores for all genes in the entire tumor samples and writes a tab-delimited text file. Z-score for each gene in each tumor sample is calculated using mean and standard deviation calculated form respective gene expression across all normal samples.

Value
This function returns one file:
1. A table of z-score values. First column contains all genes found in RNA expression matrix. Subsequent columns contain z-score of each gene for each sample. Example file "z-scores_matrix.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

z_score_calculations("RNA_matrix.txt")  
read.table("z-scores_matrix.txt", sep="\t", header=T)[1:4,1:4]  

specified_geneset_signature


Description
z-score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for user specified number of genes (in the form of a gene list).

Usage
specified_geneset_signature(z_score_matrix_file, geneset)

Arguments
Both files must be placed in the same folder before running function.

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of scores for each tumor sample and a heatmap of the scores. For each gene, z-scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then score for each tumor sample is defined as mean z-scores of the genes involved.

Value
This function returns two files:
1. A table of geneset scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Geneset" score" shows the geneset score for each tumor region or sample. Example file "Geneset_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.
2. A .tiff image file of a heatmap. Example file "Geneset_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

specified_geneset_signature("z-scores_matrix.txt", "proliferation_geneset.txt")
read.table("Geneset_score.txt", sep="\t", header=T)[,1:4]

Dendrograms


dna_dendrograms


Description
Calculates distance matrix from variant allele frequency data of different regions of a tumor and normal samples and generates a unrooted dendrogram from it using neighbor joining method. Apart from multi-region sequencing data, it could also be for any set of tumor or normal samples.

Usage
dna_dendrograms(dna_dendro_list_file)

Arguments

Details
Takes in a text file containing list of file names. Each file contains a matrix with variant allele frequency of all the variations in each tumor region and normal. Creates a DNA distance matrix, containing the distance between all the tumor regions and normal, a DNA tree topology file, and an unrooted DNA dendrogram. Distance is defined as the average difference of variant allele frequencies across all variations between any two regions (or samples).

Value
This function returns three files:
1. DNA distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "DNA_distance_matrix_DNA_S1_frequency_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
2. A text file containing information about the tree topology. Example file "DNA_tree_DNA_S1_frequency_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
3. A .tiff image file of an unrooted dendrogram. Example file "DNA_unrooted_dendrogram_DNA_S1_frequency_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

Example

dna_dendrograms("DNA_dendro_list.txt")
read.table("DNA_distance_matrix_DNA_S1_frequency_matrix.txt", sep="\t", header=T)

immune_dendrograms


Description
Calculates distance matrix from proportion of different immune cell types data (output obtained from CIBERSORT) of different regions of a tumor and normal samples and generates a unrooted dendrogram from it using neighbor joining method. Apart from multi-region sequencing data, it could also be for any set of tumor or normal samples.

Usage
immune_dendrograms(immune_dendro_list_file)

Arguments

Details
Takes in a text file containing list of file names. Each file contains a matrix with different immune cell type proportions in each tumor region and normal. Creates a immune distance matrix, containing the distance between all the tumor regions and normal, a immune tree topology file, and an unrooted immune dendrogram. Distance is defined as the average difference of immune cell type proportions across all cell types between any two regions (or samples).

Value
This function returns three files:
1. Immune distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "Immune_distance_matrix_S1_Immune_CIBERSORT_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
2. A text file containing information about the tree topology. Example file "Immune_tree_S1_Immune_CIBERSORT_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
3. A .tiff image file of an unrooted dendrogram. Example file "Immune_unrooted_dendrogram_S1_Immune_CIBERSORT_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

Example

immune_dendrograms("Immune_dendro_list.txt")
read.table("Immune_distance_matrix_S1_Immune_CIBERSORT_matrix.txt", sep="\t", header=T)

rna_dendrograms


Description
Calculates distance matrix from gene expression data (log2(TPM+1)) of different regions of a tumor and normal samples and generates a unrooted dendrogram from it using neighbor joining method. Apart from multi-region sequencing data, it could also be for any set of tumor or normal samples.

Usage
rna_dendrograms(rna_dendro_list_file)

Arguments

Details

Takes in a text file containing list of file names. Each file contains a matrix with gene expression data of all genes in each tumor region and normal. Creates a RNA distance matrix, containing the distance between all the tumor regions and normal, a RNA tree topology file, and an unrooted RNA dendrogram. Distance is defined as the average difference of gene expression values across all genes between any two regions (or samples).

Value
This function returns three files:
1. RNA distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "RNA_distance_matrix_S1_RNA_expression_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
2. A text file containing information about the tree topology. Example file "RNA_tree_S1_RNA_expression_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.
3. A .tiff image file of an unrooted dendrogram. Example file "RNA_unrooted_dendrogram_S1_RNA_expression_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

Example

rna_dendrograms("RNA_dendro_list.txt")
read.table("RNA_distance_matrix_S1_RNA_expression_matrix.txt", sep="\t", header=T)

snv_heatmaps


Description
Generates a heatmap of variant allele frequencies of somatic variations in different regions of one tumor.

Usage
snv_heatmaps(dna_dendro_list_file)

Arguments

Details
Takes in a text file containing list of file names. Each file contains a matrix with variant allele frequency of all the variations in each tumor region and normal. Creates a heatmap of the variant allele frequencies.

Value
This function returns one file:
1. A .tiff image file of a heatmap. Example file "SNV_heatmap_DNA_S1_frequency_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

Example

snv_heatmaps("DNA_dendro_list.txt")

Multi-region Divergence and Diversity


multi_level_divergence_diversity


Description
Provides an estimate of the ratio of intra-patient regional diversity to tumor-benign tissue divergence.

Usage
multi_level_divergence_diversity(sample_names_file)

Arguments
Requires output of functions dna_dendrograms, rna_dendrograms, and immune_dendrograms to be in same folder as sample_names_file * sample_names_file: A text file containing list of sample names, one file name per line. Example of list of sample files "sample_names.txt" can be seen under metaITH/input_files/multiregion_inputs. Example of individual matrix files "DNA_S1_frequency_matrix.txt", "S1_Immune_CIBERSORT_matrix.txt", and "S1_RNA_expression_matrix.txt" can be seen under metaITH/input_files/multiregion_inputs.

Details
Takes in text file containing list of samples, one sample per line, and distance matrix files generated by dna_dendrograms, rna_dendrograms, and immune_dendrograms methods. Outputs a png file containing 6 plots, 3 diversity/divergence plots and 3 diversity plots.

Value
This function returns one file:
1. A .png image file. Example file "png_multiomics_ITH_comparison.png" can be seen under metaITH/output_files/multiregion_outputs.

Example

multi_level_divergence_diversity("sample_names.txt")

Utilities


metaITH_analysis


Description
Runs a comprehensive intra-tumor heterogeneity analyses including dendrogram analysis (using DNA, RNA, and immune expression data), signature analysis (using RNA expression data), and multi-region diversity and divergence analysis (using DNA, RNA, and immune expression data) in one go.

Usage
metaITH_analysis(dna_dendro_list, rna_dendro_list, immune_dendro_list, multi_region_sample_names, rna_expression_matrix)

Arguments
All input files must be placed in same folder.

Details
Analyzes several sample files by running a phylogram analysis (using DNA, RNA, and immune expression data), signature analysis (using RNA expression data), and multi-region diversity and divergence analysis (using DNA, RNA, and immune expression data).

Value
This function returns twenty-three files:

  1. DNA distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "DNA_distance_matrix_DNA_S1_frequency_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  2. A text file containing information about the DNA tree topology. Example file "DNA_tree_DNA_S1_frequency_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  3. A .tiff image file of an unrooted DNA dendrogram. Example file "DNA_unrooted_dendrogram_DNA_S1_frequency_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

  4. RNA distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "RNA_distance_matrix_S1_RNA_expression_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  5. A text file containing information about the RNA tree topology. Example file "RNA_tree_S1_RNA_expression_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  6. A .tiff image file of an unrooted RNA dendrogram. Example file "RNA_unrooted_dendrogram_S1_RNA_expression_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

  7. Immune distance matrix is a matrix of all samples versus all samples. Each entry in the table corresponds to the distance between the row sample and column sample. Example file "Immune_distance_matrix_S1_Immune_CIBERSORT_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  8. A text file containing information about the immune tree topology. Example file "Immune_tree_S1_Immune_CIBERSORT_matrix.txt" can be seen under metaITH/output_files/dendro_outputs.

  9. A .tiff image file showing an unrooted immune dendrogram. Example file "Immune_unrooted_dendrogram_S1_Immune_CIBERSORT_matrix.txt.tiff" can be seen under metaITH/output_files/dendro_outputs.

  10. A .png image file containing 6 plots, 3 diversity/divergence plots and 3 diversity plots. Example file "png_multiomics_ITH_comparison.png" can be seen under metaITH/output_files/multiregion_outputs.

  11. A tab delimited .txt file with hypoxia scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Hypoxia score" shows the hypoxia score for each tumor region or sample. Example file "Hypoxia_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  12. A .tiff image file visualizing the hypoxia scores as a heatmap. Example file "Hypoxia_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  13. A tab delimited .txt file with proliferation scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Proliferation score" shows the proliferation score for each tumor region or sample. Example file "Proliferation_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  14. A .tiff image file visualizing the proliferation scores as a heatmap. Example file "Proliferation_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  15. A tab delimited .txt file with apoptosis scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Apoptosis score" shows the apoptosis score for each tumor region or sample. Example file "Apoptosis_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  16. A .tiff image file visualizing the apoptosis scores as a heatmap. Example file "Apoptosis_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  17. A tab delimited .txt file with drug-resistance scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Pemetrexed resistance score" shows the drug-resistance score for each tumor region or sample. Example file "Pemetrexed_resistance_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  18. A .tiff formatted image file visualizing the drug-resistance scores as a heatmap. Example file "Pemetrexed_resistance_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  19. A tab delimited .txt file with epithelial and mesenchymal scores. First column is "Gene", followed by one column for each tumor region or sample. Three rows, one for sample names, one called "Epithelial score" shows the epithelial score for each tumor region or sample, one called "Mesenchymal score" shows the mesenchymal score for each tumor region or sample. Example file "Epithelial_Mesenchymal_scores.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  20. A tab delimited .txt file with EMT scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "M-E score" shows the mesenchymal-epithelial score for each tumor region or sample. Example file "Mesenchymal-Epithelial_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  21. A .tiff image file visualizing the EMT scores as a heatmap. Example file "Mesenchymal-Epithelial_score_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

  22. A tab delimited text file with anti-PD1 favor scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "anti-PD1 favor score" shows the anti-PD1 favor score for each tumor region or sample. Example file "anti-PD1_favor_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.

  23. A .tiff image file visualizing the anti-PD1 scores as a heatmap. Example file "anti-PD1_favor_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

metaITH_analysis("DNA_dendro_list.txt", "RNA_dendro_list.txt", "Immune_dendro_list.txt", "sample_names.txt", "RNA_matrix.txt")
read.table("DNA_distance_matrix_DNA_S1_frequency_matrix.txt", sep="\t", header=T)
read.table("Proliferation_score.txt", sep="\t", header=T)[,1:4]

Signature Utility Functions

NOTE: Functions in this section are included for information purposes only, as they are called by the function "signature_analysis". The recommended way to run an analysis on a specific signature, use the function "signature_analysis" and specify the desired signature as part of the parameter "type".


anti_pd1_favor_signature


Description
Calculates anti-PD1 favor score for each tumor sample using gene expression data [log2 (TPM+1)] for 28 genes that comprise of anti-PD1 favor signature (Gibney et al., 2016). This score is a measure of responsiveness against anti-PD1 therapy. Higher score represents higher responsiveness.

Usage
anti_pd1_favor_signature(z_score_matrix_file)

Arguments

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of anti-PD1 favor scores for each tumor sample or region and a heatmap of the anti-PD1 favor scores. For each gene, Z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then anti-PD1 favor score for each tumor sample is defined as mean z-scores of 28 genes involved. A list of the 28 genes, "anti-PD1_favor_geneset.txt", is included in metaITH/genesets.

Value
This function returns two files:
1. A tab-delimited text file with anti-PD1 scores. First column is "Gene," followed by one column for each tumor region or sample. Two rows, one for sample names, one called “anti-PD1 favor score” shows the anti-PD1 favor score for each tumor region or sample. Example file "anti-PD1_favor_score.txt" can be seen under output_files/sig_analysis_outputs.
2. A .tiff image file of a heatmap. Example file "anti-PD1_favor_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

anti_pd1_favor_signature("z-scores_matrix.txt")  
read.table("anti-PD1_favor_score.txt", sep="\t", header=T)[,1:4]   

apoptosis_signature


Description
Apoptosis score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for 6 genes known to be associated with apoptosis. This score is a measure of apoptotic activity. Higher score represents more apoptosis.

Usage
apoptosis_signature(z_score_matrix_file)

Arguments

Details

Takes in a z-score matrix created by function z_score_calculations. Outputs a table of apoptosis scores for each tumor sample or region and a heatmap of the apoptotic scores. For each gene, Z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then apoptotic score for each tumor sample is defined as mean z-scores of 6 genes involved. A list of the 6 genes, "apoptosis_geneset.txt", is included in metaITH/genesets.

Value
This function returns two files:
1. A table of apoptosis scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Apoptosis score" shows the apoptosis score for each tumor region or sample. Example file "Apoptosis_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs. 2. A .tiff image file of a heatmap. Example file "Apoptosis_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

apoptosis_signature("z-scores_matrix.txt")  
read.table("Apoptosis_score.txt", sep="\t", header=T)[,1:4]  

drug_resistance_signature


Description
Drug resistance score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for 23 genes that comprise of Drug resistance signature (Hou et al., 2012). This score is a measure of Pemetrexed drug resistance. Higher score represents higher resistance.

Usage
drug_resistance_signature(z_score_matrix_file)

Arguments

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of Drug resistance scores for each tumor sample or region and a heatmap of the Drug resistance scores. For each gene, Z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then Drug resistance score for each tumor sample is defined as mean z-scores of 23 genes involved. A list of the 23 genes, "pemetrexed_resistance_geneset.txt", is included in metaITH/genesets.

Value
This function returns two files:
1. A tab delimited .txt file with Drug resistance scores. First column is “Gene”, followed by one column for each tumor region or sample. Two rows, one for sample names, one called “Drug resistance score” shows the Drug resistance score for each tumor region or sample. Example file "Pemetrexed_resistance_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.
2. A .tiff image file of a heatmap. Example file "Pemetrexed_resistance_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

drug_resistance_signature("z-scores_matrix.txt")  
read.table("Pemetrexed_resistance_score.txt", sep="\t", header=T)[,1:4]  

emt_signature


Description
EMT score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for 60 genes that comprise of EMT signature (Ramaker et al., 2017). This gene signature includes 49 epithelial related genes (e-genes) and 11 mesenchymal related genes (m-genes). This score is a measure of epithelial/mesenchymal nature of the tumor. Higher score represents more mesenchymal like features.

Usage
emt_signature(z_score_matrix_file)

Arguments

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of EMT scores for each tumor sample and a heatmap of the EMT scores. For each of e-genes and m-genes, z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then EMT score for each tumor sample is defined as:
EM score = av. Z-score of m-genes – av. z-score of e-genes
A list of the 49 epithelial genes, "epithelial_geneset.txt", and a list of the 11 mesenchymal genes, "mesenchymal_geneset.txt" is included in metaITH/genesets.

Value
This function returns three files:
1. A table of emt scores. First column is "Gene", followed by one column for each tumor region or sample. Three rows, one for sample names, one called "Epithelial score" shows the epithelial score for each tumor region or sample, one called "Mesenchymal score" shows the mesenchymal score for each tumor region or sample. Example file "Epithelial_Mesenchymal_scores.txt" can be seen under metaITH/output_files/sig_analysis_outputs.
2. A table of mesenchymal-epithelial scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "M-E score" shows the mesenchymal-epithelial score for each tumor region or sample. Example file "Mesenchymal-Epithelial_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs. 3. A .tiff image file of a heatmap. Example file "Mesenchymal-Epithelial_score_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

emt_signature("z-scores_matrix.txt")

read.table("Mesenchymal-Epithelial_score.txt", sep="\t", header=T)[,1:4]

hypoxia_signature


Description
Hypoxia score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for 21 genes that comprise of Hypoxia signature (Buffa et al., 2010). This score is a measure of extent of hypoxia in tumors. Higher score represents more hypoxia.

Usage
hypoxia_signature(z_score_matrix_file)

Arguments

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of Hypoxia scores for each tumor sample or region and a heatmap of the Hypoxia scores. For each gene, Z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then Hypoxia score for each tumor sample is defined as mean z-scores of the genes involved. A list of the 21 genes involved, "hypoxia_geneset.txt", is included in metaITH/genesets.

Value
This function returns two files:
1. A table of hypoxia scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Hypoxia score" shows the hypoxia score for each tumor region or sample. Example file "Hypoxia_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.
2. A .tiff image file of a heatmap. Example file "Hypoxia_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

hypoxia_signature("z-scores_matrix.txt")

read.table("Hypoxia_score.txt", sep="\t", header=T)[,1:4]

proliferation_signature


Description
Proliferation score is calculated for each tumor sample using gene expression data [log2 (TPM+1)] for 121 genes that comprise of Proliferation signature (Wilkerson et al., 2012). This score is a measure of proliferation potential. Higher score represents higher proliferation potential.

Usage
proliferation_signature(z_score_matrix_file)

Arguments

Details
Takes in a z-score matrix created by function z_score_calculations. Outputs a table of Proliferation scores for each tumor sample and a heatmap of the Proliferation scores. For each gene, Z- scores corresponding to its expression in tumor is calculated based on mean and standard deviation of its expression in the normal samples, and then Proliferation score for each tumor sample is defined as mean z-scores of the genes involved. A list of the 121 genes involved, "proliferation_geneset.txt", is included in metaITH/genesets.

Value
This function returns two files:
1. A table of proliferation scores. First column is "Gene", followed by one column for each tumor region or sample. Two rows, one for sample names, one called "Proliferation score" shows the proliferation score for each tumor region or sample. Example file "Proliferation_score.txt" can be seen under metaITH/output_files/sig_analysis_outputs.
2. A .tiff image file of a heatmap. Example file "Proliferation_scores_heatmap.tiff" can be seen under metaITH/output_files/sig_analysis_outputs.

Example

proliferation_signature("z-scores_matrix.txt")
read.table("Proliferation_score.txt", sep="\t", header=T)[,1:4]

\newpage

Index

Dendrograms
       dna_dendrograms, 6
       immune_dendrograms, 7
       rna_dendrograms, 9
       snv_heatmaps, 9
Signatures
       signature_analyis, 2
       specified_geneset_analysis, 5
       z_score_calculations, 4
Multi-Level Divergence and Diversity
       multi_level_divergence_diversity, 9
Utilities
       metaITH_analysis, 10
Signature Utilities
       anti_pd1_favor_signature, 14
       apoptosis_signature, 15
       drug_resistance_signature, 16
       emt_signature, 17
       hypoxia_signature, 18
       proliferation_signature, 18

\newpage

References

  1. Gibney, G.T., Weiner, L.M., and Atkins, M.B. (2016). Predictive biomarkers for checkpoint inhibitor-based immunotherapy. Lancet. Oncol. 17, e542–e551.
  2. Hou, J., Lambers, M., den Hamer, B., den Bakker, M.A., Hoogsteden, H.C., Grosveld, F., Hegmans, J., Aerts, J., and Philipsen, S. (2012). Expression Profiling-Based Subtyping Identifies Novel Non-small Cell Lung Cancer Subgroups and Implicates Putative Resistance to Pemetrexed Therapy. J. Thorac. Oncol. 7, 105–114
  3. Ramaker, R.C., Lasseigne, B.N., Hardigan, A.A., Palacio, L., Gunther, D.S., Myers, R.M., and Cooper, S.J. (2017). RNA sequencing-based cell proliferation analysis across 19 cancers identifies a subset of proliferation-informative cancers with a common survival signature. Oncotarget 8, 38668–38681.
  4. Wilkerson, M.D., Yin, X., Hoadley, K.A., Liu, Y., Hayward, M.C., Cabanski, C.R., Muldrew, K., Miller, C.R., Randell, S.H., Socinski, M.A., et al. (2010). Lung squamous cell carcinoma mRNA expression subtypes are reproducible, clinically important, and correspond to normal cell types. Clin. Cancer Res. 16, 4864–4875.
  5. Buffa, F. M., Harris, A. L., West, C. M., & Miller, C. J. (2010). Large meta-analysis of multiple cancers reveals a common, compact and highly prognostic hypoxia metagene. British journal of cancer, 102(2), 428–435.
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)


sjdlabgroup/metaITH documentation built on Nov. 5, 2019, 8:58 a.m.