ws.trom: Within-species transcriptome mapping

Description Usage Arguments Details Value Author(s) References See Also Examples

Description

This function calculates the TROM scores in comparing/mapping samples from the same species. TROM score = -log10(Bonferroni-corrected p-value from a hypergeometric test), with a minimum value of 0.

Usage

1
2
3
ws.trom(sp_gene_expr = data.frame(), single = TRUE, sp_gene_expr2 = NULL,
        z_thre = 1.5, provide = FALSE, gene_lists = "", 
        save_overlap_genes = FALSE)

Arguments

sp_gene_expr

a data frame containing gene expression estimates of the species; rows correspond to genes; columns (from the second to the last) correspond to samples, with the first column as gene IDs. Not needed if provide = TRUE.

single

a Boolean value indicating whether the within-species comparison will be conducted on a single dataset. If single = FALSE, the users need to provide sp_gene_expr2. Or if provide = TRUE, users need have a second sheet in the .xlsx file (that is to be supplied into gene_lists) to specify the second set of gene lists corresponding to another set of biological samples of the same species. Defaults to TRUE.

sp_gene_expr2

NULL (default) or a data frame containing gene expression estimates of the species; rows correspond to genes; columns (from the second to the last) correspond to samples, with the first column as gene IDs.

z_thre

a numeric value specifiying the Z-score threshold used to select associated genes, whose Z-scores z_thre. Defaults to 1.5. This can be specified by users or calculated using choose.z().

provide

a Boolean value indicating whether associated genes are user-provided. If provide = TRUE, the users need to provide lists of genes that they think can represent the transcriptome characteristics of different samples.

gene_lists

an .xlsx file containing user-provided gene lists. It is required when provide = TRUE.

save_overlap_genes

a Boolean value indicating whether the users want to save overlap genes between every two samples from the species to an .xlsx file. If save_overlap_genes = TRUE, this function outputs the overlap genes to "within-species overlapping genes between sample pairs.xlsx". Defaults to FALSE.

Details

If provide = TRUE, users provide gene lists with gene_lists to calculate the TROM scores; otherwise, the function will automatically select associated genes based on the criterion: Z-scores z_thre.

If single = TRUE, TROM socres are calculated from single dataset of the species. Users should either specify sp_gene_expr or provide gene_lists. If provided, gene_lists should be a one-sheet Excel file. In the Excel file, rows represent gene ids and columns represent biological samples. Each column of the file stores the user-provided genes corresponding to the sample of that column. Please note that different columns may have different numbers of rows.

If single = FALSE, TROM socres are calculated from two different datasets of the species. Users should either specify sp_gene_expr and sp_gene_expr2 or provide gene_lists. If provided, gene_lists should be a two-sheet Excel file with the first sheet for one dataset and the second sheet for the other dataset. In each sheet, rows represent gene ids and columns represent biological samples. Each column of the file stores the user-provided genes corresponding to the sample of that column.

This function outputs the within-species TROM scores into an .xlsx file named "within-species TROM scores.xlsx".

Value

A matrix of within-species TROM scores, where rows and columns correspond to the samples of the species respectively.

Author(s)

Jingyi Jessica Li, Wei Vivian Li

References

Li WV, Chen Y and Li JJ (2016). TROM: A Testing-Based Method for Finding Transcriptomic Similarity of Biological Samples. Statistics in Biosciences. DOI: 10.1007/s12561-016-9163-y

Li JJ, Huang H, Bickel PJ, & Brenner SE (2014). Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA-seq data. Genome Research, 24(7), 1086-1101.

See Also

bs.trom, ws.trom.orthologs

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
## Calculating transcriptome overlap measure within D. melanogaster 

## dm_gene_expr.rda and dm_tissue_expr.rda
## can be downloaded and unzipped from
## http://www.stat.ucla.edu/~jingyi.li/packages/TROM/TROM_Rdata.zip

## Not run: 
load("dm_gene_expr.rda")
## without user-provided gene lists
# use single dataset
dm_trom <- ws.trom(sp_gene_expr = dm_gene_expr, z_thre = 1.5, 
                     provide = FALSE, save_overlap_genes = TRUE)

# use two different dataset
# calculate TROM scores between timecourse and tissue/cell line 
# data within D. melanogaster 
load("dm_tissue_expr.rda")
dm_trom2 <- ws.trom(sp_gene_expr = dm_gene_expr, single = FALSE, 
                    sp_gene_expr2 = dm_tissue_expr, z_thre = 1.5,
                    provide = FALSE, save_overlap_genes = FALSE)
                       
## with user-provided gene lists
gene_lists <- system.file("dm_associated_genes.xlsx", package = "TROM")
dm_trom3 <- ws.trom(provide = TRUE, gene_lists = gene_lists)

## End(Not run)

TROM documentation built on May 1, 2019, 8:07 p.m.