read_expression_data | R Documentation |
Reads gene expression matrix data file which is either tab/comma/white-space value separated text file. After reading in the input file, any gene rows that contain NAs are removed and the data set is screened for duplicates. Duplicate genes are reduced to single gene row entries, with each value corresponding to the mean expression value for each sample for the duplicated gene entry. If duplicate samples are detected then the only the first sample of the duplicates are retained. Finally, the data frame is converted to a numerical matrix, where row names represent gene symbols and columns represent sample names or IDs.
read_expression_data(file)
file |
Path for gene expression matrix file, which is either tab / comma / white-space value separated. The first column must contain gene symbols and the subsequent column names should be the sample name or ID. |
A numerical matrix containing gene symbols/IDs as row names and sample IDs as column names.
Taniya Pal taniya.pal.094@cranfield.ac.uk
## Not run: read_expr_data("/Users/taniyapal/Documents/Group Project/TCGA_unannotated.txt")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.