createPRPS | R Documentation |
This function is part of the data analysis functionality of tcgaCleaneR. It creates pseudo-replicates
of pseudo-samples (PRPS) for sources of unwanted variation such as library size, batch and purity in TCGA Pan-Cancer
datasets that have cancer biology annotated: breast cancer (BRCA), lung cancer (LUAD), colon cancer (COAD) and rectum
cancer (READ). In this function, batch refers to sources of batch-effect variation such as time and plate, which capture
variation across biology, whereas factors such as purity capture variation within biology.
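The idea of a batch pseudo-sample can be illustrated with a small base-R sketch. This is not the package's implementation: the toy matrix, the `info` data frame and the `min_samples` threshold (standing in for `minSamplesPerBatchPS`) are assumptions made for illustration. It averages samples that share the same biology within the same batch, and treats pseudo-samples of one biology from different batches as a pseudo-replicate set.

```r
# Toy data: 4 genes x 8 samples (all values are made up for illustration).
set.seed(1)
expr <- matrix(rnorm(32), nrow = 4,
               dimnames = list(paste0("gene", 1:4), paste0("s", 1:8)))
info <- data.frame(
  Subtypes = c("A", "A", "A", "A", "A", "B", "B", "B"),
  Year     = c("2010", "2010", "2010", "2011", "2011", "2010", "2010", "2011"),
  row.names = colnames(expr)
)
min_samples <- 2  # hypothetical stand-in for minSamplesPerBatchPS

# Group samples by biology x batch and keep only groups that are large enough.
group <- paste(info$Subtypes, info$Year, sep = "_")
keep  <- names(which(table(group) >= min_samples))

# One pseudo-sample per retained group: the gene-wise mean over its samples.
pseudo <- sapply(keep, function(g) rowMeans(expr[, group == g, drop = FALSE]))

# A biology forms a pseudo-replicate set only if it has pseudo-samples
# in at least two batches.
biology_of <- sub("_.*$", "", colnames(pseudo))
replicated <- names(which(table(biology_of) >= 2))
```

In this toy setup, subtype B has only one batch with enough samples, so only subtype A yields a pseudo-replicate set.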
createPRPS(
  expr.data,
  sample.info,
  librarySize,
  batch,
  biology,
  purity,
  include.ls,
  include.purity,
  minSamplesPerBatchPS,
  minSamplesForPuirtyPS,
  minSamplesForPurityPerBiology,
  minSamplesForLibrarySizePerBatch,
  minSamplesForLibrarySizePS
)
expr.data |
S4 data object: Cancer gene expression data |
sample.info |
S4 data object: Sample information for the cancer data |
librarySize |
character: Library Size variable in input |
batch |
character: Batch effect factors. In the current package version, batch can take values such as 'Year', 'Plate' or both |
biology |
character: Biology of the cancer type. TCGA datasets have biology for only four cancer types: lung (LUAD), breast (BRCA), rectum (READ) and colon (COAD), so the function supports only these four datasets for RUV-III and PRPS analysis. Default is 'Subtypes'. |
purity |
character: Purity variable in input data object |
include.ls |
logical: Whether to consider library size when creating pseudo-samples |
include.purity |
logical: Whether to consider purity when creating pseudo-samples |
minSamplesPerBatchPS |
numeric: Minimum number of samples per batch for creating pseudo-samples |
minSamplesForPuirtyPS |
numeric: Minimum number of samples for creating pseudo-samples for purity |
minSamplesForPurityPerBiology |
numeric: Minimum number of samples per biology for creating pseudo-samples for purity |
minSamplesForLibrarySizePerBatch |
numeric: Minimum number of samples per batch for creating pseudo-samples for library size |
minSamplesForLibrarySizePS |
numeric: Minimum number of samples for creating pseudo-samples for library size |
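As a rough illustration of how the two purity thresholds could interact, the sketch below shows one plausible reading, assumed here rather than taken from the package source. A biology is used only if it has at least `min_per_biology` samples (standing in for `minSamplesForPurityPerBiology`), and the `n_ps` samples with the lowest and highest purity (standing in for `minSamplesForPuirtyPS`) are averaged into a low-purity and a high-purity pseudo-sample. All data and variable names are hypothetical.

```r
# Toy data for a single biology: 3 genes x 12 samples, with made-up purity scores.
set.seed(2)
n      <- 12
expr   <- matrix(rnorm(3 * n), nrow = 3,
                 dimnames = list(paste0("gene", 1:3), paste0("s", 1:n)))
purity <- sample(seq(0.30, 0.85, length.out = n))  # shuffled purity values

min_per_biology <- 12  # hypothetical stand-in for minSamplesForPurityPerBiology
n_ps            <- 3   # hypothetical stand-in for minSamplesForPuirtyPS

pair <- NULL
if (ncol(expr) >= min_per_biology) {
  ord  <- order(purity)       # sample indices from lowest to highest purity
  low  <- head(ord, n_ps)     # the n_ps least pure samples
  high <- tail(ord, n_ps)     # the n_ps most pure samples
  # Two pseudo-samples that share biology but differ in purity.
  pair <- cbind(low  = rowMeans(expr[, low,  drop = FALSE]),
                high = rowMeans(expr[, high, drop = FALSE]))
}
```

The same pattern, applied within each batch and sorted by library size instead of purity, would be a natural reading of `minSamplesForLibrarySizePerBatch` and `minSamplesForLibrarySizePS`, but that too is an assumption.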
An S4 list object containing the pseudo-replicates of pseudo-samples for the different batches, library size and purity.
## Not run: 
createPRPS(expr.data, sample.info, librarySize = 'ls', batch = c('Year', 'Plates'),
           biology = 'Subtypes', purity = 'Purity_singscore',
           include.ls = TRUE, include.purity = TRUE,
           minSamplesPerBatchPS = 3, minSamplesForPuirtyPS = 3,
           minSamplesForPurityPerBiology = 12,
           minSamplesForLibrarySizePerBatch = 6,
           minSamplesForLibrarySizePS = 3)
## End(Not run)