createRevelioObject: Create Revelio Object.

Description Usage Arguments Details Value

View source: R/functions_InitializationCCPhaseAssignFiltering.R

Description

'createRevelioObject' returns a Revelio object according to the specified parameters.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
createRevelioObject(
  rawData,
  cyclicGenes,
  datasetID = "data1",
  cellType = "unknown",
  sequTechnique = "unknown",
  lowernGeneCutoff = 1200,
  uppernUMICutoff = 10^7,
  ccPhaseAssignBasedOnIndividualBatches = TRUE,
  ccPhaseAssignThresholdForCorSingleGeneToAvgExpression = 0.2,
  ccPhaseAssignMaxDiffOfCCPhasesBetweenHighestScores = 1,
  ccPhaseAssignThresholdHighestPhaseScore = 0.75,
  ccPhaseAssignThresholdSecondHighestPhaseScore = 0.5,
  pcaGenes = "variableGenes",
  ccDurationG1 = 6.88,
  ccDurationS = 8.38,
  ccDurationG2 = 3.05,
  ccDurationM = 1.02,
  ccDurationTotal = 19.33
)

Arguments

rawData

A genes-by-cells matrix of raw sequencing data. Rows should contain gene names, columns should contain cell IDs, entries of the matrix are integer numbers.

cyclicGenes

A matrix containing marker genes for known cell cycle phases. Each column contains a list of known marker genes corresponding to a specific cell cycle phase/phase transition. The name of that transition should be given as the column name.

datasetID

(Optional) String to identify the current analysis.

cellType

(Optional) String that indentifies the cell type of the data that is being analyzed.

sequTechnique

(Optional) String that identifies the sequencing technique used to obtain the raw data.

lowernGeneCutoff

Integer number that defines the minimum number of detected genes per cell. (Lower threshold for quality control.)

uppernUMICutoff

Integer number that defines the maximum number of UMIs per cell. (Upper threshold for removing possible doublets.)

ccPhaseAssignBasedOnIndividualBatches

In case the data is made up of different batches, this boolean decides whether the in silico phase assignment is done for each batch individually or for the entire data set simultaneously. Batch separation during phase assignment is advised since batch effects may otherwise influence phase assignment.

ccPhaseAssignThresholdForCorSingleGeneToAvgExpression

During phase assignment, population expression for individual genes is compared to average population expression of the entire marker gene list. If the correlation is lower than this defined parameter, it is concluded that this gene behaves differently in the investigated data set than expected and is consequently removed from the marker gene set.

ccPhaseAssignMaxDiffOfCCPhasesBetweenHighestScores

If set to 1, it means that the second highest z-score during phase assignment must be contained in one of the neighbouring cell cycle phase to where the highest score is detected.

ccPhaseAssignThresholdHighestPhaseScore

Each cell is assigned a z-score for each of the cell cycle phases contained in 'cyclicGenes'. The phase corresponding to the highest score wins. The highest phase score has to pass this parameter threshold, otherwise the cell is filtered out as its gene expression is cell cycle markers is not strong enough.

ccPhaseAssignThresholdSecondHighestPhaseScore

See 'ccPhaseAssignThresholdHighestPhaseScore'. This parameter defines a similar threshold but for the second highest score.

pcaGenes

If 'variableGenes' then only variable genes are utilized during application of PCA. If 'allGenes' then all genes detected during sequencing are used.

ccDurationG1

Double that estimates the duration in hours of G1 phase. Default value corresponds to at typical HEK cycle.

ccDurationS

Double that estimates the duration in hours of S phase. Default value corresponds to at typical HEK cycle.

ccDurationG2

Double that estimates the duration in hours of G2 phase. Default value corresponds to at typical HEK cycle.

ccDurationM

Double that estimates the duration in hours of M phase. Default value corresponds to at typical HEK cycle.

ccDurationTotal

Double that estimates the duration in hours of the cell cycle. Default value corresponds to at typical HEK cycle.

Details

A matrix of raw sequencing data as well as a table of marker gene lists have to be provided.

Value

Returns a Revelio object containing the custom sequencing data for further analysis.


danielschw188/Revelio documentation built on April 30, 2021, 1:02 p.m.