MultiAssayExperiment-helpers | R Documentation |
A set of helper functions were created to help clean and
manipulate a MultiAssayExperiment object. intersectRows
also works
for ExperimentList
objects.
complete.cases: Returns a logical vector corresponding to 'colData' rows that have data across all experiments
isEmpty: Returns a logical TRUE
value for zero length
MultiAssayExperiment
objects
intersectRows: Takes all common rows across experiments, excludes experiments with empty rownames
intersectColumns: A wrapper for complete.cases
to return a
MultiAssayExperiment
with only those biological units that have
measurements across all experiments
replicated: Identifies, via logical vectors, colname
s that
originate from a single biological unit within each assay
replicates: Provides the replicate colname
s found with
the replicated
function by their name, empty list if none
anyReplicated: Whether the assay has replicate measurements
showReplicated: Displays the actual columns that are replicated per
assay and biological unit, i.e., primary
value (colData
rowname) in the sampleMap
mergeReplicates: A function that combines replicated / repeated measurements across all experiments and is guided by the replicated return value
longFormat: A MultiAssayExperiment
method that
returns a small and skinny DataFrame
. The
colDataCols
arguments allows the user to append colData
columns to
the data.
wideFormat: A function to reshape the data in a
MultiAssayExperiment
to a "wide" format
DataFrame
. Each row in the DataFrame
represents an observation (corresponding to an entry in the colData
).
If replicates are present, their data will be appended at
the end of the corresponding row and will generate additional NA
data.
It is recommended to remove or consolidate technical replicates with
mergeReplicates
. Optional colDataCols
can be added when the
original object is a MultiAssayExperiment
.
hasRowRanges: A function that identifies ExperimentList elements
that have a
rowRanges
method
getWithColData: A convenience function for extracting an assay and associated colData
renamePrimary: A convenience function to rename the primary
biological units as represented in the rownames(colData)
renameColname: A convenience function to rename the colnames of a particular assay
## S4 method for signature 'MultiAssayExperiment'
complete.cases(...)
## S4 method for signature 'MultiAssayExperiment'
isEmpty(x)
intersectRows(x)
intersectColumns(x)
replicated(x)
## S4 method for signature 'MultiAssayExperiment'
replicated(x)
anyReplicated(x)
## S4 method for signature 'MultiAssayExperiment'
anyReplicated(x)
showReplicated(x)
## S4 method for signature 'MultiAssayExperiment'
showReplicated(x)
replicates(x, ...)
## S4 method for signature 'MultiAssayExperiment'
replicates(x, ...)
mergeReplicates(x, replicates = list(), simplify = BiocGenerics::mean, ...)
## S4 method for signature 'MultiAssayExperiment'
mergeReplicates(
x,
replicates = replicated(x),
simplify = BiocGenerics::mean,
...
)
## S4 method for signature 'ANY'
mergeReplicates(x, replicates = list(), simplify = BiocGenerics::mean, ...)
longFormat(object, colDataCols = NULL, i = 1L, ...)
## S4 method for signature 'ANY'
longFormat(object, colDataCols = NULL, i = 1L, ...)
## S4 method for signature 'ExperimentList'
longFormat(object, colDataCols = NULL, i = 1L, ...)
## S4 method for signature 'MultiAssayExperiment'
longFormat(object, colDataCols = NULL, i = 1L, ...)
wideFormat(
object,
colDataCols = NULL,
check.names = TRUE,
collapse = "_",
i = 1L
)
hasRowRanges(x)
## S4 method for signature 'MultiAssayExperiment'
hasRowRanges(x)
## S4 method for signature 'ExperimentList'
hasRowRanges(x)
getWithColData(x, i, mode = c("append", "replace"), verbose = FALSE)
renamePrimary(x, value)
renameColname(x, i, value)
splitAssays(x, hitList)
## S4 method for signature 'MultiAssayExperiment'
splitAssays(x, hitList)
makeHitList(x, patternList)
... |
Additional arguments. See details for more information. |
x |
A MultiAssayExperiment or ExperimentList |
replicates |
A list of |
simplify |
A function for merging repeat measurements in experiments
as indicated by the |
object |
Any supported class object |
colDataCols |
A |
i |
longFormat: The i-th assay in
|
check.names |
(logical default TRUE) Column names of the output
|
collapse |
(character default "_") A single string delimiter for output
column names. In |
mode |
String indicating how |
verbose |
|
value |
renamePrimary: A |
hitList |
a named |
patternList |
a named |
The replicated
function finds replicate measurements in each
assay and returns a list of LogicalList
s.
Each element in a single LogicalList
corresponds to
a biological or primary unit as in the sampleMap
. Below is a
small graphic for one particular biological unit in one assay, where the
logical vector corresponds to the number of measurements/samples in the
assay:
> replicated(MultiAssayExperiment) (list str) '-- $ AssayName (LogicalList str) '-- [[ "Biological Unit" ]] Replicated if sum(...) > 1 '-- TRUE TRUE FALSE FALSE
anyReplicated
determines if any of the assays have at least one
replicate. Note. These methods are not available for the
ExperimentList
class due to a missing sampleMap
structure
(by design).
showReplicated
returns a list of CharacterList
s
where each element corresponds to the the biological or primary units that
are replicated in that assay element. The values in the inner list are
the colnames
in the assay that are technical replicates.
The replicates
function (noun) returns the colname
s
from the sampleMap
that were identified as replicates. It returns a
list of CharacterList
s for each assay present in
the MultiAssayExperiment
and an inner entry for each biological unit that
has replicate observations in that assay.
The mergeReplicates
function is a house-keeping method
for a MultiAssayExperiment
where only complete.cases
are
returned. This by-assay operation averages replicate measurements
(by default) and columns are aligned by the row order in colData
.
Users can provide their own function for merging replicates with the
simplify
functional argument. Additional inputs ...
are
sent to the 'simplify' function.
The mergeReplicates
"ANY" method consolidates duplicate
measurements for rectangular data structures, returns object of the same
class (endomorphic). The ellipsis or ...
argument allows the
user to provide additional arguments to the simplify
functional
argument.
The longFormat
"ANY" class method, works with classes such as
ExpressionSet
and
SummarizedExperiment
as
well as matrix
to provide a consistent long and skinny
DataFrame
.
The hasRowRanges
method identifies assays that support
a rowRanges
method and return a GRanges
object.
See the itemized list in the description section for details.
The mergeReplicates
function makes use of the output from
replicated
which will point out the duplicate measurements by
biological unit in the MultiAssayExperiment
. This function will
return a MultiAssayExperiment
with merged replicates. Additional
arguments can be provided to the simplify argument via the ellipsis
(...
). For example, when replicates "TCGA-B" and "TCGA-A" are found in
the assay, the name of the first appearing replicate is taken (i.e., "B").
Note that a typical use case of merging replicates occurs when there are
multiple measurements on the same sample (within the same assay)
and can therefore be averaged.
The 'longFormat' method takes data from the ExperimentList
in a MultiAssayExperiment
and returns a uniform
DataFrame
. The resulting DataFrame has columns indicating
primary, rowname, colname and value. This method can optionally include
columns of the MultiAssayExperiment colData named by colDataCols
character
vector argument. (MultiAssayExperiment
method only). The i
argument
allows the user to specify the assay value for the
SummarizedExperiment
assay function's i
argument.
The wideFormat
function returns standardized wide DataFrame
where each row represents a biological unit as in the colData
.
Depending on the data and setup, biological units can be patients, tumors,
specimens, etc. Metadata columns are
generated based on the names produced in the wide format
DataFrame
. These can be accessed via the
mcols()
function.
See the wideFormat
section for description of the colDataCols
and
i
arguments.
The hasRowRanges
method identifies assays with associated ranged
row data by directly testing the method on the object. The result from the
test must be a GRanges
class object to
satisfy the test.
The getWithColData
function allows the user to conveniently extract
a particular assay as indicated by the i
index argument. It
will also attempt to provide the
colData
along with the extracted object using the colData<-
replacement
method when possible. Typically, this method is available for
SummarizedExperiment
and RaggedExperiment
classes.
The setting of mode
determines how the colData
is added. If mode="append"
, the MultiAssayExperiment
metadata is appended onto that of the SummarizedExperiment
.
If any fields are duplicated by name, the values in the
SummarizedExperiment
are retained, with a warning emitted if
the values are different. For mode="replace"
, the
MultiAssayExperiment
metadata replaces that of the
SummarizedExperiment
, while for mode="none"
,
no replacement or appending is performed.
The renamePrimary
function allows the user to conveniently change the
actual names of the primary biological units as seen in
rownames(colData)
. renameColname
allows the user to change the
names of a particular assay based on index i
. i
can either be
a single numeric or character value. See colnames<-
method for
renaming multiple colnames in a MultiAssayExperiment
.
The splitAssays
method separates columns in each of the assays based
on the hitList
input. The hitList
can be generated using
the makeHitList
helper function. To use the makeHitList
helper, the user should input a list of patterns that will match on the
column names of each assay. These matches should be mutually exclusive as
to avoid repetition of columns across assays. See the examples section.
example(MultiAssayExperiment)
complete.cases(mae)
isEmpty(MultiAssayExperiment())
## renaming biological units (primary)
mae2 <- renamePrimary(mae, paste0("pt", 1:4))
colData(mae2)
sampleMap(mae2)
## renaming observational units (colname)
mae2 <- renameColname(mae, i = "Affy", paste0("ARRAY", 1:4))
colnames(mae2)
sampleMap(mae2)
patts <- list(
normals = "TCGA-[A-Z0-9]{2}-[A-Z0-9]{4}-11",
tumors = "TCGA-[A-Z0-9]{2}-[A-Z0-9]{4}-01"
)
data("miniACC")
hits <- makeHitList(miniACC, patts)
## only turmors present
splitAssays(miniACC, hits)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.