(1.) This currently estimates data per pipeline (cannot estimate a subset by feature i.e. genes or biosamples)
(2.) The estimate is a rough estimate e.g. RNA-seq data will have estimate for multiple
columns that are eventually not present in the expression matrix (e.g. dataset_id, dataset_version,
biosample_id, feature_id, measurementset_id
). Some of these columns are important to link
metadata to the measurement data, and only those are eventually downloaded from the
database.
(3.) For now this is only used to estimate the data size in the measurement array
(not the feature and any other metadata that might also be downloaded to form a compound
object e.g. search_expression
returns a Bioconductor ExpressionSet object).
1 2 3 4 5 | estimate_measurementdata_download_size(
measurementset,
units = c("MB", "KB", "GB"),
con = NULL
)
|
measurementset |
measurementset dataframe – e.g. output of |
units |
return estimated size as either of " |
con |
the connection object (optional) |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.