investigate_metadata: investigate_metadata

Description Usage Arguments Value Examples

View source: R/investigate_metadata.R

Description

A function to investigate the metadata based on its missingness and variance. A missingness plot will be outputted to get a general sense of the amount of missingness in the metadata. In addition, the percentage of missingness for the variables and rows as well as variables that lack variance will be returned. Finally, using this information, the function will provide information on which variables and rows should be dropped.

Usage

1
2
3
4
5
6
investigate_metadata(
  metadata,
  first_column_as_id = TRUE,
  missing_value_lst = NULL,
  missing_threshold = 0.1
)

Arguments

metadata

The corresponding metadata for a gene count matrix.

first_column_as_id

Boolean value specifying whether the first column in the metadata is the identifier/key. If not, it is assumed that the row names are.

missing_value_lst

A named character list specifying the missing value(s), if it exists, in each variable.

missing_threshold

A value between 0 and 1 inclusive signifying the threshold and cutoff for the percentage of acceptable missingness in each variable.

Value

A list containing 4 objects of relevant metadata info.

Examples

1
2
3
4
5
6
7
8
# Using tcga_metadata from package.
library(MetaConIdentifier)
meta_info <- investigate_metadata(tcga_meta_original,
first_column_as_id = FALSE, missing_value_lst = NULL,
missing_threshold = 0.1)

# Obtain missingness percentages for each variable.
meta_info$missing_percent_col

ahnjedid/MetaConIdentifier documentation built on Dec. 18, 2021, 11:26 p.m.