clean_dataset: Clean Dataset

View source: R/clean_dataset.R

clean_datasetR Documentation

Clean Dataset

Description

Removes duplicate rows, standardizes column names and text values to uppercase or lowercase, and performs basic data cleaning on a data frame.

Usage

clean_dataset(
  df,
  variables = NULL,
  remove_duplicates = TRUE,
  convert_to_case = NULL
)

Arguments

df

A data frame to be cleaned.

variables

Optional; a vector of variable names to specifically clean. If NULL, applies cleaning to all variables.

remove_duplicates

Logical; whether to remove duplicate rows.

convert_to_case

Optional; convert character variables to "lower" or "upper" case.

Value

A cleaned data frame.

Examples


  df <- data.frame(name = c("Alice", "Bob", "Alice"),
                   score = c(90, 85, 90),
                   stringsAsFactors = FALSE)
  clean_dataset(df, remove_duplicates = TRUE, convert_to_case = "upper")


clinCompare documentation built on Feb. 19, 2026, 1:07 a.m.