reduce_to_unique: Reduce to unique cases

View source: R/utils_python_data_management.R

reduce_to_uniqueR Documentation

Reduce to unique cases

Description

Function creates an arrow data set that contains only unique cases. That is, duplicates are removed.

Usage

reduce_to_unique(dataset_to_reduce, column_name)

Arguments

dataset_to_reduce

Object of class datasets.arrow_dataset.Dataset.

column_name

string Name of the column whose values should be unique.

Value

Returns a data set of class datasets.arrow_dataset.Dataset where the duplicates are removed according to the given column.

See Also

Other Utils Python Data Management Developers: class_vector_to_py_dataset(), data.frame_to_py_dataset(), get_batches_index(), prepare_r_array_for_dataset(), py_dataset_to_embeddings(), tensor_list_to_numpy(), tensor_to_numpy()


aifeducation documentation built on Nov. 19, 2025, 5:08 p.m.