h2o.drop_duplicates: Drops duplicated rows.

Description Usage Arguments Examples

View source: R/frame.R

Description

Drops duplicated rows across specified columns.

Usage

1
h2o.drop_duplicates(frame, columns, keep = "first")

Arguments

frame

An H2OFrame object to drop duplicates on.

columns

Columns to compare during the duplicate detection process.

keep

Which rows to keep. The "first" value (default) keeps the first row and deletes the rest. The "last" keeps the last row.

Examples

1
2
3
4
5
6
7
8
## Not run: 
library(h2o)
h2o.init()

data <- as.h2o(iris)
deduplicated_data <- h2o.drop_duplicates(data, c("Species", "Sepal.Length"), keep = "first")

## End(Not run)

h2o documentation built on Jan. 4, 2022, 1:09 a.m.