Select only distinct/unique rows from a
Optional variables to use when determining uniqueness. If there are multiple rows for a given combination of inputs, only the first row will be preserved. If omitted, will use all variables.
data.frame with the following properties:
Rows are a subset of the input but appear in the same order.
Columns are not modified if
... is empty or
distinct() first calls
to create new columns.
Groups are not modified.
data.frame attributes are preserved.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
df <- data.frame( x = sample(10, 100, rep = TRUE), y = sample(10, 100, rep = TRUE) ) nrow(df) nrow(distinct(df)) nrow(distinct(df, x, y)) distinct(df, x) distinct(df, y) # You can choose to keep all other variables as well distinct(df, x, .keep_all = TRUE) distinct(df, y, .keep_all = TRUE) # You can also use distinct on computed variables distinct(df, diff = abs(x - y)) # The same behaviour applies for grouped data frames, # except that the grouping variables are always included df <- data.frame( g = c(1, 1, 2, 2), x = c(1, 1, 2, 1) ) %>% group_by(g) df %>% distinct(x)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.