find_similar | R Documentation |
This is used to identify columns in the two data frames that might be the same. This will only be meaningfull if the rows of the two data frames correspond to each other in some way i.e. they are sorted appropriately.
find_similar(df1, df2 = NULL)
df1, df2 |
Two data frames with matching number of rows. If the argument |
The returned table summarises results with a row for each pair of columns with matching classes. There are
counts for: matches, both zero, one or both is NA
, and differences. The proportion of non-zero matches
is also given. This is the number of non-zero matches divided by the number of element pairs that don't
contain an NA
and are not both zero. Excluding matches which are both zeroes makes it easier to see
genuinely similar columns in data that contains lots of zeroes or missing values.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.