meltt_duplicates: Return identified duplicate entries removed after...

View source: R/meltt_duplicates.R

meltt_duplicatesR Documentation

Return identified duplicate entries removed after integration.

Description

meltt_duplicates returns all matching entries that are identified as matches during the integration process.

Usage

meltt_duplicates(object, columns = NULL)

Arguments

object

object of class meltt.

columns

string vector referencing column names located in the input data. Default is to return all columns contained in the input data.

Details

meltt_duplicates returns all duplicated entries along with specified columns to compare which entries matched. Function allows for easy extraction all entries identified as duplicates.

Value

Returns an data.frame where the first columns contain an index for the data.source and event for each data frame. The subsequent columns are all columns specified in the columns argument, or all columns contained in the original input data if columns = NULL.

An "event_type" column is added to the output data.frame specifying if the match was between events or episode. See meltt_inspect for handling flagged event-to-episode matches.

Author(s)

Karsten Donnay and Eric Dunford.

References

Karsten Donnay, Eric T. Dunford, Erin C. McGrath, David Backer, David E. Cunningham. (2018). "Integrating Conflict Event Data." Journal of Conflict Resolution.

See Also

meltt, meltt_data, meltt_inspect

Examples


data(crashMD)
output = meltt(crash_data1, crash_data2, crash_data3,
                taxonomies = crash_taxonomies, twindow = 1, spatwindow = 3)
duplicates = meltt_duplicates(output, column = c("date", "longitude", "latitude"))
head(duplicates)


meltt documentation built on Oct. 27, 2022, 1:05 a.m.