validate: Validate the duplicates using the "groups" column to identify...
In gmyrland/fduper: Eliminate Duplicate Files

Description Usage Arguments Value Examples

Groups the files using the "groups" column, then performs extra validation of the groups. In practice, comparing file hashes is probably enough to identify duplicates for most purposes, but performing the extra validation check can privde extra peace of mind. The algorithm works by reading the files in small chunks, hashing the chunks, and comparing them. Successful validation requires that the hash of each chunk be identical.

1	validate(.data, ...)

`.data`	A fduper object
`chunk_size`	The size of chunks to read, with default of 1e6 bytes

The validated fduper object, or stops execution if a set doesn't pass validation

1	f %>% identify(hash) %>% validate(hash)

gmyrland/fduper documentation built on May 28, 2019, 8:53 p.m.

gmyrland/fduper index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

gmyrland/fduper
Eliminate Duplicate Files

validate: Validate the duplicates using the "groups" column to identify...
In gmyrland/fduper: Eliminate Duplicate Files

Description

Usage

Arguments

Value

Examples

Related to validate in gmyrland/fduper...

R Package Documentation

Browse R Packages

We want your feedback!

gmyrland/fduper Eliminate Duplicate Files

validate: Validate the duplicates using the "groups" column to identify... In gmyrland/fduper: Eliminate Duplicate Files

Description

Usage

Arguments

Value

Examples

Related to validate in gmyrland/fduper...

R Package Documentation

Browse R Packages

We want your feedback!

gmyrland/fduper
Eliminate Duplicate Files

validate: Validate the duplicates using the "groups" column to identify...
In gmyrland/fduper: Eliminate Duplicate Files