replacer
, a value replacement
utility currently based on package
data.table
,
is intended for outside-database update of data. It requires the
preparation of a lookup file which is a list of replacement requests
with and, in special circumstances, without an index column.
This utility is accessible to beginners to R
and facilitates complex
dataset updates with minimal prompt time by employing User-friendly
functions which automatically follow a decision tree rooted in User’s
input.
Data processing such as data cleanup, file format conversion and the appendage of new data to data files are outside the scope.
You can install the released version of replacer from CRAN with:
install.packages("replacer")
Below is a basic example which shows the screen result of data replacement using a lookup file without index:
require(replacer)
#> Loading required package: replacer
## basic example code
dir = system.file('extdata', package = 'replacer')
## update the 'data' dataset with new replacement values
replaceVals(dir, save = FALSE)
#>
#> reading data from: C:/R/R-4.1.2/library/replacer/extdata...
#> data reading complete ...
#> checking standard columns in lookup ...
#> starting replacements ...
#> found duplicates and simple replacements but no index in lookup :
#>
#> would recommend User-made index
#> proceeding any way ...
#> subsetting lookup and creating index ...
#> found request for 1:1 replacements: proceeding ...
#> completed 1:1 replacements ...
#> rejoining columns uninvolved in simple replacements ...
#> searching for 1:many replacements ...
#> found request for multiple duplicated value replacments: creating index ...
#> replacing multiple duplicated values ...
#> found request for multiple missing value replacements: replacing ...
#> processed 1:many replacements ...
#> there are still missing values in some involved columns!
#>
#> helper function has completed!
#> $` updated_data_using_lookup`
#> a b c d
#> 1: aa 7.174 0.259 11
#> 2: NOT PRESENT 17.572 0.478 5555
#> 3: DUP VALUE_1 8.888 0.707 NA
#> 4: dd 0.794 0.737 5555
#> 5: UU -13.964 999.000 5555
#> 6: DUP VALUE_1 0.127 0.737 8
#> 7: ff 8.836 0.476 5555
#> 8: NOT PRESENT 6.397 -0.001 8
#> 9: hh -4.979 0.121 9
#> 10: NOT PRESENT -2.755 -0.227 8
#> 11: EE 1.111 -0.127 6
#> 12: NOT PRESENT 1.121 0.333 7
#> 13: NOT PRESENT -4.979 0.737 5555
#> 14: NOT PRESENT 999.000 0.476 5555
#> 15: NOT PRESENT 1.111 999.000 8
#> 16: JJ 1.121 999.000 5555
#>
#> $` multiple_dups_repl_counts`
#> vars oldVals newVals a b d
#> 1: a cc DUP VALUE_1 2 NA NA
#> 2: b 2.142 1.111 NA 2 NA
#> 3: d 15 5555 NA NA 7
#>
#> $` NAs_remaining`
#> a b c d
#> 0 0 0 1
Familiarity with basic R
commands is necessary. Familiarity with
data.table
is not, yet strongly recommended!
Users are encouraged to start by reading the vignette.
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.