debit_map_seq | R Documentation |
debit_map_seq()
applies DEBIT with values surrounding the input values.
This provides an easy and powerful way to assess whether small errors in
computing or reporting may be responsible for DEBIT inconsistencies in
published statistics.
debit_map_seq(
data,
x = NULL,
sd = NULL,
n = NULL,
var = Inf,
dispersion = 1:5,
out_min = "auto",
out_max = NULL,
include_reported = FALSE,
include_consistent = FALSE,
...
)
data |
A data frame that |
x , sd , n |
Optionally, specify column names in |
var |
String. Names of the columns that will be dispersed. Default is
|
dispersion |
Numeric. Sequence with steps up and down from the |
out_min , out_max |
If specified, output will be restricted so that it's
not below |
include_reported |
Logical. Should the reported values themselves be
included in the sequences originating from them? Default is |
include_consistent |
Logical. Should the function also process
consistent cases (from among those reported), not just inconsistent ones?
Default is |
... |
Arguments passed down to |
A tibble (data frame) with detailed test results.
audit_seq()
You can call audit_seq()
following debit_map_seq()
. It will return a data frame with these
columns:
x
, sd
, and n
are the original inputs,
tested for consistency
here.
hits_total
is the total number of DEBIT-consistent value sets
found within the specified dispersion
range.
hits_x
is the number of DEBIT-consistent value sets
found by varying x
.
Accordingly with sd
and hits_sd
as well as n
and hits_n
.
(Note that any consistent reported cases will be counted by the
hits_*
columns if both include_reported
and include_consistent
are
set to TRUE
.)
diff_x
reports the absolute difference between x
and the next
consistent dispersed value (in dispersion steps, not the actual numeric
difference). diff_x_up
and diff_x_down
report the difference to the
next higher or lower consistent value, respectively.
diff_sd
, diff_sd_up
, and diff_sd_down
do the same for sd
.
Likewise with diff_n
, diff_n_up
, and diff_n_down
.
Call audit()
following audit_seq()
to summarize results even further.
It's mostly self-explaining, but na_count
and na_rate
are the number
and rate of times that a difference could not be computed because of a lack
of corresponding hits within the dispersion
range.
# `debit_map_seq()` can take any input
# that `debit_map()` can take:
pigs3
# Results from testing some few rows:
out <- pigs3 %>%
dplyr::slice(3:4) %>%
debit_map_seq()
out
# Case-wise summaries with `audit_seq()`
# can be more important than the raw results:
out %>%
audit_seq()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.