GRIMMER
In scrutiny: Error Detection in Science

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)
pkgload::load_all()

library(scrutiny)

Granularity-related inconsistency of means mapped to error repeats, or GRIMMER, is a test for the mathematical consistency of reported means or proportions with the corresponding standard deviations (SDs) and sample sizes [@anaya2016b; @allard2018].

GRIMMER builds up on GRIM [@brown_grim_2017]. Indeed, the elegant Analytic-GRIMMER algorithm [@allard2018] implemented here tests for GRIM-consistency before conducting its own unique tests.

This vignette covers scrutiny's implementation of the GRIMMER test. It's an adapted version of the GRIM vignette because both the tests themselves and their implementations in scrutiny are very similar. If you are familiar with scrutiny's grim_*() functions, much of the present vignette will seem quite natural to you.

The vignette has the following sections --- to get started, though, you only need the first one:

The basic grimmer() function and a specialized mapping function, grimmer_map().
The audit() method for summarizing grimmer_map()'s results.
The visualization function grim_plot(), which also works for GRIMMER.
Testing numeric sequences with grimmer_map_seq().
Handling unknown group sizes with grimmer_map_total_n().

Basic GRIMMER testing

Few cases: `grimmer()`

To test if a reported mean of 7.3 on a granular scale is GRIMMER-consistent with an SD of 2.51 and a sample size of 12, run this:

grimmer(x = "7.3", sd = "2.51", n = 12)

Note that x, the reported mean, needs to be a string. The reason is that strings preserve trailing zeros, which can be crucial for GRIMMER-testing. Numeric values don't, and even converting them to strings won't help. A workaround for larger numbers of such values, restore_zeros(), is discussed in vignette("wrangling").

grimmer() has some further parameters, but all of them can be used from within grimmer_map(). The other parameters will be discussed in that context because grimmer_map() is often the more useful function in practice. Furthermore, although grimmer() is vectorized, grimmer_map() is safer and more convenient for testing multiple combinations of means, SDs, and sample sizes.

Many cases: `grimmer_map()`

If you want to GRIMMER-test more than a handful of cases, the recommended way is to enter them into a data frame and to run grimmer_map() on the data frame. Two different ways to do that are discussed in vignette("wrangling"), but here, I will only describe an easily accessible solution for a single table.

Copy summary data from a PDF file and paste them into tibble::tribble(), which is available via scrutiny:

flying_pigs1 <- tibble::tribble(
  ~x,   ~sd,    ~n,
"8.9",  "2.81",  25,
"2.6",  "2.05",  25,
"7.2",  "2.89",  25,
"3.6",  "3.11",  25,
"9.2",  "7.13",  25,
"10.4", "2.53",  25,
"7.3",  "3.14",  25
)

Use RStudio's multiple cursors to draw quotation marks around all the x and sd values, and to set commas at the end. See vignette("wrangling"), section With copy and paste, if you are not sure how to do that.

Now, simply run grimmer_map() on that data frame:

grimmer_map(flying_pigs1)

The x and n columns are the same as in the input. By default, the number of items composing the mean is assumed to be 1. The main result, consistency, is the GRIMMER consistency of the former three columns.

The reason column says why a set of values was inconsistent. To be GRIMMER-consistent, a value set needs to pass four separate tests: the three GRIMMER tests by @allard2018 and the more basic GRIM test. Here, the two inconsistent values passed GRIM as well as the first two GRIMMER tests, but failed the third one. All consistent value sets are marked as "Passed all" in the "reason" column.

Here is a quick reference for the three GRIMMER tests. See @allard2018 for an explanation.

The reconstructed sum of squared observations must be a whole number.
The reconstructed SD must match the reported one.
The parity of the reconstructed sum of squared observations must match the parity of the reconstructed sum of integers of which the reported means are fractions; i.e., either both are even or both are odd.

Scale items

If a mean is composed of multiple items, set the items parameter to that number. Below are hypothetical means of a three-items scale. With the single-item default, half of these are wrongly flagged as GRIM-inconsistent (true GRIMMER example below):

flying_pigs2 <- tibble::tribble(
   ~x,    ~sd,    ~n,
  "5.90", "2.19",  40,
  "5.71", "1.42",  40,
  "3.50", "1.81",  40,
  "3.82", "2.43",  40,
  "4.61", "1.92",  40,
  "5.24", "2.51",  40
)

flying_pigs2 %>% 
  grimmer_map()  # default is wrong here!

Yet, all of them are consistent if the correct number of items is stated:

flying_pigs2 %>% 
  grimmer_map(items = 3)

It is also possible to include an items column in the data frame instead. This is helpful if the rows have different numbers of items:

flying_pigs3 <- tibble::tribble(
   ~x,     ~sd,    ~n,  ~items,
  "6.92",  "2.19",  30,  1,
  "3.48",  "1.42",  30,  1,
  "1.59",  "1.81",  30,  2,
  "2.61",  "2.43",  30,  2,
  "4.04",  "1.92",  30,  3,
  "4.50",  "2.51",  30,  3,
)

flying_pigs3 %>% 
  grimmer_map()

The items values are multiplied by the n values. By default, the product is shown as n in the output (merge_items = TRUE), but overriding this default won't affect the test results.

However, if we remove the items column, crucial information is missing. As a consequence, the last two values are wrongly flagged as inconsistent, one of them by GRIMMER:

flying_pigs3 %>%
    dplyr::select(-items) %>% 
    grimmer_map()

Summarizing results with `audit()`

Following up on a call to grimmer_map(), the generic function audit() summarizes GRIMMER test results:

flying_pigs1 %>% 
  grimmer_map() %>% 
  audit()

These columns are ---

incons_cases: number of GRIMMER-inconsistent value sets.
all_cases: total number of value sets.
incons_rate: proportion of GRIMMER-inconsistent value sets.
fail_grim, fail_test1, fail_test2, fail_test3: number of value sets failing the GRIM test or one of the three GRIMMER tests, respectively (see Allard 2018).

Visualizing results with `grim_plot()`

GRIMMER does not currently have a dedicated visualization function in scrutiny. However, grim_plot() will accept the output of grimmer_map() just as well as that from grim_map():

flying_pigs4 <- tibble::tribble(
  ~x,      ~sd,    ~n,
  "7.19",  "1.19",  54,
  "4.56",  "2.56",  66,
  "0.42",  "1.29",  59,
  "1.31",  "3.50",  57,
  "3.48",  "3.65",  66,
  "4.27",  "2.86",  61,
  "6.21",  "2.15",  62,
  "3.11",  "3.17",  50,
  "5.39",  "2.37",  68,
  "5.66",  "1.11",  44,
)


flying_pigs4 %>% 
  grimmer_map() %>% 
  grim_plot()

However, grim_plot() will fail with any object not returned by either of these two functions:

grim_plot(mtcars)

See the GRIM vignette section on grim_plot() for more information.

Testing numeric sequences with `grimmer_map_seq()`

GRIMMER analysts might be interested in a mean or percentage value's numeric neighborhood. Suppose you found multiple GRIMMER inconsistencies as in out example pigs5 data. You might wonder whether they are due to small reporting or computing errors.

Use grimmer_map_seq() to GRIMMER-test the values surrounding the reported means and sample sizes:

out_seq1 <- grimmer_map_seq(pigs5)
out_seq1

Summaries with `audit_seq()`

As this output is a little unwieldy, run audit_seq() on the results:

audit_seq(out_seq1)

Here is what the output columns mean:

x and n are the original inputs, reconstructed and tested for consistency here.
hits is the number of GRIMMER-consistent value combinations found within the specified dispersion range.
diff_x reports the absolute difference between x and the next consistent dispersed value (in dispersion steps, not the actual numeric difference). diff_x_up and diff_x_down report the difference to the next higher or lower consistent value, respectively.
diff_n, diff_n_up, and diff_n_down do the same for n.

The default for dispersion is 1:5, for five steps up and down. When the dispersion sequence gets longer, the number of hits tends to increase:

out_seq2 <- grimmer_map_seq(pigs5, dispersion = 1:10)
audit_seq(out_seq2)

Visualizing GRIMMER-tested sequences

It's curious what happens when we plot the output of grimmer_map_seq(). Like regular GRIM or GRIMMER plots, however, it does give us a sense of how many tested values are consistent:

grim_plot(out_seq1)

The crosses appear because grimmer_map_seq() creates sequences around both x and n. Restrict this process to any one of these with the var argument:

out_seq1_only_x <- grimmer_map_seq(pigs5, var = "x")
out_seq1_only_n <- grimmer_map_seq(pigs5, var = "n")

grim_plot(out_seq1_only_x)
grim_plot(out_seq1_only_n)

Handling unknown group sizes with `grimmer_map_total_n()`

Problems from underreporting

Unfortunately, some studies that report group averages don't report the corresponding group sizes --- only a total sample size. This makes any direct GRIMMER-testing impossible because only x values are known, not n values. All that is feasible here in terms of GRIMMER is to take a number around half the total sample size, go up and down from it, and check which hypothetical group sizes are consistent with the reported group means. grimmer_map_total_n() semi-automates this process, motivated by a recent GRIM analysis [@bauer_expression_2021].

Here is an example:

flying_pigs5 <- tibble::tribble(
    ~x1,    ~x2,    ~sd1,   ~sd2,   ~n,
    "3.43", "5.28", "1.09", "2.12", 70,
    "2.97", "4.42", "0.43", "1.65", 65
)

out_total_n <- grimmer_map_total_n(flying_pigs5)
out_total_n

audit_total_n(out_total_n)

See the GRIM vignette, section Handling unknown group sizes with grim_map_total_n(), for a more comprehensive case study. It uses grim_map_total_n(), which is the same as grimmer_map_total_n() but only for GRIM.

References

Any scripts or data that you put into this service are public.

scrutiny documentation built on Sept. 22, 2024, 9:06 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

scrutiny
Error Detection in Science

GRIMMER
In scrutiny: Error Detection in Science

Basic GRIMMER testing

Few cases: `grimmer()`

Many cases: `grimmer_map()`

Scale items

Summarizing results with `audit()`

Visualizing results with `grim_plot()`

Testing numeric sequences with `grimmer_map_seq()`

Summaries with `audit_seq()`

Visualizing GRIMMER-tested sequences

Handling unknown group sizes with `grimmer_map_total_n()`

Problems from underreporting

References

Try the scrutiny package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

scrutiny Error Detection in Science

GRIMMER In scrutiny: Error Detection in Science

Basic GRIMMER testing

Few cases: grimmer()

Many cases: grimmer_map()

Scale items

Summarizing results with audit()

Visualizing results with grim_plot()

Testing numeric sequences with grimmer_map_seq()

Summaries with audit_seq()

Visualizing GRIMMER-tested sequences

Handling unknown group sizes with grimmer_map_total_n()

Problems from underreporting

References

Try the scrutiny package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

scrutiny
Error Detection in Science

GRIMMER
In scrutiny: Error Detection in Science

Few cases: `grimmer()`

Many cases: `grimmer_map()`

Summarizing results with `audit()`

Visualizing results with `grim_plot()`

Testing numeric sequences with `grimmer_map_seq()`

Summaries with `audit_seq()`

Handling unknown group sizes with `grimmer_map_total_n()`