keyDiagnostic: Diagnose accuracy of result from applying variable key to...

View source: R/variableKey.R

keyDiagnosticR Documentation

Diagnose accuracy of result from applying variable key to data

Description

Compare the old and new data frames, checking for accuracy of calculations in various ways.

Usage

keyDiagnostic(
  dfold,
  dfnew,
  keylist,
  max.values = 20,
  nametrunc = 18,
  wide = 200,
  confidential = FALSE
)

Arguments

dfold

Original data frame

dfnew

The new recoded data frame

keylist

The imported variable key that was used to transform dfold into dfnew.

max.values

Show up to this number of values for the old variable

nametrunc

Truncate column and row names. Needed if there are long factor labels and we want to fit more information on table. Default = 18 for new name, old name is 10 more characters (18 + 10 = 28).

wide

Number of characters per row in printed output. Suggest very wide screen, default = 200.

confidential

Should numbers in table be rounded to nearest "10" to comply with security standard enforced by some American research departments.

Details

CAUTION: This can print WIDE matrices. Because the on-screen output will be WIDE, make the display window WIDE!

Crosstabulate new variable versus old variable to see the coding mismatches. For tables of up to 10 values or so, that will be satisfactory.

For numeric variables, it appears there is no good thing to do except possibly to re-apply any transformations.

Author(s)

Paul Johnson <pauljohn@ku.edu>


kutils documentation built on Sept. 17, 2023, 5:06 p.m.