View source: R/acc_distributions.R
acc_distributions_loc | R Documentation |
Data quality indicator checks "Unexpected location" and "Unexpected proportion" with histograms and, if a grouping variable is included, plots of empirical cumulative distributions for the subgroups.
Indicator
acc_distributions_loc(
resp_vars = NULL,
study_data,
meta_data,
label_col,
check_param = "location",
plot_ranges = TRUE,
flip_mode = "noflip"
)
resp_vars |
variable list the names of the measurement variables |
study_data |
data.frame the data frame that contains the measurements |
meta_data |
data.frame the data frame that contains metadata attributes of study data |
label_col |
variable attribute the name of the column in the metadata with labels of variables |
check_param |
enum any | location | proportion. Which type of check should be conducted (if possible): a check on the location of the mean or median value of the study data, a check on proportions of categories, or either of them if the necessary metadata is available. |
plot_ranges |
logical Should the plot show ranges and results from the data quality checks? (default: TRUE) |
flip_mode |
enum default | flip | noflip | auto. Should the plot be
in default orientation, flipped, not flipped or
auto-flipped. Not all options are always supported.
In general, this con be controlled by
setting the |
A list with:
SummaryTable
: data.frame containing data quality checks for
"Unexpected location" (FLG_acc_ud_loc
) and "Unexpected
proportion" (FLG_acc_ud_prop
) for each response
variable in resp_vars
.
SummaryData
: a data.frame containing data quality checks for
"Unexpected location" and / or "Unexpected proportion"
for a report
SummaryPlotList
: list of ggplots for each response variable in
resp_vars
.
If no response variable is defined, select all variables of type float or integer in the study data.
Remove missing codes from the study data (if defined in the metadata).
Remove measurements deviating from (hard) limits defined in the metadata (if defined).
Exclude variables containing only NA
or only one unique value (excluding
NA
s).
Perform check for "Unexpected location" if defined in the metadata (needs a LOCATION_METRIC (mean or median) and LOCATION_RANGE (range of expected values for the mean and median, respectively)).
Perform check for "Unexpected proportion" if defined in the metadata (needs PROPORTION_RANGE (range of expected values for the proportions of the categories)).
Plot histogram(s).
If group_vars is specified by the user, distributions within group-wise ecdf are presented.
acc_distributions
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.