util_string_is_not_categorical: Utility function for judging whether a character vector does...

View source: R/util_string_is_not_categorical.R

util_string_is_not_categoricalR Documentation

Utility function for judging whether a character vector does not appear to be a categorical variable

Description

The function considers the following properties:

  • the maximum number of characters (to identify free text fields with long entries),

  • the relative frequency of punctuation and space characters per element (to identify, e.g., JSON or XML elements, which are structured by those characters),

  • the relative frequency of elements (categorical variables would have a low proportion of unique values in comparison to other variables).

Usage

util_string_is_not_categorical(vec)

Arguments

vec

a character vector

Value

TRUE or FALSE


dataquieR documentation built on May 29, 2024, 7:18 a.m.