#' @title Convert ANZSIC names and abbreviations into the consistent format used by the Australian Bureau of Statistics.
#'
#' @description This function enables both exact (default) and fuzzy matching. Under exact matching, if no match is found, NA is returned.
#'
#' @param x a (character) vector containing ANZSIC titles.
#' Note that clean_anzsic always returns a character vector. If no match is found, then NA is returned.
#'
#' @param fuzzy_match logical; either TRUE which indicates that
#' approximate/fuzzy string matching should be used, or FALSE (the default) which indicates that
#' only exact matches should be used. If FALSE, then if no match is found, then NA is returned.
#'
#' @param max_dist numeric, sets the maximum acceptable distance between your
#' string and the matched string. Default is 0.4. Only relevant when fuzzy_match is TRUE.
#'
#' @param method the method used for approximate/fuzzy string matching. Default
#' is "jw", the Jaro-Winker distance; see `??stringdist-metrics` for more options.
#' Only relevant when fuzzy_match is TRUE.
#'
#' @param silent a boolean value. If FALSE (the default), the function will warn that NAs were returned.
#'
#' @return a character vector
#'
#' @family cleaning functions
#'
#' @seealso \code{\link{clean_anzsco}} for ANZSCO.
#'
#' @examples
#'
#' clean_anzsic("Agriculture forestry & Fishing")
#'
#' clean_anzsic("Agriculture, Forestry", fuzzy_match = TRUE)
#'
#' @export
#'
#'
clean_anzsic <- function(x,
fuzzy_match = FALSE,
max_dist = 0.4,
method = "jw",
silent = FALSE) {
clean_titles(
dictionary = anzsic_dictionary,
.vector = x,
.fuzzy_match = fuzzy_match,
.max_dist = max_dist,
.method = method,
.silent = silent
)
}
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.