# R/factor_ueb.R In uebvhir/anaStatsUEB: Statistical analysis function, Statistics and Bioinformatics Unit (UEB, VHIR)

#### Documented in factor_ueb

```#' factor_ueb Function
#'
#' The function factor is used to encode a vector as a factor (the terms ‘category’ and ‘enumerated type’ are also used for factors).
#' @param x   a vector of data, usually taking a small number of distinct values.
#' @param levels an optional vector of the values (as character strings) that x might have taken. The default is the unique set of values taken by as.character(x), sorted into increasing order of x. Note that this set can be specified as smaller than sort(unique(x))
#' @param labels either an optional character vector of (unique) labels for the levels (in the same order as levels after removing those in exclude), or a character string of length 1.
#' @param del.empty.val elimina los niveles que tienen 0 casos
#' @param name.var nom de la variable. Per defecte agafa la part dreta del "\$" (exemple: mtcars\$mpg, utilitza "mpg")
#' @export factor_ueb
#' @author Miriam Mota  \email{[email protected]@vhir.org}
#' @examples
#' variable <- c(0,0,0,1,1,1,0,"a",1,0, "B")
#' factor_ueb(variable, levels = c(0,1), labels = c("No", "Si"))
#'
#' variable <- c(0,0,0,1,1,1,0,10,1,0)
#' factor_ueb(variable, levels = c(0,1), labels = c("No", "Si"))
#'
#' variable <- c(0,0,0,1,1,1,0,1,1,0, NA)
#' factor_ueb(variable, levels = c(0,1), labels = c("No", "Si"))
#' @return factor returns an object of class "factor" which has a set of integer codes the length of x with a "levels" attribute of mode character and unique (!anyDuplicated(.)) entries. If argument ordered is true (or ordered() is used) the result has class c("ordered", "factor"). Undocumentedly for a long time, factor(x) loses all attributes(x) but "names", and resets "levels" and "class".

#' @keywords factor variable class levels labels

factor_ueb <- function(x, levels, labels, del.empty.val = TRUE, name.var = NULL){

name_sep <- strsplit(deparse(substitute(x)), "\$",fixed = T)[[1]]
name.var <- ifelse(is.null(name.var), name_sep[length(name_sep)], name.var)
unique_noNA <- unique(x)[!unique(x) %in% NA]
levels_dif <- unique_noNA  %in% levels
# paste0 ( unique_noNA[!levels_dif], which(x == unique_noNA[!levels_dif])   )
if (!all(levels_dif)) { warning("Los individuos con valor '", paste(unique_noNA[!levels_dif], collapse = "', '"),
"' para la variable ",name.var," han sido considerados NA \n", call. = FALSE) }
var_factor <- factor(x, levels = levels, labels = labels )
if (del.empty.val) var_factor <- factor(var_factor)

return(var_factor)
}
```
uebvhir/anaStatsUEB documentation built on June 11, 2018, 4:43 a.m.