R/factors.r

Defines functions combine_factor

Documented in combine_factor

# Combine factor levels
# Convenience function to make it easy to combine multiple levels
# in a factor into one.
# 
# @arguments factor variable
# @arguments either a character vector of levels, or a numeric vector of their positions.  See examples for more details.
# @arguments label for other level
# @keyword manip 
#X df <- data.frame(a = LETTERS[sample(5, 15, replace=TRUE)], y = rnorm(15))  
#X combine_factor(df$a, c(1,2,2,1,2))
#X combine_factor(df$a, c(1:4, 1))
#X (f <- reorder(df$a, df$y))
#X percent <- tapply(abs(df$y), df$a, sum)
#X combine_factor(f, c(order(percent)[1:3]))
combine_factor <- function(fac, variable=levels(fac), other.label="Other") {
  n <- length(levels(fac))
  if (length(variable) < n) {
    nvar <- c(seq(1, length(variable)), rep(length(variable)+1, n - length(variable)))
    factor(nvar[as.numeric(fac)], labels=c(levels(fac)[variable], other.label))
  } else {
    factor(variable[as.numeric(fac)], labels=levels(fac)[!duplicated(variable)])    
  }
} 

Try the reshape package in your browser

Any scripts or data that you put into this service are public.

reshape documentation built on April 12, 2022, 5:07 p.m.