other_label: Automatically assign "other" to low observation count...
In athompson1991/groupR: Comprehensive manipulation of dataset with applications in forecasting

Often, a grouping column in a dataset has almost all rows concentrated in a few values, with the remaining observations scattered across many smaller values. For example, you may have a dataset with employee-level observations and want to use "US State" as a group, but 90% of the observations fall into New York, California, Texas, and perhaps 6 other states. The remaining observations are spread among the other 41 states, and you might prefer to lump them into a single bucket. This function provides a way to reassign all those observations to "other".

other_label(df, column, percentile = 0.9, custom = NULL)

`df`	The dataframe to be manipulated
`column`	Which column to relabel
`percentile`	The share of observations at which to cut off the data (default 0.9)
`custom`	A custom vector of values to reassign to "other" in the dataset

The dataframe with the reassigned column.

summary(as.factor(permits$type_desc))
permits_cleaned <- other_label(permits, "type_desc")
summary(as.factor(permits_cleaned$type_desc))
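
To make the relabeling idea concrete, here is a minimal sketch in base R. It is not the groupR source: the function name `other_label_sketch`, the toy `state` data, and the exact cutoff rule (a value is relabeled once the more frequent values already cover at least `percentile` of the rows) are all assumptions made for illustration. The package's actual behavior may differ, particularly at the cutoff boundary and in how `custom` interacts with `percentile`.

```r
# Hypothetical re-implementation for illustration only; not the groupR code.
other_label_sketch <- function(df, column, percentile = 0.9, custom = NULL) {
  values <- as.character(df[[column]])
  if (is.null(custom)) {
    # Count each distinct value, most frequent first.
    counts <- sort(table(values), decreasing = TRUE)
    labels <- names(counts)
    n <- as.vector(counts)
    # Share of rows covered by the values ranked above each one.
    covered_before <- (cumsum(n) - n) / sum(n)
    # Assumed rule: relabel a value once the more frequent values
    # already cover at least `percentile` of the rows.
    custom <- labels[covered_before >= percentile]
  }
  # Reassign the selected values to "other".
  values[values %in% custom] <- "other"
  df[[column]] <- values
  df
}

# Toy data: 92 of 100 rows fall into NY, CA, and TX.
toy <- data.frame(
  state = rep(c("NY", "CA", "TX", "VT", "WY"), times = c(50, 30, 12, 5, 3)),
  stringsAsFactors = FALSE
)

# Default cutoff: VT and WY are lumped into "other".
relabeled <- other_label_sketch(toy, "state")
table(relabeled$state)

# Explicit list of values to relabel, bypassing the cutoff.
relabeled_custom <- other_label_sketch(toy, "state", custom = c("TX"))
table(relabeled_custom$state)
```

In this sketch, supplying `custom` skips the frequency calculation entirely, which is one plausible reading of the argument description above.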

athompson1991/groupR documentation built on May 10, 2019, 2:09 p.m.
