Methods for model selection, model averaging, and calculating metrics, such as the Gini, Theil, Mean Log Deviation, etc, on binned income data where the topmost bin is right-censored. We provide both a non-parametric method, termed the bounded midpoint estimator (BME), which assigns cases to their bin midpoints; except for the censored bins, where cases are assigned to an income estimated by fitting a Pareto distribution. Because the usual Pareto estimate can be inaccurate or undefined, especially in small samples, we implement a bounded Pareto estimate that yields much better results. We also provide a parametric approach, which fits distributions from the generalized beta (GB) family. Because some GB distributions can have poor fit or undefined estimates, we fit 10 GB-family distributions and use multimodel inference to obtain definite estimates from the best-fitting distributions. We also provide binned income data from all United States of America school districts, counties, and states.
|Author||Samuel V. Scarpino, Paul von Hippel, and Igor Holas|
|Date of publication||2016-12-17 01:36:01|
|Maintainer||Samuel V. Scarpino <email@example.com>|
|License||GPL (>= 3.0)|
binequality-package: Methods for Analyzing Binned Income Data
county_bins: A data set containing binned income for US counties
fitFunc: A function to fit a parametric distribution to binned data.
getMids: A function to calculate the bin midpoints.
getQuantilesParams: A function to extract the quantiles and parameters
giniCoef: Calculates the Gini coefficient from quantiles
LRT: A function to perform likelihood ratio tests
makeFitComb: A function to transform a list into a dataframe
makeInt: A function to create a survival object from bin counts.
makeIntWeight: A function to create a survival object from bin counts and...
makeWeightsAIC: A function to calculate AIC weights
mAvg: A simple function to perfom model averaging using...
midStats: A function to calculate statistics using bin midpoints
MLD: A function to calculate the MLD
modelAvg: A function to calculate model averages
paramFilt: A function to filter models based on estimated parameters
run_GB_family: A function to fit a parametric distributions to binned data.
school_district_bins: A data set containing the school district data.
SDL: A function to calculate the SDL
state_bins: A data set containing the binned state data.
theilInd: A function to calculate the Theil