paretoTail | R Documentation |
Fit a Pareto distribution to the upper tail of income data. Since a theoretical distribution is used for the upper tail, this is a semiparametric approach.
paretoTail(
x,
k = NULL,
x0 = NULL,
method = "thetaPDC",
groups = NULL,
w = NULL,
alpha = 0.01,
...
)
x |
a numeric vector. |
k |
the number of observations in the upper tail to which the Pareto distribution is fitted. |
x0 |
the threshold (scale parameter) above which the Pareto distribution is fitted. |
method |
either a function or a character string specifying the function
to be used to estimate the shape parameter of the Pareto distibution, such as
|
groups |
an optional vector or factor specifying groups of elements of
|
w |
an optional numeric vector giving sample weights. |
alpha |
numeric; values above the theoretical |
... |
addtional arguments to be passed to the specified method. |
The arguments k
and x0
of course correspond with each other.
If k
is supplied, the threshold x0
is estimated with the n
- k
largest value in x
, where n
is the number of observations.
On the other hand, if the threshold x0
is supplied, k
is given
by the number of observations in x
larger than x0
. Therefore,
either k
or x0
needs to be supplied. If both are supplied,
only k
is used.
The function supplied to method
should take a numeric vector (the
observations) as its first argument. If k
is supplied, it will be
passed on (in this case, the function is required to have an argument called
k
). Similarly, if the threshold x0
is supplied, it will be
passed on (in this case, the function is required to have an argument called
x0
). As above, only k
is passed on if both are supplied. If
the function specified by method
can handle sample weights, the
corresponding argument should be called w
. Additional arguments are
passed via the ... argument.
An object of class "paretoTail"
with the following
components:
x |
the supplied numeric vector. |
k |
the number of observations in the upper tail to which the Pareto distribution has been fitted. |
groups |
if supplied, the vector or factor specifying groups of elements. |
w |
if supplied, the numeric vector of sample weights. |
method |
the function used to estimate the shape parameter, or the name of the function. |
x0 |
the scale parameter. |
theta |
the estimated shape parameter. |
tail |
if |
alpha |
the tuning parameter |
out |
if |
Andreas Alfons
A. Alfons and M. Templ (2013) Estimation of Social Exclusion Indicators from Complex Surveys: The R Package laeken. Journal of Statistical Software, 54(15), 1–25. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.18637/jss.v054.i15")}
A. Alfons, M. Templ, P. Filzmoser (2013) Robust estimation of economic indicators from survey samples based on Pareto tail modeling. Journal of the Royal Statistical Society, Series C, 62(2), 271–286.
reweightOut
, shrinkOut
,
replaceOut
, replaceTail
, fitPareto
thetaPDC
, thetaWML
, thetaHill
,
thetaISE
, thetaLS
, thetaMoment
,
thetaQQ
, thetaTM
data(eusilc)
## gini coefficient without Pareto tail modeling
gini("eqIncome", weights = "rb050", data = eusilc)
## gini coefficient with Pareto tail modeling
# estimate threshold
ts <- paretoScale(eusilc$eqIncome, w = eusilc$db090,
groups = eusilc$db030)
# estimate shape parameter
fit <- paretoTail(eusilc$eqIncome, k = ts$k,
w = eusilc$db090, groups = eusilc$db030)
# calibration of outliers
w <- reweightOut(fit, calibVars(eusilc$db040))
gini(eusilc$eqIncome, w)
# winsorization of outliers
eqIncome <- shrinkOut(fit)
gini(eqIncome, weights = eusilc$rb050)
# replacement of outliers
eqIncome <- replaceOut(fit)
gini(eqIncome, weights = eusilc$rb050)
# replacement of whole tail
eqIncome <- replaceTail(fit)
gini(eqIncome, weights = eusilc$rb050)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.