View source: R/svymean_winsorized.R
svymean_winsorized | R Documentation |
Weighted winsorized mean and total
svymean_winsorized(x, design, LB = 0.05, UB = 1 - LB, na.rm = FALSE,
trim_var = FALSE, ...)
svymean_k_winsorized(x, design, k, na.rm = FALSE, trim_var = FALSE, ...)
svytotal_winsorized(x, design, LB = 0.05, UB = 1 - LB, na.rm = FALSE,
trim_var = FALSE, ...)
svytotal_k_winsorized(x, design, k, na.rm = FALSE, trim_var = FALSE, ...)
x |
a one-sided |
design |
an object of class |
LB |
|
UB |
|
na.rm |
|
trim_var |
|
k |
|
... |
additional arguments (currently not used). |
Package survey must be attached to the search path in order to use
the functions (see library
or require
).
Population mean or total. Let \mu
denote the estimated winsorized population mean; then, the
estimated winsorized total is given by
\hat{N} \mu
with
\hat{N} =\sum w_i
, where summation
is over all observations in the sample.
The amount of winsorization can be specified in relative or absolute terms:
Relative: By specifying LB
and UB
,
the method winsorizes the LB
~\cdot 100\%
of the smallest observations and the
(1 - UB
)~\cdot 100\%
of the largest
observations from the data.
Absolute: By specifying argument k
in the
functions with the "infix" _k_
in their name (e.g.,
svymean_k_winsorized
), the
largest k
observations are winsorized, 0<k<n
,
where n
denotes the sample size. E.g., k = 2
implies that the largest and the second largest observation
are winsorized.
Large-sample approximation based on the influence function; see Huber and Ronchetti (2009, Chap. 3.3) and Shao (1994). Two estimators are available:
simple_var = FALSE
Variance estimator of
the winsorized mean/ total. The estimator depends on
the estimated probability density function evaluated at
the winsorization thresholds, which can be – depending
on the context – numerically unstable. As a remedy,
a simplified variance estimator is available by
setting simple_var = TRUE
.
simple_var = TRUE
Variance is approximated using the variance estimator of the trimmed mean/ total.
summary
,
coef
, SE
,
vcov
,
residuals
,
fitted
and
robweights
.
See:
weighted_mean_winsorized
,
weighted_mean_k_winsorized
,
weighted_total_winsorized
,
weighted_total_k_winsorized
.
Object of class svystat_rob
Huber, P. J. and Ronchetti, E. (2009). Robust Statistics, New York: John Wiley and Sons, 2nd edition. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1002/9780470434697")}
Shao, J. (1994). L-Statistics in Complex Survey Problems. The Annals of Statistics 22, 976–967. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1214/aos/1176325505")}
Overview (of all implemented functions)
weighted_mean_winsorized
,
weighted_mean_k_winsorized
,
weighted_total_winsorized
and
weighted_total_k_winsorized
head(workplace)
library(survey)
# Survey design for stratified simple random sampling without replacement
dn <- if (packageVersion("survey") >= "4.2") {
# survey design with pre-calibrated weights
svydesign(ids = ~ID, strata = ~strat, fpc = ~fpc, weights = ~weight,
data = workplace, calibrate.formula = ~-1 + strat)
} else {
# legacy mode
svydesign(ids = ~ID, strata = ~strat, fpc = ~fpc, weights = ~weight,
data = workplace)
}
# Estimated winsorized population mean (5% symmetric winsorization)
svymean_winsorized(~employment, dn, LB = 0.05)
# Estimated one-sided k winsorized population total (2 observations are
# winsorized at the top of the distribution)
svytotal_k_winsorized(~employment, dn, k = 2)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.