wse | R Documentation |
Estimate the sensitivity parameter for a given cut-off point considering sampling weights with complex survey data.
wse(
response.var,
phat.var,
weights.var = NULL,
tag.event = NULL,
cutoff.value,
data = NULL,
design = NULL
)
response.var |
A character string with the name of the column indicating the response variable in the data set or a vector (either numeric or character string) with information of the response variable for all the units. |
phat.var |
A character string with the name of the column indicating the estimated probabilities in the data set or a numeric vector containing estimated probabilities for all the units. |
weights.var |
A character string indicating the name of the column with sampling weights or
a numeric vector containing information of the sampling weights.
It could be |
tag.event |
A character string indicating the label used to indicate the event of interest in |
cutoff.value |
A numeric value indicating the cut-off point to be used. No default value is set for this argument, and a numeric value must be indicated necessarily. |
data |
A data frame which, at least, must incorporate information on the columns
|
design |
An object of class |
Let S
indicate a sample of n
observations of the vector of random variables (Y,\pmb X)
, and \forall i=1,\ldots,n,
y_i
indicate the i^{th}
observation of the response variable Y
,
and \pmb x_i
the observations of the vector covariates \pmb X
. Let w_i
indicate the sampling weight corresponding to the unit i
and \hat p_i
the estimated probability of event.
Let S_0
and S_1
be subsamples of S
, formed by the units without the event of interest (y_i=0
) and with the event of interest (y_i=1
), respectively.
Then, the sensitivity parameter for a given cut-off point c
is estimated as follows:
\widehat{Se}_w(c)=\dfrac{\sum_{i\in S_1}w_i\cdot I (\hat p_i\geq c)}{\sum_{i\in S_1}w_i}.
See Iparragirre et al. (2022) and Iparragirre et al. (2023) for more details.
The output of this function is a list of 4 elements containing the following information:
Sew
: a numeric value indicating the weighted estimate of the sensitivity parameter.
tags
: list containing one element with the following information:
tag.event
: a character string indicating the label used to indicate event of interest.
basics
: a list containing information of the following 6 elements:
n
: a numeric value indicating the number of units in the data set.
n.event
: a numeric value indicating the number of units in the data set with the event of interest.
n.event.class
: a numeric value indicating the number of units in the data set with the event of interest that are correctly classified as events based on the selected cut-off point.
hatN
: number of units in the population, represented by all the units in the data set, i.e., the sum of the sampling weights of the units in the data set.
hatN.event
: number of units with the event of interest represented in the population by all the event units in the data set, i.e., the sum of the sampling weights of the units with the event of interest in the data set.
hatN.event.class
: number of event units represented in the population by the event units in the data set that have been correctly classified as events based on the selected cut-off point, i.e., the sum of the sampling weights of the correctly classified event units in the data set.
call
: an object saving the information about the way in which the function has been run.
Iparragirre, A., Barrio, I., Aramendi, J. and Arostegui, I. (2022). Estimation of cut-off points under complex-sampling design data. SORT-Statistics and Operations Research Transactions 46(1), 137–158. (https://doi.org/10.2436/20.8080.02.121)
Iparragirre, A., Barrio, I. and Arostegui, I. (2023). Estimation of the ROC curve and the area under it with complex survey data. Stat 12(1), e635. (https://doi.org/10.1002/sta4.635)
data(example_data_wroc)
se.obj <- wse(response.var = "y", phat.var = "phat", weights.var = "weights",
tag.event = 1, cutoff.value = 0.5, data = example_data_wroc)
# Or equivalently
se.obj <- wse(response.var = example_data_wroc$y,
phat.var = example_data_wroc$phat,
weights.var = example_data_wroc$weights,
tag.event = 1, cutoff.value = 0.5)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.