SFRR: Raking ratio estimator

Description Usage Arguments Details Value References See Also Examples

Description

Produces estimates for population total and mean using the raking ratio estimator from survey data obtained from a dual frame sampling desing. Confidence intervals are also computed, if required.

Usage

1
2
SFRR(ysA, ysB, pi_A, pi_B, pik_ab_B, pik_ba_A, domains_A, domains_B, N_A, N_B, 
conf_level = NULL)

Arguments

ysA

A numeric vector of length n_A or a numeric matrix or data frame of dimensions n_A x c containing information about variable of interest from s_A.

ysB

A numeric vector of length n_B or a numeric matrix or data frame of dimensions n_B x c containing information about variable of interest from s_B.

pi_A

A numeric vector of length n_A or a square numeric matrix of dimension n_A containing first order or first and second order inclusion probabilities for units included in s_A.

pi_B

A numeric vector of length n_B or a square numeric matrix of dimension n_B containing first order or first and second order inclusion probabilities for units included in s_B.

pik_ab_B

A numeric vector of size n_A containing first order inclusion probabilities according to sampling desing in frame B for units belonging to overlap domain that have been selected in s_A.

pik_ba_A

A numeric vector of size n_B containing first order inclusion probabilities according to sampling desing in frame A for units belonging to overlap domain that have been selected in s_A.

domains_A

A character vector of size n_A indicating the domain each unit from s_A belongs to. Possible values are "a" and "ab".

domains_B

A character vector of size n_B indicating the domain each unit from s_A belongs to. Possible values are "b" and "ba".

N_A

A numeric value indicating the size of frame A

N_B

A numeric value indicating the size of frame B

conf_level

(Optional) A numeric value indicating the confidence level for the confidence intervals, if desired.

Details

Raking ratio estimator of population total is given by

\hat{Y}_{SFRR} = \frac{N_A - \hat{N}_{ab,rake}}{\hat{N}_a^A}\hat{Y}_a^A + \frac{N_B - \hat{N}_{ab,rake}}{\hat{N}_b^B}\hat{Y}_b^B + \frac{\hat{N}_{ab,rake}}{\hat{N}_{abS}}\hat{Y}_{abS}

where \hat{Y}_{abS} = ∑_{i \in s_{ab}^A}\tilde{d}_i^Ay_i + ∑_{i \in s_{ab}^B}\tilde{d}_i^By_i, \hat{N}_{abS} = ∑_{i \in s_{ab}^A}\tilde{d}_i^A + ∑_{i \in s_{ab}^B}\tilde{d}_i^B and \hat{N}_{ab,rake} is the smallest root of the quadratic equation \hat{N}_{ab,rake}x^2 - [\hat{N}_{ab,rake}(N_A + N_B) + \hat{N}_{aS}\hat{N}_{bS}]x + \hat{N}_{ab,rake}N_AN_B = 0, with \hat{N}_{aS} = ∑_{s_a^A}\tilde{d}_i^B and \hat{N}_{bS} = ∑_{s_b^B}\tilde{d}_i^B. Weights \tilde{d}_i^A and \tilde{d}_i^B are obtained as follows \tilde{d}_i^A =≤ft\{\begin{array}{lcc} d_i^A & \textrm{if } i \in a\\ (1/d_i^A + 1/d_i^B)^{-1} & \textrm{if } i \in ab \end{array} \right. and \tilde{d}_i^B =≤ft\{\begin{array}{lcc} d_i^B & \textrm{if } i \in b\\ (1/d_i^A + 1/d_i^B)^{-1} & \textrm{if } i \in ba \end{array} \right. being d_i^A and d_i^B the design weights, obtained as the inverse of the first order inclusion probabilities, that is d_i^A = 1/π_i^A and d_i^B = 1/π_i^B.

To obtain an estimator of the variance for this estimator, one has taken into account that raking ratio estimator coincides with SF calibration estimator when frame sizes are known and "raking" method is used. So, one can use here Deville's expression to calculate an estimator for the variance of the raking ratio estimator

\hat{V}(\hat{Y}_{SFRR}) = \frac{1}{1-∑_{k\in s} a_k^2}∑_{k\in s}(1-π_k)≤ft(\frac{e_k}{π_k} - ∑_{l\in s} a_{l} \frac{e_l}{π_l}\right)^2

where a_k=(1-π_k)/∑_{l\in s} (1-π_l) and e_k are the residuals of the regression with auxiliary variables as regressors.

Value

SFRR returns an object of class "EstimatorDF" which is a list with, at least, the following components:

Call

the matched call.

Est

total and mean estimation for main variable(s).

VarEst

variance estimation for main variable(s).

If parameter conf_level is different from NULL, object includes component

ConfInt

total and mean estimation and confidence intervals for main variables(s).

In addition, components TotDomEst and MeanDomEst are available when estimator is based on estimators of the domains. Component Param shows value of parameters involded in calculation of the estimator (if any). By default, only Est component (or ConfInt component, if parameter conf_level is different from NULL) is shown. It is possible to access to all the components of the objects by using function summary.

References

Lohr, S. and Rao, J.N.K. (2000). Inference in Dual Frame Surveys. Journal of the American Statistical Association, Vol. 95, 271 - 280.

Rao, J.N.K. and Skinner, C.J. (1996). Estimation in Dual Frame Surveys with Complex Designs. Proceedings of the Survey Method Section, Statistical Society of Canada, 63 - 68.

Skinner, C.J. and Rao J.N.K. (1996). Estimation in Dual Frame Surveys with Complex Designs. Journal of the American Statistical Association, Vol. 91, 443, 349 - 356.

Skinner, C.J. (1991). On the Efficiency of Raking Ratio Estimation for Multiple Frame Surveys. Journal of the American Statistical Association, Vol. 86, 779 - 784.

See Also

JackSFRR

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
data(DatA)
data(DatB)
data(PiklA)
data(PiklB)

#Let calculate raking ratio estimator for population total for variable Clothing
SFRR(DatA$Clo, DatB$Clo, PiklA, PiklB, DatA$ProbB, DatB$ProbA, DatA$Domain, 
DatB$Domain, 1735, 1191)

#Now, let calculate raking ratio estimator and a 90% confidence interval for 
#population total for variable Feeding, considering only first order inclusion probabilities
SFRR(DatA$Feed, DatB$Feed, DatA$ProbA, DatB$ProbB, DatA$ProbB, DatB$ProbA, 
DatA$Domain, DatB$Domain, 1735, 1191, 0.90)

Frames2 documentation built on May 2, 2019, 8:13 a.m.