lival | R Documentation |
The dataset is an extended version of the public micro data file of the LSMS 2012 of Albania available at (https://www.instat.gov.al/en/figures/micro-data/, accessed 13 February 2023). Documentation of the LSMS 2012 of Albania is from the World Bank (https://microdata.worldbank.org/index.php/catalog/1970, accessed 5 November 2020). The data set is ported to R and updated with approximate survey design information derived from the data itself. The units are households and the variables are expenditures on main categories, poverty measures and structural information including weights and sample design.
lival
A data frame with 6671 rows and 26 variables
primary sampling unit (psu)
unique household identifier (100*psu+hh)
household number per psu
prefecture
urbanicity (Urban=1, Rural=2)
stratum
region
total consumption of hh
real mean per capita consumption
real food consumption per capita
real non food consumption per capita
real education consumption per capita
real durable consumption per capita
real utilities consumption per capita
extreme headcount poverty
extreme poverty gap
extreme poverty depth
absolute headcount poverty
absolute poverty gap
absolute poverty depth
final cross-sectional weight
number of psu in stratum population
number of households in stratum population
number of households in sampled psu
psu inclusion probability
household inclusion probability
Absolute poverty measures use a poverty line of Lek 4891 (2002 prices).
Extreme poverty measures use a poverty line where the basic nutritional needs are
difficult to meet.
The headcount poverty variable is an indicator for the income of the household y_i
being below the (absolute or extreme) poverty line z
.
The poverty gap variable measures the relative distance to the poverty line: (z-y_i)/z
.
The poverty depth variable is the square of the poverty gap variable, i.e. [(z-y_i)/z]^2
,
giving more weight to the poorer among the poor and thus describing the inequality
among the poor.
The survey design is a stratified clustered two stage design. The primary sampling units are enumeration zones. The strata are the crossing of prefecture and urbanicity and the allocation of the psu sample to the strata is proportional to the number of households. Within strata the psu are sampled with probability proportional to number of households. Within psu a simple random sample of 8 households was selected. The weights are calibrated to population margins. All survey design informations except the strata and the weights are approximated through the weights using assumptions on the design. Since the data set has undergone data protection measures and the survey design is approximate only, inference to the population does not yield exact results. However, the complexity of the data and of the survey design are realistic.
The size of the household is not on the original data set.
However, the transformation capita <- round(0.07527689 * totcons/rcons, 0)
yields the number of persons in the household.
With R package survey
a survey design object can be built with, e.g., svydesign(~psu + hhid , strata= ~strat, fpc= ~pi1 +pi2, weight= ~weight, data=lival, pps="brewer")
.
https://www.instat.gov.al/en/figures/micro-data/
data(lival)
lival$capita <- with(lival, round(0.07527689 * totcons / rcons, 0))
## Not run:
library(survey)
lival.des <- svydesign(~psu + hhid , strata= ~strat, fpc= ~pi1 +pi2,
weight= ~weight, data=lival, pps="brewer")
svymean(~totcons, lival.des, deff=TRUE)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.