read_2002: Read HSE 2002

Description Usage Arguments Details Value Examples

View source: R/read_2002.R

Description

Reads and does basic cleaning on the Health Survey for England 2002.

Usage

1
2
3
4
read_2002(
  root = c("X:/", "/Volumes/Shared/"),
  file = "ScHARR/PR_Consumption_TA/HSE/HSE 2002/UKDA-4912-tab/tab/hse02ai.tab"
)

Arguments

root

Character - the root directory.

file

Character - the file path and name.

Details

As well as providing a sample designed to give a cross-section of the population, HSE 2002 also focussed on the health of a number of specific groups, including: infants and children (aged 0-15), young adults (aged 16-24) and mothers with infants aged under 1. Addresses sampled in each postal sector were systematically allocated to one of two groups: Sample I (29 addresses) or Sample II (9 addresses). Sample I was designed to boost the proportion of children, young people and mothers of infants, and Sample II to provide a sample of the general population. At Sample I addresses all persons aged 0-24 were eligible for inclusion in the survey, as were all mothers of infants aged under 1 (there was no upper age limit for the mothers). At Sample II addresses all persons were eligible for interview. At both Sample I and II addresses, where there were more than two children aged 0-15, two children were selected at random. Information was obtained directly from persons aged 13 and over. Information about children aged under 13 was obtained from a parent, with the child present.

An interview with each eligible person (Stage 1) was followed by a visit by a nurse (Stage 2), who made a number of measurements and in some cases obtained a blood sample and a saliva sample. Both interviewers and nurses used computer-assisted interviewing. Blood and saliva samples were sent to a laboratory for analysis.

WEIGHTING

In HSE 2002, the sample was boosted in order to obtain greater numbers of children, young adults (aged 16-24) and mothers of infants under 1. While children aged 0-15 and young adults aged 16-24 were sampled from all selected addresses, adults aged 25 and over were selected only at Sample II addresses (i.e. they were selected at only 9 out of the 38 addresses included within each postcode sector). Consequently, in HSE 2002, those aged 25 and over were under-represented in the final dataset. Different weights were applied to different age groups as explained below:

MISSING VALUES

Value

Returns a data table. Note that:

Examples

1
2
3
4
5
6
## Not run: 

data_2002 <- read_2002("X:/", "ScHARR/PR_Consumption_TA/HSE/HSE 2002/UKDA-4912-tab/tab/hse02ai.tab")


## End(Not run)

dosgillespie/hseclean documentation built on May 2, 2020, 1:15 a.m.