Natality_2014_100k: Medical info on each birth in the US in 2014

Description Usage Format Details See Also Examples

Description

The Centers for Disease Control collects data on all births registered in the US (50 states + DC). The full data set, Natality_2014, contains entries for the full set of births keeping only a subset of variables related to potential risk factors and outcomes. Natality_2014_100k, Natality_2014_10k, Natality_2014_1k are random samples of size one-hundred thousand, ten thousand, and one thousand respectively and are provided automatically when the natality2014 package is attached. A larger sample of one-million cases, Natality_2014_1000k, and the complete set of cases in Natality_2014 are also provided by the package, but not through the usual data() interface. To access those large files, see the instructions given in Larger_natality_data_files.

Usage

1

Format

A data frame with a random sample of size 100000 from the complete CDC set of 3,998,175 cases, each of which is a birth in the US in 2014.

Details

Variable names are the same as in the original CDC file.

Variables have been re-coded to translate markers like 99 into 'NA' and to self-document the levels of the variables.

Note that the data include only registered births. Fetal deaths are not included. For reference, there were about 24,000 fetal deaths (20 weeks and thereafter) in the US in 2013. (See http://www.cdc.gov/nchs/data/nvsr/nvsr64/nvsr64_08.pdf.) This corresponds to about 0.6 reports about 16,000 infant deaths before age 28 days and another 7500 deaths before age 1 year.

The smaller samples are nested in the larger ones. That is, all the cases in Natality_2014_10k are included in Natality_2014_1k, and so on. Larger data files are available by following the instructions in Larger_natality_data_files: Natality_2014 with 3.99 million cases and Natality_2014_1000k with a random sample of one million cases.

See Also

Larger_natality_data_files

Examples

1
2
3
table(Natality_2014_100k$ilive)
table(Natality_2014_10k$ilive) # from sample of 10,000
table(Natality_2014_1k$ilive) # from sample of 1000

dtkaplan/natality2014 documentation built on May 15, 2019, 5:22 p.m.