read_dataset: Functions to read the most common Brazilian datasets

Description

The read_DatasetName functions read each dataset easily and quickly. They gather the import parameters from the DatasetName_metadata_file_harmonization.csv files (available in the extdata folder) and the import dictionaries (available in the data folder). Based on these dictionaries and parameters, they dispatch the read_data function, which reads the file.
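
To make the dictionary-driven import concrete, here is a minimal base R sketch of reading a fixed-width file from a dictionary of column positions. It is not the package's read_data implementation: the helper read_with_dictionary, the dictionary columns var_name, int_pos and fin_pos, and the toy file are all assumptions made for illustration.

```r
# Toy fixed-width file: 4-digit year, 2-digit state code, 3-digit age.
tmp <- tempfile(fileext = ".txt")
writeLines(c("201053034", "201035027"), tmp)

# Hypothetical import dictionary: one row per variable with its column span.
dic <- data.frame(
  var_name = c("year", "uf", "age"),
  int_pos  = c(1L, 5L, 7L),   # first character of the field
  fin_pos  = c(4L, 6L, 9L),   # last character of the field
  stringsAsFactors = FALSE
)

# Illustrative helper (not part of the package): slice each line
# according to the dictionary, optionally keeping only some variables
# and only the first `nrows` lines.
read_with_dictionary <- function(file, dic, vars_subset = NULL, nrows = -1L) {
  if (!is.null(vars_subset)) {
    dic <- dic[dic$var_name %in% vars_subset, , drop = FALSE]
  }
  lines <- readLines(file, n = nrows)
  cols <- lapply(seq_len(nrow(dic)), function(k) {
    substr(lines, dic$int_pos[k], dic$fin_pos[k])
  })
  names(cols) <- dic$var_name
  as.data.frame(cols, stringsAsFactors = FALSE)
}

# Read only two of the three variables; all columns come back as character.
d <- read_with_dictionary(tmp, dic, vars_subset = c("uf", "age"))
print(d)
```

The sketch keeps every field as character. A real import would also need to convert column types, which the toy dictionary above does not describe.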

Usage

read_CensoEscolar(ft, i, harmonize_varnames = F, root_path = NULL,
  file = NULL, vars_subset = NULL, nrows = -1L,
  source_file_mark = F)

read_CensoEducacaoSuperior(ft, i, root_path = NULL, file = NULL,
  vars_subset = NULL, nrows = -1L, source_file_mark = F)

read_ENEM(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)

read_CENSO(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  UF = NULL, nrows = -1L, source_file_mark = F)

read_RAIS(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  UF = NULL, nrows = -1L, source_file_mark = F)

read_CAGED(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)

read_PNAD(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)

read_PME(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)

read_POF(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)

read_PNADcontinua(ft, i, root_path = NULL, file = NULL,
  vars_subset = NULL, nrows = -1L, source_file_mark = F)

read_PNS(ft, i, root_path = NULL, file = NULL, vars_subset = NULL,
  nrows = -1L, source_file_mark = F)
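
A call might look like the guarded sketch below. The directory pnad_dir and the ft value "domicilios" are assumptions, not values confirmed by this page. The call is skipped unless the package is installed and the directory exists.

```r
# Hypothetical location of previously downloaded PNAD source files.
pnad_dir <- file.path(tempdir(), "PNAD_2002")

# Only attempt the import when the package and the data are both present.
can_run <- requireNamespace("microdadosBrasil", quietly = TRUE) &&
  dir.exists(pnad_dir)

if (can_run) {
  d <- microdadosBrasil::read_PNAD(
    ft = "domicilios",     # assumed subdataset name; see the ft argument
    i = 2002,              # period
    root_path = pnad_dir,  # where the source files were downloaded
    nrows = 100L           # read only the first 100 rows
  )
  str(d)
} else {
  message("Package or data not available; skipping the example call.")
}
```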

Arguments

ft

file type. Indicates the subdataset within the dataset, for example "pessoa" (person) or "domicílio" (household) data from the "CENSO" (Census). To list the ft values available for a period, pass an invalid ft (e.g. ft = 'aasfasf').

i

period. Normally the year in YYYY format.

harmonize_varnames

Should variable names be harmonized across the periods of the subdataset (ft)?

root_path

(optional) path to the directory where the dataset was downloaded

file

(optional) file to read; all metadata is ignored in this case

vars_subset

(optional) read only the selected variables (named as in the dictionary for fwf files, or as in the first row for delimited files)

nrows

(optional) read only the first n rows

source_file_mark

(optional) TRUE/FALSE. If TRUE, creates a variable with the name of the file each observation was imported from. Useful for datasets split across many separate files (CENSO and RAIS)

UF

(optional) only for CENSO and RAIS. Use this option to read only the files for the selected Brazilian states, e.g. c("DF")
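
The effect of UF and source_file_mark on a dataset split into one file per state can be sketched in base R as follows. This is an illustration under assumptions, not the package's code: the helper read_states, the one-CSV-per-state layout and the source_file column name are all hypothetical.

```r
# Toy dataset split into one file per state.
dir_in <- tempfile("states_")
dir.create(dir_in)
write.csv(data.frame(id = 1:2), file.path(dir_in, "DF.csv"), row.names = FALSE)
write.csv(data.frame(id = 3:5), file.path(dir_in, "SP.csv"), row.names = FALSE)

# Illustrative helper (not part of the package): read the files of the
# selected states and optionally record which file each row came from.
read_states <- function(dir, UF = NULL, source_file_mark = FALSE) {
  files <- list.files(dir, pattern = "\\.csv$", full.names = TRUE)
  if (!is.null(UF)) {
    keep <- tools::file_path_sans_ext(basename(files)) %in% UF
    files <- files[keep]
  }
  parts <- lapply(files, function(path) {
    d <- read.csv(path)
    if (source_file_mark) d$source_file <- basename(path)
    d
  })
  do.call(rbind, parts)
}

all_states <- read_states(dir_in, source_file_mark = TRUE)
only_df    <- read_states(dir_in, UF = c("DF"), source_file_mark = TRUE)
print(table(all_states$source_file))
```

Recording the file name at read time lets rows be traced back to their source file after the per-state pieces are stacked into a single data.frame.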

Value

a data.frame containing the imported data.


lucasmation/microdadosBrasil documentation built on Dec. 6, 2019, 7:14 p.m.