le.pesquisa: Function that reads microdata from the IBGE survey

Description Usage Arguments Details Value Author(s) References Examples

View source: R/le.pesquisa.R

Description

Performs a block read, recovering only the wanted variables from the IBGE surveys.

Usage

1
2
le.pesquisa(dicionario, pathname.in, codigos, rotulos = NULL, tbloco =
2000, nlines)

Arguments

dicionario

A data frame with 4 or 6 columns, supplied with this package. Contains the identification and description of all variables contained in the microdata file as well as other necessary information for reading. Each file must have a specific microdata dictionary.

pathname.in

The name of the file which the data are to be read from.

codigos

Array of characters with codes of the wanted variables to be read, as specified in dictionary.

rotulos

A optional variable formed by a data frame with 3 columns, supplied with this package. Contains the length and initial information all categorical variables contained in the microdata file.

tbloco

Integer that indicates the block size read. If not specified will be read in 2000 lines ever, an appropriate amount of a RAM 512MB computer. For a computer with less than 512MB of memory, tbloco must be less than 2000. For a machine with more 512MB of memory, tbloco may be larger than 2000, reducing the read time.

nlines

The number of lines of the file to be read.

Details

The le.pesquisa allows the reading of the microdata household surveys of IBGE, as the PNAD and PME. the microdata are files of text type, where the columns of data are arranged side by side without any separation. In fact, this function can be used to read any text file data that is organized in the same format, it is enough to provide a desired dictionary data file. The dictionaries of PNAD and PME are provided by IBGE in this package. To read more then 20 variables of the people PNAD, it is recommended that your computer has at least 1GB of memory.

Value

As a result of the reading is given an data frame only with the chosen variables and the number of rows from the original microdata. The names of the data frame columns are contained in the codigos argument .

Author(s)

Alexandre Rademaker alexandre.rademaker@fgv.br

Rafael F. Haeusler rafael.f.haeusler@gmail.com

References

See the original le.pesquisa from IBGEPesq package available at http://www.ibge.gov.br.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
 ## Not run: 
     library(dicionariosIBGE)
     data(dicPNAD2009)
    
     # Set this \code{path} to point to the microdata file:
     dados <- le.pesquisa(dicionario=dicdom2009,
                          pathname.in=path,
                          codigos=dicdom2009[c(4:12),2], nlines = 1000)

     #with \code{rotulos} != NULL
     # Set this \code{path} to point to the microdata file:
     dados <- le.pesquisa(dicionario=dicdom2009,
                          pathname.in=path,
                          codigos=dicdom2009[c(4:12),2],rotulos=rotdom2009, nlines = 1000)
                           
                          
    
## End(Not run)

dicionariosIBGE documentation built on May 1, 2019, 9:52 p.m.