prepareDSList: Prepares a data frame 'dsList', which describes the data...

View source: R/functions.R

Description

The data frame dsList is needed by dsRead() to read the data stored in the directory tree below dsList$pathData. The directory pathDescription is expected to contain the file contents.xml and a subdirectory scripts with R scripts for reading the data sets.

Usage

prepareDSList(pathData, pathDescription)

Arguments

pathData

Character. A path to the required data directory.

pathDescription

Character. A path to a directory containing a description of the required data, in particular the file "contents.xml".

Details

The character "~" expands to your home directory.
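As a small illustration of what this expansion looks like, the sketch below uses base R's path.expand(). The directory names are hypothetical, and whether prepareDSList() performs the expansion through path.expand() internally is an assumption; the code only shows the effect on a path string.

```r
# Hypothetical directory names, used only for illustration.
pathData <- "~/mlData/data"
pathDescription <- "~/mlData/description"

# path.expand() replaces a leading "~" with the user's home directory.
expandedData <- path.expand(pathData)
expandedDescription <- path.expand(pathDescription)

print(expandedData)

# Check whether the description directory holds the expected file.
print(file.exists(file.path(expandedDescription, "contents.xml")))
```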

The directory pathData need not contain all the data sets listed in pathDescription/contents.xml. The function getAvailable() is called and its output is stored in the column availability of the output data frame. This column is logical and specifies, for each data set, whether it is present in pathData.
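The availability column can be used to restrict the data frame to the data sets that are actually present. The following is a minimal sketch using the example directories from the Examples section; it relies only on the column name availability stated above.

```r
library(readMLData)

pathData <- getPath("exampleData")
pathDescription <- getPath("exampleDescription")
dsList <- prepareDSList(pathData, pathDescription)

# availability is a logical column: TRUE where the data set was found.
print(table(dsList$availability))

# Keep only the rows describing data sets present in pathData.
available <- dsList[dsList$availability, ]
print(nrow(available))
```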

See http://www.cs.cas.cz/~savicky/readMLData/ for description files of some of the data sets from the UCI Machine Learning Repository. See the help page readMLData for more information on the structure of the description files.

Value

Data frame with columns pathData, pathDescription, and others as listed by getFields(). The output data frame can be used as the dsList parameter of the functions dsSearch(), dsRead(), checkConsistency(), and checkType().
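A sketch of passing the returned data frame on to dsRead() might look as follows. Two points are assumptions not confirmed on this page: that the data set identifiers are stored in a column named identification, and that dsRead() accepts the data frame and an identifier as its first two arguments. Consult the help pages of dsRead() and getFields() before relying on either.

```r
library(readMLData)

dsList <- prepareDSList(getPath("exampleData"), getPath("exampleDescription"))

# Assumption: identifiers are stored in a column named "identification".
ids <- as.character(dsList$identification[dsList$availability])

# Assumption: dsRead() takes the dsList data frame and an identifier
# as its first two arguments.
dat <- dsRead(dsList, ids[1])
str(dat)
```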

Author(s)

Petr Savicky

See Also

readMLData, getAvailable, checkConsistency.

Examples

  library(readMLData)

  # Paths to the example data and example description directories.
  pathData <- getPath("exampleData")
  pathDescription <- getPath("exampleDescription")
  dsList <- prepareDSList(pathData, pathDescription)

Example output

3 data sets available

readMLData documentation built on May 2, 2019, 7:49 a.m.