getGuidData: Unique identifiers for all rows of dfr and their matches in a...


View source: R/GLoMo.r

Description

Unique identifiers for all rows of dfr and their matches in glomo

Usage

getGuidData(glomo, dfr, guidPerObservation = NULL, whichHaveMissingCat, verbosity=0)

Arguments

glomo

GLoMo object (result of GLoMo call with a 'matching' dfr)

dfr

data.frame or numdfr to get the uids for.

guidPerObservation

If some of the information is already available (e.g. the uids of the rows), you can pass it here.

whichHaveMissingCat

To avoid expensive grep calls, you can pass in which rows of dfr have missing categorical columns. If not passed, this is found by calling grepl with the pattern "\\d+".

verbosity

The higher this value, the more levels of progress and debug information are displayed (note: in R for Windows, turn off buffered output).
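
As a rough illustration of what could be precomputed for whichHaveMissingCat, the base-R sketch below finds the rows of a plain data.frame that have an NA in any factor column. Whether getGuidData expects row indices or a logical vector is not stated on this page, so the result format is an assumption. rowsWithMissingCat and the toy data are hypothetical and not part of GLoMo.

```r
# Hypothetical helper (not part of GLoMo): indices of rows that have
# an NA in at least one factor (categorical) column.
rowsWithMissingCat <- function(dfr) {
  catCols <- vapply(dfr, is.factor, logical(1))
  if (!any(catCols)) return(integer(0))
  # is.na() on a data.frame yields a logical matrix, one row per observation
  naMat <- is.na(dfr[, catCols, drop = FALSE])
  unname(which(rowSums(naMat) > 0))
}

# Toy data: the NA in the numeric column 'size' is ignored on purpose
toy <- data.frame(
  size  = c(1.2, NA, 3.1, 0.7),
  color = factor(c("red", NA, "blue", "red")),
  shape = factor(c("round", "square", NA, "round"))
)
missingCatRows <- rowsWithMissingCat(toy)
```

If the function turns out to expect a logical vector instead, `seq_len(nrow(toy)) %in% missingCatRows` converts the indices.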

Details

These uids are built with the function categoricalUniqueIdentifiers
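
To give a feel for what a categorical unique identifier might look like, the sketch below pastes the integer level codes of the factor columns together with a separator. This is only an assumption about the general idea: toyCategoricalUid is a hypothetical stand-in, and the real categoricalUniqueIdentifiers may use a different signature, separator, and encoding of missing values.

```r
# Hypothetical stand-in for the idea behind categoricalUniqueIdentifiers;
# the real function's signature and output format may differ.
toyCategoricalUid <- function(dfr, separator = ",") {
  catCols <- vapply(dfr, is.factor, logical(1))
  # Integer level codes per factor column; NA marks a missing category
  codes <- lapply(dfr[, catCols, drop = FALSE], as.integer)
  # Paste the codes row by row; paste() renders NA as the string "NA"
  do.call(paste, c(unname(codes), sep = separator))
}

toy <- data.frame(
  size  = c(1.2, 2.5, 3.1),
  color = factor(c("red", NA, "blue")),         # levels: blue = 1, red = 2
  shape = factor(c("round", "square", "round")) # levels: round = 1, square = 2
)
toyUids <- toyCategoricalUid(toy)
```

Rows with the same combination of categories get the same identifier, which is what makes such strings usable as lookup keys.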

Value

List object of class GuidData with items:

guidPerObservation

Unique identifier (char vector) for each row in dfr

possibleGlomoGuidPerObs

List with, for each row of dfr, the rows in glomo that can be used for it

separator

The separator that was used (in glomo and now in the new uids)
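
The shape of such a result can be mimicked with a toy list. The sketch below assumes that a uid is a separator-joined string of category codes, that "NA" stands for a missing category, and that a glomo row "can be used" when it agrees on every observed category. All of that is an assumption made for illustration; toyMatchUids is hypothetical, the toy list does not carry the GuidData class, and the real object may store uids rather than row indices.

```r
# Toy illustration only: match observation uids against glomo uids,
# treating "NA" parts as wildcards.
toyMatchUids <- function(obsUids, glomoUids, separator = ",") {
  glomoParts <- strsplit(glomoUids, separator, fixed = TRUE)
  lapply(strsplit(obsUids, separator, fixed = TRUE), function(obs) {
    known <- obs != "NA"
    # A glomo row qualifies if it agrees on all observed (non-missing) parts
    agrees <- vapply(glomoParts,
                     function(g) all(g[known] == obs[known]),
                     logical(1))
    which(agrees)
  })
}

obsUids   <- c("2,1", "NA,2", "1,NA")
glomoUids <- c("1,1", "2,1", "1,2", "2,2")

# Same item names as documented above, filled with toy values
toyResult <- list(
  guidPerObservation      = obsUids,
  possibleGlomoGuidPerObs = toyMatchUids(obsUids, glomoUids),
  separator               = ","
)
```

A fully observed row matches at most one distinct combination, while each missing category widens the set of candidate rows.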

Author(s)

Nick Sabbe (nick.sabbe@ugent.be)

See Also

GLoMo-package, NumDfr, categoricalUniqueIdentifiers

Examples

iris.md<-randomNA(iris, 0.1)
iris.md.nd<-numdfr(iris.md)
iris.nd.rnd<-rCatsAndCntInDfr(iris.md.nd, orgriName=NULL, verbosity=1)
iris.weights<-iris.nd.rnd$weights
iris.nd.rnd<-iris.nd.rnd[,1:5]
iris.glomo<-GLoMo(iris.nd.rnd, weights=iris.weights, verbosity=1)
iris.guids<-getGuidData(iris.glomo, dfr=iris.md.nd)

GLoMo documentation built on May 2, 2019, 5:26 p.m.