freqItems: Finding frequent items for columns, possibly with false...
In danzafar/tidyspark: A Tidy Interface to Spark

Description Usage Arguments Value Note See Also Examples

Finding frequent items for columns, possibly with false positives. Using the frequent element count algorithm described in https://doi.org/10.1145/762471.762473, proposed by Karp, Schenker, and Papadimitriou.

1	freqItems(x, cols, support = 0.01)

`x`	A spark_tbl.
`cols`	A vector column names to search frequent items in.
`support`	(Optional) The minimum frequency for an item to be considered `frequent`. Should be greater than 1e-4. Default support = 0.01.

a local R data.frame with the frequent items in each column

freqItems since 1.6.0

Other stat functions: approxQuantile(), corr(), covariance(), crosstab(), sampleBy()

## Not run: 
df <- read.json("/path/to/file.json")
fi = freqItems(df, c("title", "gender"))

## End(Not run)

danzafar/tidyspark documentation built on Sept. 30, 2020, 12:19 p.m.

danzafar/tidyspark index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

danzafar/tidyspark
A Tidy Interface to Spark

freqItems: Finding frequent items for columns, possibly with false...
In danzafar/tidyspark: A Tidy Interface to Spark

Description

Usage

Arguments

Value

Note

See Also

Examples

Related to freqItems in danzafar/tidyspark...

R Package Documentation

Browse R Packages

We want your feedback!

danzafar/tidyspark A Tidy Interface to Spark

freqItems: Finding frequent items for columns, possibly with false... In danzafar/tidyspark: A Tidy Interface to Spark

Description

Usage

Arguments

Value

Note

See Also

Examples

Related to freqItems in danzafar/tidyspark...

R Package Documentation

Browse R Packages

We want your feedback!

danzafar/tidyspark
A Tidy Interface to Spark

freqItems: Finding frequent items for columns, possibly with false...
In danzafar/tidyspark: A Tidy Interface to Spark