Tabulation between Two Columns of an H2OFrame

Share:

Description

Simple Co-Occurrence based tabulation of X vs Y, where X and Y are two Vecs in a given dataset. Uses histogram of given resolution in X and Y. Handles numerical/categorical data and missing values. Supports observation weights.

Usage

1
2
h2o.tabulate(data, x, y, weights_column = NULL, nbins_x = 50,
  nbins_y = 50)

Arguments

data

An H2OFrame object.

x

predictor column

y

response column

weights_column

(optional) observation weights column

nbins_x

number of bins for predictor column

nbins_y

number of bins for response column

Value

Returns two TwoDimTables of 3 columns each count_table: X Y counts response_table: X meanY counts

Examples

1
2
3
4
5
6
library(h2o)
h2o.init()
df <- as.h2o(iris)
tab <- h2o.tabulate(data = df, x = "Sepal.Length", y = "Petal.Width",
             weights_column = NULL, nbins_x = 10, nbins_y = 10)
plot(tab)

Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker.