spark_auto_broadcast_join_threshold: Retrieves or sets the auto broadcast join threshold
In sparklyr: R Interface to Apache Spark

spark_auto_broadcast_join_threshold

R Documentation

Retrieves or sets the auto broadcast join threshold

Description

Configures the maximum size in bytes for a table that will be broadcast to all worker nodes when performing a join. By setting this value to -1 broadcasting can be disabled. Note that currently statistics are only supported for Hive Metastore tables where the command 'ANALYZE TABLE <tableName> COMPUTE STATISTICS noscan' has been run, and file-based data source tables where the statistics are computed directly on the files of data.

Usage

spark_auto_broadcast_join_threshold(sc, threshold = NULL)

Arguments

`sc`	A `spark_connection`.
`threshold`	Maximum size in bytes for a table that will be broadcast to all worker nodes when performing a join. Defaults to `NULL` to retrieve configuration entries.

sparklyr
R Interface to Apache Spark

spark_auto_broadcast_join_threshold: Retrieves or sets the auto broadcast join threshold
In sparklyr: R Interface to Apache Spark

Retrieves or sets the auto broadcast join threshold

Description

Usage

Arguments

See Also

Related to spark_auto_broadcast_join_threshold in sparklyr...

R Package Documentation

Browse R Packages

We want your feedback!

sparklyr R Interface to Apache Spark

spark_auto_broadcast_join_threshold: Retrieves or sets the auto broadcast join threshold In sparklyr: R Interface to Apache Spark

Retrieves or sets the auto broadcast join threshold

Description

Usage

Arguments

See Also

Related to spark_auto_broadcast_join_threshold in sparklyr...

R Package Documentation

Browse R Packages

We want your feedback!

sparklyr
R Interface to Apache Spark

spark_auto_broadcast_join_threshold: Retrieves or sets the auto broadcast join threshold
In sparklyr: R Interface to Apache Spark