| cb_checkpoint | R Documentation |
Forces materialization of a lazy Spark plan. Useful in long pipelines where query plans get too deep and the optimizer starts re-computing upstream steps. For local data frames, this is a no-op.
cb_checkpoint(sdf, name = NULL, mode = c("memory", "disk", "register"))
| sdf | A Spark DataFrame (tbl_spark) or local data frame. |
| name | Optional. Name to register the checkpoint under (Spark only). If NULL, a temporary name is generated. |
| mode | Character. One of "memory", "disk", or "register". |
The (possibly materialized) data frame.
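The behaviour described above can be illustrated with a minimal sketch. It is not the package's implementation: `cb_checkpoint_sketch` is a hypothetical stand-in, and the mapping of each `mode` onto sparklyr calls (`sdf_persist()` with a storage level, `sdf_register()`) is an assumption made only for illustration. The local data frame branch follows the documented no-op behaviour.

```r
# Hypothetical stand-in for cb_checkpoint(); the Spark branch is an assumed
# mapping of modes onto sparklyr functions, not the package's actual code.
cb_checkpoint_sketch <- function(sdf, name = NULL,
                                 mode = c("memory", "disk", "register")) {
  mode <- match.arg(mode)

  # Local data frames are returned unchanged (documented no-op).
  if (!inherits(sdf, "tbl_spark")) {
    return(sdf)
  }

  if (!requireNamespace("sparklyr", quietly = TRUE)) {
    stop("sparklyr is required to checkpoint a Spark DataFrame.")
  }

  # Assumption: "memory" and "disk" persist at different storage levels,
  # while "register" only registers the table under `name`.
  switch(mode,
    memory   = sparklyr::sdf_persist(sdf, storage.level = "MEMORY_ONLY", name = name),
    disk     = sparklyr::sdf_persist(sdf, storage.level = "DISK_ONLY", name = name),
    register = sparklyr::sdf_register(sdf, name = name)
  )
}

# Local usage: the input comes back as-is.
local_df <- data.frame(id = 1:3, value = c(0.5, 1.5, 2.5))
result <- cb_checkpoint_sketch(local_df, mode = "disk")
```

With a Spark connection, the same call would be placed between long chains of transformations so that later steps read from the materialized table instead of replaying the full plan.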