join.tbl_spark | R Documentation |
These functions are wrappers around their 'dplyr' equivalents that set Spark SQL-compliant values for the 'suffix' argument by replacing dots ('.') with underscores ('_'). See [join] for a description of the general purpose of the functions.
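The suffix handling described above can be pictured with a small sketch. The helper name 'sanitize_suffix' is hypothetical and is not claimed to be the package's internal implementation; it only illustrates replacing every literal dot with an underscore.

```r
# Hypothetical helper (not part of any package API): replace each literal
# dot in a suffix with an underscore, so that the suffixed column names
# do not contain dots when used in Spark SQL.
sanitize_suffix <- function(suffix) {
  # fixed = TRUE treats "." as a literal character, not a regex wildcard
  gsub(".", "_", suffix, fixed = TRUE)
}

sanitize_suffix(c(".x", ".y"))
# [1] "_x" "_y"
```

Under this assumption, the 'dplyr' default suffixes '.x' and '.y' map onto the defaults shown in the usage below.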
## S3 method for class 'tbl_spark'
inner_join(
  x,
  y,
  by = NULL,
  copy = FALSE,
  suffix = c("_x", "_y"),
  auto_index = FALSE,
  ...,
  sql_on = NULL
)
## S3 method for class 'tbl_spark'
left_join(
  x,
  y,
  by = NULL,
  copy = FALSE,
  suffix = c("_x", "_y"),
  auto_index = FALSE,
  ...,
  sql_on = NULL
)
## S3 method for class 'tbl_spark'
right_join(
  x,
  y,
  by = NULL,
  copy = FALSE,
  suffix = c("_x", "_y"),
  auto_index = FALSE,
  ...,
  sql_on = NULL
)
## S3 method for class 'tbl_spark'
full_join(
  x,
  y,
  by = NULL,
  copy = FALSE,
  suffix = c("_x", "_y"),
  auto_index = FALSE,
  ...,
  sql_on = NULL
)
x, y |
A pair of lazy data frames backed by database queries. |
by |
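A join specification created with 'join_by()', or a character vector of variables to join by.
If 'NULL', the default, the join is a natural join that uses all variables common to 'x' and 'y'. A message lists the variables so that you can check they are correct; supply 'by' explicitly to suppress it.
To join on different variables between 'x' and 'y', use a 'join_by()' specification. For example, 'join_by(a == b)' matches 'x$a' to 'y$b'.
To join by multiple variables, use a 'join_by()' specification with multiple expressions. For example, 'join_by(a == b, c == d)' matches 'x$a' to 'y$b' and 'x$c' to 'y$d'.
For simple equality joins, you can alternatively specify a character vector
of variable names to join by. For example, 'by = c("a", "b")' joins 'x$a' to 'y$a' and 'x$b' to 'y$b'. If the variable names differ between 'x' and 'y', use a named character vector such as 'by = c("x_a" = "y_a", "x_b" = "y_b")'.
To perform a cross-join, generating all combinations of 'x' and 'y', see 'cross_join()'. |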
copy |
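If 'x' and 'y' are not from the same data source and 'copy' is 'TRUE', then 'y' will be copied into the same src as 'x'. This allows you to join tables across srcs, but it is a potentially expensive operation, so you must opt into it. |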
suffix |
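If there are non-joined duplicate variables in 'x' and 'y', these suffixes will be added to the output to disambiguate them. Should be a character vector of length 2. |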
auto_index |
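If 'copy' is 'TRUE', automatically create indices for the variables in 'by'. This may speed up the join if there are matching indexes in 'x'. |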
... |
Other parameters passed onto methods. |
sql_on |
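A custom join predicate as an SQL expression.
Usually joins use column equality, but you can perform more complex
queries by supplying 'sql_on', which should be an SQL expression that uses the 'LHS' and 'RHS' aliases to refer to the left-hand side and right-hand side of the join respectively. |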
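The arguments above can be combined as in the following minimal sketch. It assumes 'sc' is an open connection created with 'sparklyr::spark_connect()'; the function name 'join_demo', the table names, and the data are all hypothetical. The joins are wrapped in a function so that nothing touches Spark until it is called, and no output is shown because the result depends on the cluster and package versions.

```r
# Hypothetical demo; `sc` is assumed to be a live connection from
# sparklyr::spark_connect(). Table names and data are made up.
join_demo <- function(sc) {
  orders <- dplyr::copy_to(
    sc, data.frame(id = c(1, 2, 3), value = c(10, 20, 30)),
    name = "orders_demo", overwrite = TRUE
  )
  refunds <- dplyr::copy_to(
    sc, data.frame(id = c(2, 3, 4), value = c(5, 6, 7)),
    name = "refunds_demo", overwrite = TRUE
  )

  list(
    # Simple equality join on a shared column; the duplicated `value`
    # column is disambiguated with the default suffixes "_x" and "_y".
    equality = dplyr::inner_join(orders, refunds, by = "id"),

    # Dotted suffixes; per the description, dots are expected to be
    # replaced with underscores before the query is built.
    dotted_suffix = dplyr::left_join(
      orders, refunds, by = "id", suffix = c(".ord", ".ref")
    ),

    # Custom predicate: LHS and RHS refer to `orders` and `refunds`.
    predicate = dplyr::inner_join(
      orders, refunds, sql_on = "LHS.value > RHS.value"
    )
  )
}
```

Each element of the returned list is a lazy table; calling 'dplyr::collect()' on one would bring its rows into R, and 'dplyr::show_query()' would print the generated SQL.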