Description
Internally this uses data.table for speed.
Usage

group_by_local_shuffle(dir, nworkers = 3L,
  assign_groups = data_local_group_assign, group_fun = median)

greedy_group_assign(P, nworkers)

data_local_group_assign(P, w)
Arguments

dir: directory where the data can be found
nworkers: number of workers
assign_groups: function that assigns files and groups to workers; its signature must match that of the default
group_fun: function to apply to each group
Functions

greedy_group_assign: Greedy scheduling to assign groups in a GROUP BY to workers
data_local_group_assign: Data location aware scheduling to assign groups in a GROUP BY to workers
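
To give a feel for what greedy scheduling of groups can look like, here is a minimal sketch in base R. It is not the package's implementation: the helper name greedy_assign_sketch and its sizes argument (one numeric size per group) are assumptions made for illustration, and the real greedy_group_assign takes a P argument whose structure is not documented here. The sketch uses the common "largest group first, least-loaded worker next" rule.

```r
# Hypothetical helper, NOT part of the package: illustrates one common
# greedy rule (largest group first, onto the currently least-loaded worker).
# `sizes` is an assumed input: one numeric size per group.
greedy_assign_sketch <- function(sizes, nworkers) {
  loads <- numeric(nworkers)             # running total of work per worker
  assignment <- integer(length(sizes))   # worker index chosen for each group

  # Visit groups from largest to smallest.
  for (g in order(sizes, decreasing = TRUE)) {
    w <- which.min(loads)                # least-loaded worker; ties go to the lowest index
    assignment[g] <- w
    loads[w] <- loads[w] + sizes[g]
  }
  assignment
}

# Five groups of differing size, split across two workers.
sizes <- c(50, 30, 20, 20, 10)
workers <- greedy_assign_sketch(sizes, nworkers = 2L)

# Total load that ends up on each worker.
tapply(sizes, workers, sum)
```

A rule like this only balances total work per worker; it ignores where the data already lives, which is the concern a data location aware scheme such as data_local_group_assign is described as addressing.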