Description Arguments Details Examples
This package contains methods, which enables clustering in dataframes. Particularly useful for bio-mathematics, cognitive sciences, etc.
tib |
Tibble/Dataframe to be clustered. Method also possible with vectors. |
by |
string vector. Specifies the column(s) for geometric data, according to which the clusters are to be built. |
filter.by |
string vector. Defaults to |
keep |
string vector. Defaults to |
near |
symmetric function in two arguments. This function operates pairs of entries in the columns with geometric data and returns |
min.dist |
a real number. Defaults to |
max.dist |
a real number. Defaults to |
strict |
boolean. Defaults to |
cluster.name |
string. Defaults to |
min.size |
a natural number. Defaults to |
max.size |
a natural number. Defaults to |
split |
boolean. Defaults to |
is.lexical |
boolean. Defaults to |
no.overlaps |
boolean. Defaults to |
summary |
boolean. Defaults to |
as.interval |
boolean. Defaults to |
cd <- clusterby::clusterdataframe(tib)
cd$build(...)
cd$summarise(...)
cd$get('original', ...)
cd$get('clusters', summary=<lgl>, ...)
1 2 3 4 5 6 7 8 9 10 11 | cdf <- clusterby::clusterdataframe(gene);
cdf$build(by='position', filter.by=c('gene','actve'), min.size=4, max.dist=400, strict=TRUE, is.lexical=TRUE, no.overlaps=TRUE);
cdf <- clusterby::clusterdataframe(protein3d);
cdf$build(by=c('x','y','z'), filter.by='celltype', max.dist=5.8e-7, cluster.name='segment');
cdf <- clusterby::clusterdataframe(soil_data);
cdf$build(by=c('x','y'), filter.by=c('density','substance'), max.dist=10e-3, cluster.name='clump');
data <- cdf$get('clusters');
tib <- cdf$get('clusters', keep=c('colour','age'));
tib <- cdf$get('clusters', summary=FALSE);
tib_summ <- cdf$get('clusters', summary=TRUE, as.interval=TRUE);
tib_summ <- cdf$get('clusters', summary=TRUE, as.interval=FALSE);
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.