findviews_to_compare detects views on which two arbitrary sets of rows differ. It plots the results with ggplot and Shiny.
findviews_to_compare(group1, group2, data, view_size_max = NULL,
  clust_method = "complete", ...)

group1 
Logical vector of size 
group2 
Logical vector, which describes the second group to compare.
The value 
data 
Data frame or matrix to be processed 
view_size_max 
Maximum number of columns in the views. If set to

clust_method 
Character describing a clustering method, used internally
by 
... 
Optional Shiny parameters, used in Shiny's

The function findviews_to_compare takes two groups of rows as input and detects views on which the statistical distributions of those two groups differ.
To detect the set of views, findviews_to_compare eliminates the rows which are present in neither group and applies findviews.
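The row-elimination step can be sketched in base R. This is only an illustration, assuming that each group vector holds one logical entry per row of data; the names keep, data_kept, group1_kept and group2_kept are hypothetical and are not taken from the package.

```r
# Illustrative sketch only, not the package's internal code.
data   <- mtcars
group1 <- data$mpg >= 25   # first group of rows
group2 <- data$mpg < 15    # second group of rows

# Keep only the rows that belong to at least one of the two groups
keep        <- group1 | group2
data_kept   <- data[keep, , drop = FALSE]
group1_kept <- group1[keep]
group2_kept <- group2[keep]

# Every remaining row is now in group1, group2, or both
stopifnot(all(group1_kept | group2_kept))
```

Rows with an mpg between 15 and 25 fall in neither group here, so they are dropped before any view is searched for.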
To evaluate the differentiation power of the views, findviews computes the histograms of the two groups to be compared and measures their dissimilarity with the Euclidean distance.
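As a rough illustration of the histogram comparison, the sketch below scores a single numeric column. The helper hist_distance, the number of bins, the shared breaks and the normalisation to proportions are all assumptions made for this example; the package may bin and scale differently.

```r
# Hypothetical helper, not part of findviews.
# Assumes x is numeric and both groups contain at least one row.
hist_distance <- function(x, in_group1, in_group2, n_bins = 10) {
  stopifnot(is.numeric(x), any(in_group1), any(in_group2))
  # Derive the breaks from the pooled data so both histograms share bins
  breaks <- hist(x, breaks = n_bins, plot = FALSE)$breaks
  h1 <- hist(x[in_group1], breaks = breaks, plot = FALSE)$counts
  h2 <- hist(x[in_group2], breaks = breaks, plot = FALSE)$counts
  # Convert counts to proportions so that group sizes do not dominate
  p1 <- h1 / sum(h1)
  p2 <- h2 / sum(h2)
  # Euclidean distance between the two histograms
  sqrt(sum((p1 - p2)^2))
}

# Example: how differently is horsepower distributed in the two groups?
d_hp <- hist_distance(mtcars$hp, mtcars$mpg >= 20, mtcars$mpg < 20)
```

A score of 0 means the two histograms are identical on the chosen bins; larger values indicate a stronger difference.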
This method is loosely based on the following paper:
Fast, Explainable View Detection to Characterize Exploration Queries
Thibault Sellam, Martin Kersten
SSDBM, 2016

## Not run:
library(findviews)
findviews_to_compare(mtcars$mpg >= 20, mtcars$mpg < 20, mtcars)
## End(Not run)
