Description Usage Arguments Details Value Examples
Searches a statistically significant decomposable model to explain a dataset using Prioritized Chordalysis.
1 | ChoR.Budget(x, pValueThreshold = 0.05, budgetShare = 0.01, card = NULL)
|
x |
A dataframe with categorical data; column names are the name of the attributes. |
pValueThreshold |
A double value, minimum p-value for statistical consistency (commonly 0.05) |
budgetShare |
A double value, share of the statistical budget to consume at each step (>0 and <=1; 0.01 seems like a reasonable value for most datasets) |
card |
A vector containing the cardinality of the attributes (position wise). |
Call the Budget chordalysis function on the dataframe x. The optionnal card argument can provide a vector of cardinalities for each attribute (i.e. column) of the dataframe. If absent, the cardinalities are computed from the dataframe, but not accurate if some possible values never show up. See papers "Scaling log-linear analysis to high-dimensional data, ICDM 2013", "Scaling log-linear analysis to datasets with thousands of variables, SDM 2015", and "A multiple test correction for streams and cascades of statistical hypothesis tests, KDD 2016" for more details.
A Chordalysis object. Use ChoR.as.*
functions to access the result.
1 2 3 | ## Not run: res = ChoR.Budget(data)
## Not run: res = ChoR.Budget(data, budgetShare=0.0)
## Not run: res = ChoR.Budget(data, 0.05, card = c(3, 5, 4, 4, 3, 2, 3, 3))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.