| LOGANTree | R Documentation | 
This package enables users to model log-file data from computer-based assessments using machine-learning techniques. It allows researchers to generate new knowledge by comparing the performance of three tree-based classification models (i.e., decision trees, random forest, and gradient boosting) to predict student’s outcome. It also contains a set of handful functions for the analysis of the features’ influence on the modeling. Data from the Climate control item from the 2012 Programme for International Student Assessment (PISA, <https://www.oecd.org/pisa/>) is available for an illustration of the package’s capability. An application of the package functions for a math item in PISA 2012 is described in Qin (2022).
The LOGANTree functions can be categorized in two types: (a) tree-based modeling and (b) features’ analysis. While the first one provides tools for the specification and the evaluation of the three classification models, the second category is devoted to a careful analysis of the data features and their influence on the model’s results. We use the caret package to perform most of the analyses and we provide summary reports and data visualization tools to better compare the three classifiers.
What follows is a list of functions organized per category:
Tree-based modeling:
TreeModels
DataPartition
TreeModelsAllSteps
PerformanceMatrics
RocPlot
Features’ analysis:
NearZeroVariance
DtResult(
VariableImportanceTable
VariableImportancePlot
ChisquareTable
ChisquarePlot
PartialDependencePlot
Qi Qin [aut, cre],
Denise Reis Costa [aut, ths]
Qin, Q. (2022). Application of tree-based data mining techniques to examine log file data from a 2012 PISA computer-based Mathematics item. [Unpublished thesis]. University of Oslo.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.