.parse_feature_selection_settings | R Documentation |
Internal function for parsing settings related to feature selection
.parse_feature_selection_settings(
config = NULL,
data,
parallel,
outcome_type,
fs_method = waiver(),
fs_method_parameter = waiver(),
vimp_aggregation_method = waiver(),
vimp_aggregation_rank_threshold = waiver(),
parallel_feature_selection = waiver(),
...
)
config |
A list of settings, e.g. from an xml file. |
data |
Data set as loaded using the |
parallel |
Logical value that whether familiar uses parallelisation. If
|
outcome_type |
Type of outcome found in the data set. |
fs_method |
(required) Feature selection method to be used for
determining variable importance. More than one feature selection method can be chosen. The experiment will then repeated for each feature selection method. Feature selection methods determines the ranking of features. Actual selection of features is done by optimising the signature size model hyperparameter during the hyperparameter optimisation step. |
fs_method_parameter |
(optional) List of lists containing parameters for feature selection methods. Each sublist should have the name of the feature selection method it corresponds to. Most feature selection methods do not have parameters that can be set. Please refer to the vignette on feature selection methods for more details. Note that if the feature selection method is based on a learner (e.g. lasso regression), hyperparameter optimisation may be performed prior to assessing variable importance. |
vimp_aggregation_method |
(optional) The method used to aggregate variable importances over different data subsets, e.g. bootstraps. The following methods can be selected:
The feature selection methods vignette provides additional information. |
vimp_aggregation_rank_threshold |
(optional) The threshold used to define the subset of highly important features. If not set, this threshold is determined by maximising the variance in the occurrence value over all features over the subset size. This parameter is only relevant for |
parallel_feature_selection |
(optional) Enable parallel processing for
the feature selection workflow. Defaults to |
... |
Unused arguments. |
List of parameters related to feature selection.
Wald, R., Khoshgoftaar, T. M., Dittman, D., Awada, W. & Napolitano, A. An extensive comparison of feature ranking aggregation techniques in bioinformatics. in 2012 IEEE 13th International Conference on Information Reuse Integration (IRI) 377–384 (2012).
Meinshausen, N. & Buehlmann, P. Stability selection. J. R. Stat. Soc. Series B Stat. Methodol. 72, 417–473 (2010).
Haury, A.-C., Gestraud, P. & Vert, J.-P. The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures. PLoS One 6, e28210 (2011).
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.