training
and test
data and training time can be compared. The y-axis
shows the selected model performance metric and x-axis
the training time.confusion matrix
. Optionally show actual counts or percent for correct and incorrect classifications.Feature selection
is used to identify variables which maximize model performance. Optimal variables are identified using recursive feature elimination wherein many models are built from subsets of variables and an optimal model is identified based on which subset yielded the highest performing model.calculate
to optimized subsets for that metric). The optimal variables can be selected based on the subset function
. Options include PickSizeBest
which specifies the subset which maximized or minimized the chosen performance metric or PickSizeTolerance
which allows for models with less parameters (variables), which are also worse than the optimal model. The accepted decrease in performance is specified as a percent of the metric in tolerance
.