mlrl.testbed_sklearn.experiments.input.dataset.splitters.extension module

Author: Michael Rapp (michael.rapp.ml@gmail.com)

Provides classes that allow configuring the functionality to split datasets into training and test datasets.

class mlrl.testbed_sklearn.experiments.input.dataset.splitters.extension.DatasetSplitterExtension(*dependencies: Extension)

Bases: Extension

An extension that configures the functionality to split tabular datasets into training and test datasets.

DATASET_FORMAT = <mlrl.util.cli.SetArgument object>
DATASET_READER_EXTENSIONS: list[DatasetFileExtension] = [<mlrl.testbed_sklearn.experiments.input.dataset.extension.ArffFileExtension object>, <mlrl.testbed_sklearn.experiments.input.dataset.extension.SvmFileExtension object>]
static get_dataset_splitter(args: Namespace, load_dataset: bool = True) DatasetSplitter

Returns the DatasetSplitter to be used for splitting datasets into training and test datasets according to the configuration.

Parameters:
  • args – The command line arguments specified by the user

  • load_dataset – True, if the dataset should be loaded, False otherwise

Returns:

The DatasetSplitter to be used

static get_random_state(args: Namespace) int

Returns the seed to be used by random number generators.

Parameters:

args – The command line arguments specified by the user

Returns:

The seed to be used

get_supported_modes() set[ExperimentMode]

See mlrl.testbed.extensions.extension.Extension.get_supported_modes()