mlrl.common.cython.partition_sampling module¶

@author: Michael Rapp (michael.rapp.ml@gmail.com)

class mlrl.common.cython.partition_sampling.ExampleWiseStratifiedBiPartitionSamplingConfig¶

Bases: object

Allows to configure a method for partitioning the available training examples into a training set and a holdout set using stratification, where distinct label vectors are treated as individual classes.

get_holdout_set_size() → float¶

Returns the fraction of examples that are included in the holdout set.

Returns:: The fraction of examples that are included in the holdout set

set_holdout_set_size(holdout_set_size: float) → ExampleWiseStratifiedBiPartitionSamplingConfig¶

Sets the fraction of examples that should be included in the holdout set.

Parameters:: holdout_set_size – The fraction of examples that should be included in the holdout set, e.g., a value of 0.6 corresponds to 60 % of the available examples. Must be in (0, 1)
Returns:: An ExampleWiseStratifiedBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training set and a holdout set

class mlrl.common.cython.partition_sampling.LabelWiseStratifiedBiPartitionSamplingConfig¶

Bases: object

Allows to configure a method for partitioning the available training examples into a training set and a holdout set using stratification, such that for each label the proportion of relevant and irrelevant examples is maintained.

get_holdout_set_size() → float¶

Returns the fraction of examples that are included in the holdout set.

Returns:: The fraction of examples that are included in the holdout set

set_holdout_set_size(holdout_set_size: float) → LabelWiseStratifiedBiPartitionSamplingConfig¶

Sets the fraction of examples that should be included in the holdout set.

Parameters:: holdout_set_size – The fraction of examples that should be included in the holdout set, e.g., a value of 0.6 corresponds to 60 % of the available examples. Must be in (0, 1)
Returns:: An LabelWiseStratifiedBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training set and a holdout set

class mlrl.common.cython.partition_sampling.RandomBiPartitionSamplingConfig¶

Bases: object

Allows to configure a method for partitioning the available training examples into a training set and a holdout set that randomly splits the training examples into two mutually exclusive sets.

get_holdout_set_size() → float¶

Returns the fraction of examples that are included in the holdout set.

Returns:: The fraction of examples that are included in the holdout set

set_holdout_set_size(holdout_set_size: float) → RandomBiPartitionSamplingConfig¶

Sets the fraction of examples that should be included in the holdout set.

Parameters:: holdout_set_size – The fraction of examples that should be included in the holdout set, e.g., a value of 0.6 corresponds to 60 % of the available examples. Must be in (0, 1)
Returns:: A RandomBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training set and a holdout set