mlrl.common.cython.learner module¶
@author: Michael Rapp (michael.rapp.ml@gmail.com)
- class mlrl.common.cython.learner.BeamSearchTopDownRuleInductionMixin¶
Bases:
ABC
Allows to configure a rule learner to use a top-down beam search.
- abstract use_beam_search_top_down_rule_induction() BeamSearchTopDownRuleInductionConfig ¶
Configures the algorithm to use a top-down beam search for the induction of individual rules.
- Returns:
A BeamSearchTopDownRuleInductionConfig that allows further configuration of the algorithm for the induction of individual rules
- class mlrl.common.cython.learner.DefaultRuleMixin¶
Bases:
ABC
Allows to configure a rule learner to induce a default rule.
- abstract use_default_rule()¶
Configures the rule learner to induce a default rule.
- class mlrl.common.cython.learner.EqualFrequencyFeatureBinningMixin¶
Bases:
ABC
Allows to configure a rule learner to use equal-frequency feature binning.
- abstract use_equal_frequency_feature_binning() EqualFrequencyFeatureBinningConfig ¶
Configures the rule learner to use a method for the assignment of numerical feature values to bins, such that each bin contains approximately the same number of values.
- Returns:
An EqualFrequencyFeatureBinningConfig that allows further configuration of the method for the assignment of numerical feature values to bins
- class mlrl.common.cython.learner.EqualWidthFeatureBinningMixin¶
Bases:
ABC
Allows to configure a rule learner to use equal-width feature binning.
- abstract use_equal_width_feature_binning() EqualWidthFeatureBinningConfig ¶
Configures the rule learner to use a method for the assignment of numerical feature values to bins, such that each bin contains values from equally sized value ranges.
- Returns:
An EqualWidthFeatureBinningConfig that allows further configuration of the method for the assignment of numerical feature values to bins
- class mlrl.common.cython.learner.ExampleWiseStratifiedBiPartitionSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to partition the available training examples into a training set and a holdout set using stratification, where distinct label vectors are treated as individual classes.
- abstract use_example_wise_stratified_bi_partition_sampling() ExampleWiseStratifiedBiPartitionSamplingConfig ¶
Configures the rule learner to partition the available training examples into a training set and a holdout set using stratification, where distinct label vectors are treated as individual classes
- Returns:
An ExampleWiseStratifiedBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training and a holdout set
- class mlrl.common.cython.learner.ExampleWiseStratifiedInstanceSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to use example-wise stratified instance sampling.
- abstract use_example_wise_stratified_instance_sampling() ExampleWiseStratifiedInstanceSamplingConfig ¶
Configures the rule learner to sample from the available training examples using stratification, where distinct label vectors are treated as individual classes, whenever a new rule should be learned.
- Returns:
An ExampleWiseStratifiedInstanceSamplingConfig that allows further configuration of the method for sampling instances
- class mlrl.common.cython.learner.FeatureSamplingWithoutReplacementMixin¶
Bases:
ABC
Allows to configure a rule learner to use feature sampling without replacement.
- abstract use_feature_sampling_without_replacement() FeatureSamplingWithoutReplacementConfig ¶
Configures the rule learner to sample from the available features with replacement whenever a rule should be refined.
- Returns:
A FeatureSamplingWithoutReplacementConfig that allows further configuration of the method for sampling features
- class mlrl.common.cython.learner.GreedyTopDownRuleInductionMixin¶
Bases:
ABC
Allows to configure a rule learner to use a greedy top-down search for the induction of individual rules.
- abstract use_greedy_top_down_rule_induction() GreedyTopDownRuleInductionConfig ¶
Configures the algorithm to use a greedy top-down search for the induction of individual rules.
- Returns:
A GreedyTopDownRuleInductionConfig that allows further configuration of the algorithm for the induction of individual rules
- class mlrl.common.cython.learner.InstanceSamplingWithReplacementMixin¶
Bases:
ABC
Defines an interface for all classes that allow to configure a rule learner to use instance sampling with replacement.
- abstract use_instance_sampling_with_replacement() InstanceSamplingWithReplacementConfig ¶
Configures the rule learner to sample from the available training examples with replacement whenever a new rule should be learned.
- Returns:
An InstanceSamplingWithReplacementConfig that allows further configuration of the method for sampling instances
- class mlrl.common.cython.learner.InstanceSamplingWithoutReplacementMixin¶
Bases:
ABC
Defines an interface for all classes that allow to configure a rule learner to use instance sampling without replacement.
- abstract use_instance_sampling_without_replacement() InstanceSamplingWithoutReplacementConfig ¶
Configures the rule learner to sample from the available training examples without replacement whenever a new rule should be learned.
- Returns:
An InstanceSamplingWithoutReplacementConfig that allows further configuration of the method for sampling instances
- class mlrl.common.cython.learner.IrepRulePruningMixin¶
Bases:
ABC
Allows to configure a rule learner to prune individual rules by following the principles of “incremental reduced error pruning” (IREP).
- abstract use_irep_rule_pruning()¶
Configures the rule learner to prune individual rules by following the principles of “incremental reduced error pruning” (IREP).
- class mlrl.common.cython.learner.LabelSamplingWithoutReplacementMixin¶
Bases:
ABC
Allows to configure a rule learner to use label sampling without replacement.
- abstract use_label_sampling_without_replacement() LabelSamplingWithoutReplacementConfig ¶
Configures the rule learner to sample from the available labels with replacement whenever a new rule should be learned.
- Returns:
A LabelSamplingWithoutReplacementConfig that allows further configuration of the method for sampling labels
- class mlrl.common.cython.learner.LabelWiseStratifiedBiPartitionSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to partition the available training examples into a training set and a holdout set using stratification, such that for each label the proportion of relevant and irrelevant examples is maintained.
- abstract use_label_wise_stratified_bi_partition_sampling() LabelWiseStratifiedBiPartitionSamplingConfig ¶
Configures the rule learner to partition the available training examples into a training set and a holdout set using stratification, such that for each label the proportion of relevant and irrelevant examples is maintained.
- Returns:
A LabelWiseStratifiedBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training and a holdout set
- class mlrl.common.cython.learner.LabelWiseStratifiedInstanceSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to use label-wise stratified instance sampling.
- abstract use_label_wise_stratified_instance_sampling() LabelWiseStratifiedInstanceSamplingConfig ¶
Configures the rule learner to sample from the available training examples using stratification, such that for each label the proportion of relevant and irrelevant examples is maintained, whenever a new rule should be learned.
- Returns:
A LabelWiseStratifiedInstanceSamplingConfig that allows further configuration of the method for sampling instances
- class mlrl.common.cython.learner.NoFeatureBinningMixin¶
Bases:
ABC
Allows to configure a rule learner to not use any method for the assignment of numerical features values to bins.
- abstract use_no_feature_binning()¶
Configures the rule learner to not use any method for the assignment of numerical feature values to bins.
- class mlrl.common.cython.learner.NoFeatureSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to not use feature sampling.
- abstract use_no_feature_sampling()¶
Configures the rule learner to not sample from the available features whenever a rule should be refined.
- class mlrl.common.cython.learner.NoGlobalPruningMixin¶
Bases:
ABC
Allows to configure a rule learner to not use global pruning.
- abstract use_no_global_pruning()¶
Configures the rule learner to not use global pruning.
- class mlrl.common.cython.learner.NoInstanceSamplingMixin¶
Bases:
ABC
Defines an interface for all classes that allow to configure a rule learner to not use instance sampling.
- abstract use_no_instance_sampling()¶
Configures the rule learner to not sample from the available training examples whenever a new rule should be learned.
- class mlrl.common.cython.learner.NoJointProbabilityCalibrationMixin¶
Bases:
ABC
Allows to configure a rule learner to not calibrate joint probabilities.
- abstract use_no_joint_probability_calibration()¶
Configures the rule learner to not calibrate joint probabilities.
- class mlrl.common.cython.learner.NoLabelSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to not use label sampling.
- abstract use_no_label_sampling()¶
Configures the rule learner to not sample from the available labels whenever a new rule should be learned.
- class mlrl.common.cython.learner.NoMarginalProbabilityCalibrationMixin¶
Bases:
ABC
Allows to configure a rule learner to not calibrate marginal probabilities.
- abstract use_no_marginal_probability_calibration()¶
Configures the rule learner to not calibrate marginal probabilities.
- class mlrl.common.cython.learner.NoParallelPredictionMixin¶
Bases:
ABC
Allows to configure a rule learner to not use any multi-threading for prediction.
- abstract use_no_parallel_prediction()¶
Configures the rule learner to not use any multi-threading to predict for several query examples in parallel.
- class mlrl.common.cython.learner.NoParallelRuleRefinementMixin¶
Bases:
ABC
Allows to configure a rule learner to not use any multi-threading for the parallel refinement of rules.
- abstract use_no_parallel_rule_refinement()¶
Configures the rule learner to not use any multi-threading for the parallel refinement of rules.
- class mlrl.common.cython.learner.NoParallelStatisticUpdateMixin¶
Bases:
ABC
Allows to configure a rule learner to not use any multi-threading for the parallel update of statistics.
- abstract use_no_parallel_statistic_update()¶
Configures the rule learner to not use any multi-threading for the parallel update of statistics.
- class mlrl.common.cython.learner.NoPartitionSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to not partition the available training examples into a training set and a holdout set.
- abstract use_no_partition_sampling()¶
Configures the rule learner to not partition the available training examples into a training set and a holdout set.
- class mlrl.common.cython.learner.NoPostProcessorMixin¶
Bases:
ABC
Allows to configure a rule learner to not use any post processor.
- abstract use_no_post_processor()¶
Configures the rule learner to not use any post-processor.
- class mlrl.common.cython.learner.NoRulePruningMixin¶
Bases:
ABC
Allows to configure a rule learner to not prune individual rules.
- abstract use_no_rule_pruning()¶
Configures the rule learner to not prune individual rules.
- class mlrl.common.cython.learner.NoSequentialPostOptimizationMixin¶
Bases:
ABC
Allows to configure a rule learner to not use a post-optimization method that optimizes each rule in a model by relearning it in the context of the other rules.
- abstract use_no_sequential_post_optimization()¶
Configures the rule learner to not use a post-optimization method that optimizes each rule in a model by relearning it in the context of the other rules.
- class mlrl.common.cython.learner.NoSizeStoppingCriterionMixin¶
Bases:
ABC
Allows to configure a rule learner to not use a stopping criterion that ensures that the number of induced rules does not exceed a certain maximum.
- abstract use_no_size_stopping_criterion()¶
Configures the rule learner to not use a stopping criterion that ensures that the number of induced rules does not exceed a certain maximum.
- class mlrl.common.cython.learner.NoTimeStoppingCriterionMixin¶
Bases:
ABC
Allows to configure a rule learner to not use a stopping criterion that ensures that a certain time limit is not exceeded.
- abstract use_no_time_stopping_criterion()¶
Configures the rule learner to not use a stopping criterion that ensures that a certain time limit is not exceeded.
- class mlrl.common.cython.learner.ParallelPredictionMixin¶
Bases:
ABC
Allows to configure a rule learner to use multi-threading to predict for several examples in parallel.
- abstract use_parallel_prediction() ManualMultiThreadingConfig ¶
Configures the rule learner to use multi-threading to predict for several query examples in parallel.
- Returns:
A ManualMultiThreadingConfig that allows further configuration of the multi-threading behavior
- class mlrl.common.cython.learner.ParallelRuleRefinementMixin¶
Bases:
ABC
Allows to configure a rule learner to use multi-threading for the parallel refinement of rules.
- abstract use_parallel_rule_refinement() ManualMultiThreadingConfig ¶
Configures the rule learner to use multi-threading for the parallel refinement of rules.
- Returns:
A ManualMultiThreadingConfig that allows further configuration of the multi-threading behavior
- class mlrl.common.cython.learner.ParallelStatisticUpdateMixin¶
Bases:
ABC
Allows to configure a rule learner to use multi-threading for the parallel update of statistics.
- abstract use_parallel_statistic_update() ManualMultiThreadingConfig ¶
Configures the rule learner to use multi-threading for the parallel update of statistics.
- Returns:
A ManualMultiThreadingConfig that allows further configuration of the multi-threading behavior
- class mlrl.common.cython.learner.PostPruningMixin¶
Bases:
ABC
Allows to configure a rule learner to use a stopping criterion that keeps track of the number of rules in a model that perform best with respect to the examples in the training or holdout set according to a certain measure.
- abstract use_global_post_pruning() PostPruningConfig ¶
Configures the rule learner to use a stopping criterion that keeps track of the number of rules in a model that perform best with respect to the examples in the training or holdout set according to a certain measure.
- class mlrl.common.cython.learner.PrePruningMixin¶
Bases:
ABC
Allows to configure a rule learner to use a stopping criterion that stops the induction of rules as soon as the quality of a model’s predictions for the examples in the training or holdout set do not improve according to a certain measure.
- abstract use_global_pre_pruning() PrePruningConfig ¶
Configures the rule learner to use a stopping criterion that stops the induction of rules as soon as the quality of a model’s predictions for the examples in the training or holdout set do not improve according to a certain measure.
- Returns:
A PrePruningConfig that allows further configuration of the stopping criterion
- class mlrl.common.cython.learner.RandomBiPartitionSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to partition the available training example into a training set and a holdout set by randomly splitting the training examples into two mutually exclusive sets.
- abstract use_random_bi_partition_sampling() RandomBiPartitionSamplingConfig ¶
Configures the rule learner to partition the available training examples into a training set and a holdout set by randomly splitting the training examples into two mutually exclusive sets.
- Returns:
A RandomBiPartitionSamplingConfig that allows further configuration of the method for partitioning the available training examples into a training set and a holdout set
- class mlrl.common.cython.learner.RoundRobinLabelSamplingMixin¶
Bases:
ABC
Allows to configure a rule learner to sample single labels in a round-robin fashion.
- abstract use_round_robin_label_sampling()¶
Configures the rule learner to sample a single label in a round-robin fashion whenever a new rule should be learned.
- class mlrl.common.cython.learner.RuleLearner¶
Bases:
object
A rule learner.
- can_predict_binary(feature_matrix, num_labels) bool ¶
Returns whether the rule learner is able to predict binary labels or not.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
num_labels – The number of labels to predict for
- Returns:
True, if the rule learner is able to predict binary labels, False otherwise
- can_predict_probabilities(feature_matrix, num_labels) bool ¶
Returns whether the rule learner is able to predict probability estimates or not.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
num_labels – The number of labels to predict for
- Returns:
True, if the rule learner is able to predict probability estimates, False otherwise
- can_predict_scores(feature_matrix, num_labels) bool ¶
Returns whether the rule learner is able to predict regression scores or not.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
num_labels – The number of labels to predict for
- Returns:
True, if the rule learner is able to predict regression scores, False otherwise
- create_binary_predictor(feature_matrix, rule_model, label_space_info, marginal_probability_calibration_model, joint_probability_calibration_model, num_labels) BinaryPredictor ¶
Creates and returns a predictor that may be used to predict binary labels for given query examples. If the prediction of binary labels is not supported by the rule learner, a RuntimeError is thrown.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
rule_model – The RuleModel that should be used to obtain predictions
label_space_info – The LabelSpaceInfo that provides information about the label space that may be used as a basis for obtaining predictions
marginal_probability_calibration_model – The MarginalProbabilityCalibrationModel that may be used for the calibration of marginal probabilities
joint_probability_calibration_model – The JointProbabilityCalibrationModel that may be used for the calibration of joint probabilities
num_labels – The number of labels to predict for
- Returns:
A BinaryPredictor that may be used to predict binary labels for the given query examples
- create_probability_predictor(feature_matrix, rule_model, label_space_info, marginal_probability_calibration_model, joint_probability_calibration_model, num_labels) ProbabilityPredictor ¶
Creates and returns a predictor that may be used to predict probability estimates for given query examples. If the prediction of probability estimates is not supported by the rule learner, a RuntimeError is thrown.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
rule_model – The RuleModel that should be used to obtain predictions
label_space_info – The LabelSpaceInfo that provides information about the label space that may be used as a basis for obtaining predictions
marginal_probability_calibration_model – The MarginalProbabilityCalibrationModel that may be used for the calibration of marginal probabilities
joint_probability_calibration_model – The JointProbabilityCalibrationModel that may be used for the calibration of joint probabilities
num_labels – The number of labels to predict for
- Returns:
A ProbabilityPredictor that may be used to predict probability estimates for the given query examples
- create_score_predictor(feature_matrix, rule_model, label_space_info, num_labels) ScorePredictor ¶
Creates and returns a predictor that may be used to predict regression scores for given query examples. If the prediction of regression scores is not supported by the rule learner, a RuntimeError is thrown.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
rule_model – The RuleModel that should be used to obtain predictions
label_space_info – The LabelSpaceInfo that provides information about the label space that may be used as a basis for obtaining predictions
num_labels – The number of labels to predict for
- Returns:
A ScorePredictor that may be used to predict regression scores for the given query examples
- create_sparse_binary_predictor(feature_matrix, rule_model, label_space_info, marginal_probability_calibration_model, joint_probability_calibration_model, num_labels) SparseBinaryPredictor ¶
Creates and returns a predictor that may be used to predict sparse binary labels for given query examples. If the prediction of sparse binary labels is not supported by the rule learner, a RuntimeError is thrown.
- Parameters:
feature_matrix – A RowWiseFeatureMatrix that provides row-wise access to the feature values of the query examples
rule_model – The RuleModel that should be used to obtain predictions
label_space_info – The LabelSpaceInfo that provides information about the label space that may be used as a basis for obtaining predictions
marginal_probability_calibration_model – The MarginalProbabilityCalibrationModel that may be used for the calibration of marginal probabilities
joint_probability_calibration_model – The JointProbabilityCalibrationModel that may be used for the calibration of joint probabilities
num_labels – The number of labels to predict for
- Returns:
A SparseBinaryPredictor that may be used to predict sparse binary labels for the given query examples
- fit(feature_info, feature_matrix, label_matrix, random_state) TrainingResult ¶
Applies the rule learner to given training examples and corresponding ground truth labels.
- Parameters:
feature_info – A FeatureInfo that provides information about the types of individual features
feature_matrix – A ColumnWiseFeatureMatrix that provides column-wise access to the feature values of the training examples
label_matrix – A RowWiseLabelMatrix that provides row-wise access to the ground truth labels of the training examples
random_state – The seed to be used by random number generators
- Returns:
The TrainingResult that provides access to the result of fitting the rule learner to the training data
- class mlrl.common.cython.learner.SequentialPostOptimizationMixin¶
Bases:
ABC
Allows to configure a rule learner to use a post-optimization method that optimizes each rule in a model by relearning it in the context of the other rules.
- abstract use_sequential_post_optimization() SequentialPostOptimizationConfig ¶
Configures the rule learner to use a post-optimization method that optimizes each rule in a model by relearning it in the context of the other rules.
- Returns:
A SequentialPostOptimizationConfig that allows further configuration of the post-optimization method
- class mlrl.common.cython.learner.SequentialRuleModelAssemblageMixin¶
Bases:
ABC
Allows to configure a rule learner to use an algorithm that sequentially induces several rules.
- abstract use_sequential_rule_model_assemblage()¶
Configures the rule learner to use an algorithm that sequentially induces several rules, optionally starting with a default rule, that are added to a rule-based model.
- class mlrl.common.cython.learner.SizeStoppingCriterionMixin¶
Bases:
ABC
Allows to configure a rule learner to use a stopping criterion that ensures that the number of induced rules does not exceed a certain maximum.
- abstract use_size_stopping_criterion() SizeStoppingCriterionConfig ¶
Configures the rule learner to use a stopping criterion that ensures that the number of induced rules does not exceed a certain maximum.
- Returns:
A SizeStoppingCriterionConfig that allows further configuration of the stopping criterion
- class mlrl.common.cython.learner.TimeStoppingCriterionMixin¶
Bases:
ABC
Allows to configure a rule learner to use a stopping criterion that ensures that a certain time limit is not exceeded.
- use_time_stopping_criterion() TimeStoppingCriterionConfig ¶
Configures the rule learner to use a stopping criterion that ensures that a certain time limit is not exceeded.
- Returns:
A TimeStoppingCriterionConfig that allows further configuration of the stopping criterion
- class mlrl.common.cython.learner.TrainingResult¶
Bases:
object
Provides access to the results of fitting a rule learner to training data. It incorporates the model that has been trained, as well as additional information that is necessary for obtaining predictions for unseen data.
- joint_probability_calibration_model¶
- label_space_info¶
- marginal_probability_calibration_model¶
- num_labels¶
- rule_model¶