modeva.models.MoNeuralTreeRegressor#

class modeva.models.MoNeuralTreeRegressor(name: str = None, estimator: MoGLMTreeBoostRegressor = None, val_ratio=0.2, verbose=False, device=None, random_state=0, feature_names=None, nn_temperature=0.0001, nn_lr=0.0001, nn_max_epochs=200, nn_n_epoch_no_change=10, nn_batch_size=200, reg_mono=0.1, mono_sample_size=1000, mono_increasing_list=(), mono_decreasing_list=(), **kwargs)#

A neural network-based regression model that combines GLM trees with monotonicity constraints.

This model first fits a depth-1 boosted GLMTree, converts it to an equivalent neural network, and then fine-tunes the network parameters. It supports monotonicity constraints and provides interpretability through feature effects analysis.

Parameters:
  • name (str, default=None) – Custom identifier for the model instance.

  • estimator (MoGLMTreeBoostRegressor, default=None) – Pre-fitted or unfitted GLMTree regressor for initialization. If None, creates a new instance.

  • feature_names (list of str, default=None) – Names of input features for interpretability.

  • val_ratio (float, default=0.2) – Proportion of data used for validation during training (0 to 1).

  • device (str, default=None) – Computing device for training (‘cpu’, ‘cuda’, etc.).

  • verbose (bool, default=False) – If True, prints training progress and statistics.

  • random_state (int, default=0) – Seed for reproducible random operations.

  • nn_temperature (float, default=0.0001) – Smoothing parameter for neural network activation.

  • nn_lr (float, default=0.001) – Learning rate for neural network optimization.

  • nn_max_epochs (int, default=200) – Maximum number of training epochs.

  • nn_batch_size (int, default=200) – Number of samples per training batch.

  • nn_n_epoch_no_change (int, default=10) – Early stopping patience - number of epochs without improvement.

  • reg_mono (float, default=0.1) – Strength of monotonicity regularization.

  • mono_sample_size (int, default=1000) – Number of random samples for monotonicity regularization.

  • mono_increasing_list (tuple of str, default=()) – Features that should have monotonically increasing relationships.

  • mono_decreasing_list (tuple of str, default=()) – Features that should have monotonically decreasing relationships.

  • **kwargs – Additional parameters passed to MoGLMTreeBoostRegressor.

net_#

The internal Pytorch network object.

Type:

object

calibrate_interval(X, y, alpha=0.1, max_depth: int = 5)#

Fit a conformal prediction model to the given data.

This method computes the model’s prediction interval calibrated to the given data.

If the model is a regressor, splits the data with 50% for fitting lower (alpha / 5) and upper (1 - alpha / 2) gradient boosting trees-based quantile regression to the model’s residual; and 50% for calibration.

If the model is a binary classifiers, it computes the calibration quantile based on predicted probabilities for the positive class.

Parameters:
  • X (X : np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.

  • y (array-like of shape (n_samples, )) – Target values.

  • alpha (float, default=0.1) – Expected miscoverage for the conformal prediction.

  • max_depth (int, default=5) – Maximum depth of the gradient boosting trees for regression tasks. Only used when task_type is REGRESSION.

Raises:

ValueError – If the model is neither a regressor nor a classifier.:

get_params(deep=True)#

Get parameters for this estimator. :param deep: If True, will return the parameters for this estimator and

contained subobjects that are estimators.

Returns:

params – Parameter names mapped to their values.

Return type:

mapping of string to any

load(file_name: str)#

Load the model into memory from file system.

Parameters:

file_name (str) – The path and name of the file.

Return type:

estimator object

predict(X)#

Model predictions, calling the child class’s ‘_predict’ method.

Parameters:

X (np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.

Returns:

np.ndarray

Return type:

The (calibrated) final prediction

predict_interval(X)#

Predict the prediction interval for the given data based on the conformal prediction model.

It splits the data with 50% for fitting lower (alpha / 5) and upper (1 - alpha / 2) gradient boosting trees-based quantile regression to the model’s residual; and 50% for calibration.

Parameters:

X (np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.

Returns:

np.ndarray – in the format [n_samples, 2] for regressors or a flattened array for classifiers.

Return type:

The lower and upper bounds of the prediction intervals for each sample

Raises:

ValueError – If fit_conformal has not been called to fit the conformal prediction model: before calling this method.

save(file_name: str)#

Save the model into file system.

Parameters:

file_name (str) – The path and name of the file.

set_params(**params)#

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters:

**params (dict) – Estimator parameters.

Returns:

self – Estimator instance.

Return type:

estimator instance