modeva.models.MoNeuralTreeRegressor#
- class modeva.models.MoNeuralTreeRegressor(name: str = None, estimator: MoGLMTreeBoostRegressor = None, val_ratio=0.2, verbose=False, device=None, random_state=0, feature_names=None, nn_temperature=0.0001, nn_lr=0.0001, nn_max_epochs=200, nn_n_epoch_no_change=10, nn_batch_size=200, reg_mono=0.1, mono_sample_size=1000, mono_increasing_list=(), mono_decreasing_list=(), **kwargs)#
A neural network-based regression model that combines GLM trees with monotonicity constraints.
This model first fits a depth-1 boosted GLMTree, converts it to an equivalent neural network, and then fine-tunes the network parameters. It supports monotonicity constraints and provides interpretability through feature effects analysis.
- Parameters:
name (str, default=None) – Custom identifier for the model instance.
estimator (MoGLMTreeBoostRegressor, default=None) – Pre-fitted or unfitted GLMTree regressor for initialization. If None, creates a new instance.
feature_names (list of str, default=None) – Names of input features for interpretability.
val_ratio (float, default=0.2) – Proportion of data used for validation during training (0 to 1).
device (str, default=None) – Computing device for training (‘cpu’, ‘cuda’, etc.).
verbose (bool, default=False) – If True, prints training progress and statistics.
random_state (int, default=0) – Seed for reproducible random operations.
nn_temperature (float, default=0.0001) – Smoothing parameter for neural network activation.
nn_lr (float, default=0.001) – Learning rate for neural network optimization.
nn_max_epochs (int, default=200) – Maximum number of training epochs.
nn_batch_size (int, default=200) – Number of samples per training batch.
nn_n_epoch_no_change (int, default=10) – Early stopping patience - number of epochs without improvement.
reg_mono (float, default=0.1) – Strength of monotonicity regularization.
mono_sample_size (int, default=1000) – Number of random samples for monotonicity regularization.
mono_increasing_list (tuple of str, default=()) – Features that should have monotonically increasing relationships.
mono_decreasing_list (tuple of str, default=()) – Features that should have monotonically decreasing relationships.
**kwargs – Additional parameters passed to MoGLMTreeBoostRegressor.
- net_#
The internal Pytorch network object.
- Type:
object
- calibrate_interval(X, y, alpha=0.1, max_depth: int = 5)#
Fit a conformal prediction model to the given data.
This method computes the model’s prediction interval calibrated to the given data.
If the model is a regressor, splits the data with 50% for fitting lower (alpha / 5) and upper (1 - alpha / 2) gradient boosting trees-based quantile regression to the model’s residual; and 50% for calibration.
If the model is a binary classifiers, it computes the calibration quantile based on predicted probabilities for the positive class.
- Parameters:
X (X : np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.
y (array-like of shape (n_samples, )) – Target values.
alpha (float, default=0.1) – Expected miscoverage for the conformal prediction.
max_depth (int, default=5) – Maximum depth of the gradient boosting trees for regression tasks. Only used when task_type is REGRESSION.
- Raises:
ValueError – If the model is neither a regressor nor a classifier.:
- get_params(deep=True)#
Get parameters for this estimator. :param deep: If True, will return the parameters for this estimator and
contained subobjects that are estimators.
- Returns:
params – Parameter names mapped to their values.
- Return type:
mapping of string to any
- load(file_name: str)#
Load the model into memory from file system.
- Parameters:
file_name (str) – The path and name of the file.
- Return type:
estimator object
- predict(X)#
Model predictions, calling the child class’s ‘_predict’ method.
- Parameters:
X (np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.
- Returns:
np.ndarray
- Return type:
The (calibrated) final prediction
- predict_interval(X)#
Predict the prediction interval for the given data based on the conformal prediction model.
It splits the data with 50% for fitting lower (alpha / 5) and upper (1 - alpha / 2) gradient boosting trees-based quantile regression to the model’s residual; and 50% for calibration.
- Parameters:
X (np.ndarray of shape (n_samples, n_features)) – Feature matrix for prediction.
- Returns:
np.ndarray – in the format [n_samples, 2] for regressors or a flattened array for classifiers.
- Return type:
The lower and upper bounds of the prediction intervals for each sample
- Raises:
ValueError – If fit_conformal has not been called to fit the conformal prediction model: before calling this method.
- save(file_name: str)#
Save the model into file system.
- Parameters:
file_name (str) – The path and name of the file.
- set_params(**params)#
Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as
Pipeline
). The latter have parameters of the form<component>__<parameter>
so that it’s possible to update each component of a nested object.- Parameters:
**params (dict) – Estimator parameters.
- Returns:
self – Estimator instance.
- Return type:
estimator instance