modeva.DataSet.get_X_y_data#

DataSet.get_X_y_data(dataset: str = 'main')#

Get the preprocessed data in the form of X, y, sample_weight.

Only active samples are returned. For extra data, no subsampling exists.

Parameters:

dataset ({"main", "train", "test"}, default="main") – The name of data split. It can also be other manually registered data split, if exists. Use the function get_data_list to check all available data splits.

Returns:

X (np.ndarray) – The given dataset’s X, will be None if not available.
y (np.ndarray) – The given dataset’s y, will be None if not available.
sample_weight (np.ndarray) – The given dataset’s sample_weight, will be None if not available.