modeva.DataSet.get_X_y_data#
- DataSet.get_X_y_data(dataset: str = 'main')#
Get the preprocessed data in the form of X, y, sample_weight.
Only active samples are returned. For extra data, no subsampling exists.
- Parameters:
dataset ({"main", "train", "test"}, default="main") – The name of data split. It can also be other manually registered data split, if exists. Use the function get_data_list to check all available data splits.
- Returns:
X (np.ndarray) – The given dataset’s X, will be None if not available.
y (np.ndarray) – The given dataset’s y, will be None if not available.
sample_weight (np.ndarray) – The given dataset’s sample_weight, will be None if not available.