Python sklearn PredefinedSplit用法及代碼示例

本文簡要介紹python語言中 sklearn.model_selection.PredefinedSplit 的用法。

用法: class sklearn.model_selection.PredefinedSplit(test_fold)

預定義拆分cross-validator

提供訓練/測試索引，使用用戶使用test_fold 參數指定的預定義方案將數據拆分為訓練/測試集。

在用戶指南中閱讀更多信息。

參數：

test_fold：形狀類似數組 (n_samples,): 條目test_fold[i] 表示樣本i 所屬的測試集的索引。通過將test_fold[i]設置為-1，可以從任何測試集中排除樣本i(即在每個訓練集中包括樣本i)。

例子：

>>> import numpy as np
>>> from sklearn.model_selection import PredefinedSplit
>>> X = np.array([[1, 2], [3, 4], [1, 2], [3, 4]])
>>> y = np.array([0, 0, 1, 1])
>>> test_fold = [0, 1, -1, 1]
>>> ps = PredefinedSplit(test_fold)
>>> ps.get_n_splits()
2
>>> print(ps)
PredefinedSplit(test_fold=array([ 0,  1, -1,  1]))
>>> for train_index, test_index in ps.split():
...     print("TRAIN:", train_index, "TEST:", test_index)
...     X_train, X_test = X[train_index], X[test_index]
...     y_train, y_test = y[train_index], y[test_index]
TRAIN: [1 2 3] TEST: [0]
TRAIN: [0 2] TEST: [1 3]

相關用法

注：本文由純淨天空篩選整理自scikit-learn.org大神的英文原創作品 sklearn.model_selection.PredefinedSplit。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。