Python cuml.datasets.make_blobs用法及代碼示例

用法: cuml.datasets.make_blobs(n_samples=100, n_features=2, centers=None, cluster_std=1.0, center_box=(- 10.0, 10.0), shuffle=True, random_state=None, return_centers=False, order='F', dtype='float32')

生成用於聚類的各向同性高斯斑點。

參數：

n_samples：int 或array-like，可選(默認=100): 如果是 int，則它是在集群之間平均分配的點的總數。如果array-like，則序列的每個元素表示每個簇的樣本數。
n_features：int，可選(默認=2): 每個樣本的特征數。
centers：int 或形狀數組 [n_centers , n_features]，可選: (默認=無)要生成的中心數，或固定的中心位置。如果 n_samples 是 int 並且中心是 None，則生成 3 個中心。如果 n_samples 是 array-like，則中心必須是 None 或長度等於 n_samples 長度的數組。
cluster_std：浮點數或浮點數序列，可選(默認=1.0): 聚類的標準差。
center_box：一對浮點數(最小，最大)，可選(默認=(-10.0，10.0)): 隨機生成中心時每個聚類中心的邊界框。
shuffle：布爾值，可選(默認 = True): Shuffle[洗牌]樣本。
random_state：int，RandomState 實例，默認=None: 確定數據集創建的隨機數生成。跨多個函數調用傳遞一個 int 以實現可重現的輸出。
return_centers：布爾，可選(默認=假): 如果為真，則返回每個聚類的中心
order: str, optional (default=’F’)：: 生成樣本的順序
dtype：str，可選(默認='float32'): 生成樣本的 Dtype

X：形狀為 [n_samples, n_features] 的設備數組: 生成的樣本。
y：形狀為 [n_samples] 的設備數組: 每個樣本的集群成員的整數標簽。
centers：設備數組，形狀 [n_centers, n_features]: 每個集群的中心。僅在 return_centers=True 時返回。

例子：

>>> from sklearn.datasets import make_blobs
>>> X, y = make_blobs(n_samples=10, centers=3, n_features=2,
...                   random_state=0)
>>> print(X.shape)
(10, 2)
>>> y
array([0, 0, 1, 0, 2, 2, 2, 1, 1, 0])
>>> X, y = make_blobs(n_samples=[3, 3, 4], centers=None, n_features=2,
...                   random_state=0)
>>> print(X.shape)
(10, 2)
>>> y
array([0, 1, 2, 0, 2, 2, 2, 1, 1, 0])

相關用法

注：本文由純淨天空篩選整理自rapids.ai大神的英文原創作品 cuml.datasets.make_blobs。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。

用法:

參數：

返回：

例子：