Python cuml.dask.preprocessing.LabelBinarizer用法及代码示例

用法:
class cuml.dask.preprocessing.LabelBinarizer(*, client=None, **kwargs)

LabelBinarizer 的分布式版本，用于 one-hot 编码标签集合。

例子：

创建一个带有标签的数组并对它们进行虚拟编码

import cupy as cp
import cupyx
from cuml.dask.preprocessing import LabelBinarizer

from dask_cuda import LocalCUDACluster
from dask.distributed import Client
import dask

cluster = LocalCUDACluster()
client = Client(cluster)

labels = cp.asarray([0, 5, 10, 7, 2, 4, 1, 0, 0, 4, 3, 2, 1],
                    dtype=cp.int32)
labels = dask.array.from_array(labels)

lb = LabelBinarizer()

encoded = lb.fit_transform(labels)

print(str(encoded.compute())

decoded = lb.inverse_transform(encoded)

print(str(decoded.compute())

输出：

[[1 0 0 0 0 0 0 0]
 [0 0 0 0 0 1 0 0]
 [0 0 0 0 0 0 0 1]
 [0 0 0 0 0 0 1 0]
 [0 0 1 0 0 0 0 0]
 [0 0 0 0 1 0 0 0]
 [0 1 0 0 0 0 0 0]
 [1 0 0 0 0 0 0 0]
 [1 0 0 0 0 0 0 0]
 [0 0 0 0 1 0 0 0]
 [0 0 0 1 0 0 0 0]
 [0 0 1 0 0 0 0 0]
 [0 1 0 0 0 0 0 0]]

 [ 0  5 10  7  2  4  1  0  0  4  3  2  1]

相关用法

注：本文由纯净天空筛选整理自rapids.ai大神的英文原创作品 cuml.dask.preprocessing.LabelBinarizer。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。