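The pandas factorize() method produces a numeric representation of an array by identifying its distinct values and assigning each one an integer label. It is available both as the top-level function pandas.factorize() and as the method Series.factorize().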
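As a minimal sketch of the two entry points mentioned above, the snippet below factorizes the same data once through the function and once through the method. The sample data and variable names are assumptions made for illustration only.

```python
import numpy as np
import pandas as pd

# Illustrative data; the names used here are assumptions for this sketch.
values = pd.Series(['b', 'a', 'b', 'c'])

# Top-level function form. Given a Series, uniques comes back as an Index.
codes_func, uniques_func = pd.factorize(values)

# Method form, called on the Series itself.
codes_meth, uniques_meth = values.factorize()

print(codes_func.tolist())    # [0, 1, 0, 2]
print(list(uniques_func))     # ['b', 'a', 'c']

# Both forms assign the same labels to the same data.
print(np.array_equal(codes_func, codes_meth))  # True
```

Labels are assigned in order of first appearance, so 'b' gets 0 even though it sorts after 'a'.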
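Parameters:
values :1D sequence.
sort :[bool, default False] Sort the uniques and shuffle the labels so that the mapping between them is preserved.
na_sentinel :[int, default -1] Value used to mark missing ('not found') entries in the labels. This parameter was deprecated in pandas 1.5 and removed in pandas 2.0 in favor of use_na_sentinel.
Returns: A tuple (labels, uniques). labels is an integer array giving the numeric representation of the input; uniques holds the distinct values that the labels index into.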
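To make the return value and the sort parameter concrete, here is a small sketch under assumed sample data. It shows that the labels index into the uniques, so indexing uniques with the labels rebuilds the input when there are no missing values.

```python
import numpy as np
import pandas as pd

# Illustrative input; an object ndarray is used so uniques is also an ndarray.
values = np.array(['b', 'd', 'a', 'd'], dtype=object)

# Default: uniques are listed in order of first appearance.
codes, uniques = pd.factorize(values)
print(codes.tolist(), list(uniques))    # [0, 1, 2, 1] ['b', 'd', 'a']

# sort=True: uniques are sorted and the labels are remapped to match.
codes_sorted, uniques_sorted = pd.factorize(values, sort=True)
print(codes_sorted.tolist(), list(uniques_sorted))    # [1, 2, 0, 2] ['a', 'b', 'd']

# With no missing values present, uniques[codes] recovers the original data.
restored = uniques[codes]
print(list(restored) == list(values))    # True
```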
Code: Demonstrating the use of the factorize() method
# importing libraries
import numpy as np
import pandas as pd
from pandas.api.types import CategoricalDtype
labels, uniques = pd.factorize(['b', 'd', 'd', 'c', 'a', 'c', 'a', 'b'])
print("Numeric Representation:\n", labels)
print("Unique Values:\n", uniques)
# sorting the uniques; the labels are remapped to match
label1, unique1 = pd.factorize(['b', 'd', 'd', 'c', 'a', 'c', 'a', 'b'],
                               sort=True)
print("\n\nNumeric Representation:\n", label1)
print("Unique Values:\n", unique1)
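# Missing values are marked with a custom sentinel.
# Note: na_sentinel was deprecated in pandas 1.5 and removed in 2.0,
# so this call requires pandas < 2.0.
label2, unique2 = pd.factorize(['b', None, 'd', 'c', None, 'a'],
                               na_sentinel=-101)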
print("\n\nNumeric Representation:\n", label2)
print("Unique Values:\n", unique2)
# When factorizing a pandas object, the type of uniques follows the input
# (here a Categorical rather than a NumPy array)
a = pd.Categorical(['a', 'a', 'c'], categories=['a', 'b', 'c'])
label3, unique3 = pd.factorize(a)
print("\n\nNumeric Representation:\n", label3)
print("Unique Values:\n", unique3)
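Because na_sentinel is no longer accepted by pandas 2.0 and later, the missing-value example can be approximated on current versions as sketched below. This assumes pandas 1.5 or newer (for use_na_sentinel). The np.where remapping is a workaround chosen for this illustration, not part of the factorize() API.

```python
import numpy as np
import pandas as pd

# Illustrative input containing missing values.
values = np.array(['b', None, 'd', None, 'a'], dtype=object)

# Default behaviour: missing values get the label -1 and are left out of uniques.
codes, uniques = pd.factorize(values)
print(codes.tolist(), list(uniques))    # [0, -1, 1, -1, 2] ['b', 'd', 'a']

# Workaround (an assumption of this sketch): remap -1 to a custom sentinel.
custom_codes = np.where(codes == -1, -101, codes)
print(custom_codes.tolist())    # [0, -101, 1, -101, 2]

# use_na_sentinel=False gives the missing value its own non-negative label
# and includes it in uniques.
codes_na, uniques_na = pd.factorize(values, use_na_sentinel=False)
print(codes_na, uniques_na)
```

With the default sentinel, uniques[codes] would silently pick the last unique for missing entries, so mask on codes == -1 before indexing.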
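Note: This article was selected and compiled by 純淨天空 from the original English work "Python | Pandas.factorize()" by Mohit Gupta_OMG. Unless otherwise stated, copyright in the original code belongs to the original author. Do not reproduce or copy this translation without permission or authorization.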