Python Pandas Index.drop_duplicates()用法及代碼示例

Python是進行數據分析的一種出色語言，主要是因為以數據為中心的python軟件包具有奇妙的生態係統。 Pandas是其中的一種，使導入和分析數據更加容易。

Pandas Index.drop_duplicates()函數返回刪除重複值的索引。該函數提供了選擇保留哪些重複值的靈活性。我們可以從列表中刪除所有重複值，或者保留重複值的第一個/最後一次出現。

用法： Index.drop_duplicates(labels, errors=’raise’)

參數:
keep:{'first'，'last'，False}，默認為'first'
->‘first’：除第一個匹配項外，刪除重複項。
->‘last’：除去最後一次出現的重複項。
->False：丟棄所有重複項。

返回:重複數據刪除：索引

範例1：采用Index.drop_duplicates()函數刪除除第一次出現以外的所有重複值出現。

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11]) 
  
# Print the Index 
idx

輸出：

讓我們在索引中刪除所有重複值出現的項目，但第一次出現的除外。

# drop all duplicate occurrences of the 
# labels and keep the first occurrence 
idx.drop_duplicates(keep ='first')

輸出：

正如我們在輸出中看到的，Index.drop_duplicate()函數已刪除索引中重複出現的標簽。

範例2：采用Index.drop_duplicate()函數刪除所有重複出現的標簽。不要在索引中保留任何重複的值。

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11]) 
  
# Print the Index 
idx

輸出：

讓我們將所有重複值都出現在索引中。

# drop all duplicate occurrences of the labels 
idx.drop_duplicates(keep = False)

輸出：

正如我們在輸出中看到的，所有重複值都已從索引中刪除。

相關用法

注：本文由純淨天空篩選整理自Shubham__Ranjan大神的英文原創作品 Python | Pandas Index.drop_duplicates()。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。