Python Pandas Index.drop_duplicates()用法及代码示例

Python是进行数据分析的一种出色语言，主要是因为以数据为中心的python软件包具有奇妙的生态系统。 Pandas是其中的一种，使导入和分析数据更加容易。

Pandas Index.drop_duplicates()函数返回删除重复值的索引。该函数提供了选择保留哪些重复值的灵活性。我们可以从列表中删除所有重复值，或者保留重复值的第一个/最后一次出现。

用法： Index.drop_duplicates(labels, errors=’raise’)

参数:
keep:{'first'，'last'，False}，默认为'first'
->‘first’：除第一个匹配项外，删除重复项。
->‘last’：除去最后一次出现的重复项。
->False：丢弃所有重复项。

返回:重复数据删除：索引

范例1：采用Index.drop_duplicates()函数删除除第一次出现以外的所有重复值出现。

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11]) 
  
# Print the Index 
idx

输出：

让我们在索引中删除所有重复值出现的项目，但第一次出现的除外。

# drop all duplicate occurrences of the 
# labels and keep the first occurrence 
idx.drop_duplicates(keep ='first')

输出：

正如我们在输出中看到的，Index.drop_duplicate()函数已删除索引中重复出现的标签。

范例2：采用Index.drop_duplicate()函数删除所有重复出现的标签。不要在索引中保留任何重复的值。

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index([10, 11, 5, 5, 22, 5, 3, 11]) 
  
# Print the Index 
idx

输出：

让我们将所有重复值都出现在索引中。

# drop all duplicate occurrences of the labels 
idx.drop_duplicates(keep = False)

输出：

正如我们在输出中看到的，所有重复值都已从索引中删除。

相关用法

注：本文由纯净天空筛选整理自Shubham__Ranjan大神的英文原创作品 Python | Pandas Index.drop_duplicates()。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。