
Pandas DataFrame drop_duplicates() Method



Example

Remove duplicate rows from the DataFrame:

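import pandas as pd

# The second and fourth rows ("Mary", 40, False) are identical
data = {
  "name": ["Sally", "Mary", "John", "Mary"],
  "age": [50, 40, 30, 40],
  "qualified": [True, False, False, False]
}

df = pd.DataFrame(data)

# Returns a new DataFrame; the original df is left unchanged
newdf = df.drop_duplicates()

print(newdf)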

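Definition and Usage

The drop_duplicates() method removes duplicate rows.

Use the subset parameter if only some specified columns should be considered when looking for duplicates.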
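As a minimal sketch of how subset changes the result, the example below uses hypothetical sample data (not from the example above) in which "Mary" appears twice with different ages. With all columns compared, no row is a duplicate; with only "name" compared, the second "Mary" row is removed.

```python
import pandas as pd

# Hypothetical sample data: "Mary" appears twice, but with different ages
df = pd.DataFrame({
  "name": ["Sally", "Mary", "John", "Mary"],
  "age": [50, 40, 30, 45]
})

# All columns are compared: no two rows are identical, so nothing is removed
all_columns = df.drop_duplicates()

# Only "name" is compared: the second "Mary" row counts as a duplicate
by_name = df.drop_duplicates(subset=["name"])

print(len(all_columns))
print(by_name)
```

Note that by default the first matching row is the one that is kept, so the "Mary" row with age 40 stays and the one with age 45 is dropped.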


Syntax

dataframe.drop_duplicates(subset, keep, inplace, ignore_index)

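Parameters

The parameters are keyword arguments.

Parameter     Value                     Description
subset        column label(s)           Optional. A string, or a list, containing the columns to use when looking for duplicates. If not specified, all columns are used.
keep          'first', 'last', False    Optional, default 'first'. Specifies which duplicate to keep: 'first' keeps the first occurrence, 'last' keeps the last occurrence. If False, ALL duplicates are dropped.
inplace       True, False               Optional, default False. If True: the removal is done on the current DataFrame. If False: returns a copy where the removal is done.
ignore_index  True, False               Optional, default False. Specifies whether to relabel the index as 0, 1, 2, and so on (True), or to keep the original index labels (False).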
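The effect of keep and ignore_index is easiest to see on the index labels that remain. The sketch below reuses the data from the example at the top of this page, where rows 1 and 3 are identical; the variable names are only illustrative.

```python
import pandas as pd

df = pd.DataFrame({
  "name": ["Sally", "Mary", "John", "Mary"],
  "age": [50, 40, 30, 40],
  "qualified": [True, False, False, False]
})

# Rows 1 and 3 are duplicates of each other
keep_first = df.drop_duplicates(keep="first")  # keeps row 1, drops row 3
keep_last = df.drop_duplicates(keep="last")    # drops row 1, keeps row 3
keep_none = df.drop_duplicates(keep=False)     # drops both row 1 and row 3

# ignore_index=True relabels the remaining rows 0, 1, 2
renumbered = df.drop_duplicates(keep="last", ignore_index=True)

print(keep_first.index.tolist())  # [0, 1, 2]
print(keep_last.index.tolist())   # [0, 2, 3]
print(keep_none.index.tolist())   # [0, 2]
print(renumbered.index.tolist())  # [0, 1, 2]
```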

Return Value

A DataFrame with the result, or None if the inplace parameter is set to True.
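A short sketch of the two return behaviours, using a reduced version of the sample data (the variable names are illustrative). A common mistake is to assign the result of a call with inplace=True, which stores None instead of a DataFrame.

```python
import pandas as pd

df = pd.DataFrame({
  "name": ["Sally", "Mary", "John", "Mary"],
  "age": [50, 40, 30, 40]
})

# Default (inplace=False): a new DataFrame is returned, df is unchanged
copy_result = df.drop_duplicates()
rows_after_copy = len(df)

# inplace=True: df itself is modified and the method returns None
inplace_result = df.drop_duplicates(inplace=True)
rows_after_inplace = len(df)

print(rows_after_copy)     # 4
print(inplace_result)      # None
print(rows_after_inplace)  # 3
```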


