Using boolean masks in Pandas

Question

Asked by elksie5000

This is probably a trivial query but I can't work it out.

Essentially, I want to filter noisy tweets out of the dataframe below:

<class 'pandas.core.frame.DataFrame'>
Int64Index: 140381 entries, 0 to 140380
Data columns:
text          140381  non-null values
created_at    140381  non-null values
id            140381  non-null values
from_user     140381  non-null values
geo           5493  non-null values
dtypes: float64(1), object(4)

I can create a dataframe based on unwanted keywords thus:

junk = df[df.text.str.contains("Swans")]

But what's the best way to use this to see what's left?

Answer 1

Answered by waitingkuo

df[~df.text.str.contains("Swans")]
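A minimal sketch of why this works, using a small made-up dataframe (the tweets and user names below are invented for illustration and are not the question's data): `str.contains` returns a boolean Series, and `~` flips each value, so indexing with the flipped mask keeps only the rows that do not match.

```python
import pandas as pd

# Hypothetical sample data standing in for the 140k-tweet dataframe.
df = pd.DataFrame({
    "text": [
        "Swans win again",
        "Lovely weather today",
        "Go Swans!",
        "Traffic on the M4",
    ],
    "from_user": ["a", "b", "c", "d"],
})

# True where the tweet text contains the unwanted keyword.
is_junk = df.text.str.contains("Swans")

junk = df[is_junk]    # rows that match the keyword
kept = df[~is_junk]   # rows that do not match

print(kept.text.tolist())
```

Storing the mask in a variable (here `is_junk`, a name chosen for this sketch) avoids running the string search twice when both the junk and the remainder are needed.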

Answer 2

Answered by Mohamed Ali JAMAOUI

You can also use the following two options:

option 1:

df[-df.text.str.contains("Swans")]  # note: ~ is the usual inversion operator; plain NumPy boolean arrays reject unary minus

option 2:

import numpy as np 
df[np.invert(df.text.str.contains("Swans"))]
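As a quick check, and assuming a small invented dataframe rather than the original tweet data, the sketch below compares the `np.invert` form with the `~` form from the first answer. On a boolean mask both should select the same rows.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; only the "text" column matters here.
df = pd.DataFrame(
    {"text": ["Swans win again", "Lovely weather today", "Go Swans!"]}
)

mask = df.text.str.contains("Swans")

with_tilde = df[~mask]              # operator form
with_invert = df[np.invert(mask)]   # NumPy ufunc form

# Both selections contain the same rows in the same order.
same = with_tilde.equals(with_invert)
print(same)
```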
