pandas 如何通过不包含子字符串的单元格过滤熊猫数据框？

Question

提问by bpr

I want to filter a dataframe to find rows which do not contain the string 'site'.

我想过滤数据框以查找不包含字符串“站点”的行。

I know how to filter for rows which do contain 'site' but have not been able to get the reverse working. Here is what I have so far:

我知道如何过滤包含“站点”但无法反向工作的行。这是我到目前为止所拥有的：

def rbs(): #removes blocked sites
    frame = fill_rate()
    mask = frame[frame['Media'].str.contains('Site')==True]
    frame = (frame != mask)
    return frame

But this returns an error, of course.

但这当然会返回错误。

Answer 1

回答by EdChum

Just do frame[~frame['Media'].str.contains('Site')]

做就是了 frame[~frame['Media'].str.contains('Site')]

The ~negates the boolean condition

在~否定了布尔条件

So your method becomes:

所以你的方法变成：

def rbs(): #removes blocked sites
    frame = fill_rate()
    return frame[~frame['Media'].str.contains('Site')]

EDIT

编辑

it looks like you have NaNvalues judging by your errors so you have to filter these out first so your method becomes:

看起来你有NaN根据你的错误判断的值，所以你必须先过滤掉这些值，这样你的方法就变成了：

def rbs(): #removes blocked sites
    frame = fill_rate()
    frame = frame[frame['Media'].notnull()]
    return frame[~frame['Media'].str.contains('Site')]

the notnullwill filter out the missing values

在notnull将筛选出的遗漏值

pandas 如何通过不包含子字符串的单元格过滤熊猫数据框？

提问by bpr

回答by EdChum

相关推荐

最近更新

标签

pandas 如何通过不包含子字符串的单元格过滤熊猫数据框？

提问by bpr

回答by EdChum

相关推荐

Pandas 错误 - 遇到无效值

从通过 Pandas 创建的 html 表中删除边框

Python Pandas 时间序列插值和正则化

pandas 熊猫中的 NoneType 对象不是可迭代的错误

相关推荐

最近更新

标签