Python: using len() on a Pandas DataFrame

Question

Asked by Dong

This is what my DataFrame looks like:

   StateAb    GivenNm    Surname                  PartyNm PartyAb  ElectedOrder
35      WA        Joe    BULLOCK   Australian Labor Party     ALP             2
36      WA  Michaelia       CASH                  Liberal      LP             3
37      WA      Linda   REYNOLDS                  Liberal      LP             4
38      WA      Wayne  DROPULICH  Australian Sports Party    SPRT             5
39      WA      Scott     LUDLAM          The Greens (WA)     GRN             6

I want to list the senators whose surname is more than 9 characters long.

So I think the code should look like this:

df[len(df.Surname) > 9]

But this raises a KeyError. Where did I go wrong?

Answer 1

Answered by ayhan

The correct way to filter a DataFrame based on the length of the strings in a column is:

df[df['Surname'].str.len() > 9]

df['Surname'].str.len() creates a Series holding the length of each surname, and df[df['Surname'].str.len() > 9] keeps only the rows where that length is greater than 9. What you did instead was check the length of the Series itself (how many rows it has), so len(df.Surname) > 9 evaluates to a single True or False. pandas then looks for a column with that name, which raises the KeyError.
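As a rough illustration of the difference, here is a minimal sketch. The frame is hand-built from the surnames shown in the question plus one made-up long surname ("FITZWILLIAM", which is not in the original data), because none of the five rows shown exceeds 9 characters. The names row_count_check, raised_key_error, lengths and long_surnames are only for this example.

```python
import pandas as pd

# Surnames from the question, plus one hypothetical row ("FITZWILLIAM")
# added only so that the filter has something to return.
df = pd.DataFrame(
    {"Surname": ["BULLOCK", "CASH", "REYNOLDS", "DROPULICH", "LUDLAM", "FITZWILLIAM"]}
)

# len() on a Series counts rows, so this comparison is one plain bool.
# Here it is 6 > 9, i.e. False; on a longer table it would be True.
row_count_check = len(df.Surname) > 9

# Indexing with a single bool is treated as a column lookup, hence KeyError.
try:
    df[row_count_check]
    raised_key_error = False
except KeyError:
    raised_key_error = True

# .str.len() measures every string and returns one length per row.
lengths = df["Surname"].str.len()

# Comparing that Series with 9 gives a boolean mask usable for filtering.
long_surnames = df[lengths > 9]

print(raised_key_error)
print(lengths.tolist())
print(long_surnames)
```

The key point is that the mask passed to df[...] needs one boolean per row, which is what the .str accessor provides and what the built-in len() does not.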

Answer 2

Answered by Sytse Reitsma

Have a look at Python's built-in filter function. It does exactly what you want.

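# Note: despite its name, df here is a plain list of dicts, not a DataFrame
df = [
    {"Surname": "Bullock-ish"},
    {"Surname": "Cash"},
    {"Surname": "Reynolds"},
]
# Keep only the entries whose surname is longer than 9 characters
longnames = list(filter(lambda s: len(s["Surname"]) > 9, df))
print(longnames)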

>>[{'Surname': 'Bullock-ish'}]
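If the data starts out as a pandas DataFrame rather than a list of dicts, the same filter call could be applied after converting the rows to dicts. The sketch below assumes a small hypothetical frame (the surname "FITZWILLIAM" and the party code "XX" are made up) and shows the boolean-mask form next to it for comparison.

```python
import pandas as pd

# Hypothetical frame; "FITZWILLIAM" and "XX" are invented for the example.
frame = pd.DataFrame(
    {
        "Surname": ["CASH", "DROPULICH", "FITZWILLIAM"],
        "PartyAb": ["LP", "SPRT", "XX"],
    }
)

# One dict per row, e.g. {"Surname": "CASH", "PartyAb": "LP"}
records = frame.to_dict("records")

# filter() keeps the row dicts whose surname has more than 9 characters.
long_records = list(filter(lambda row: len(row["Surname"]) > 9, records))

# Boolean-mask version that stays inside pandas and returns a DataFrame.
long_frame = frame[frame["Surname"].str.len() > 9]

print(long_records)
print(long_frame)
```

The filter route returns plain Python dicts, while the mask returns a DataFrame with the original index, so which one fits depends on what the result is used for next.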

Sytse
