Python: Getting a substring from a pandas DataFrame while filtering

Question

Asked by Eduardo

Say I have a dataframe with the following information:

Name    Points          String
John        24     FTS8500001A
Richard     35     FTS6700001B
John        29     FTS2500001A
Richard     35     FTS3800001B
John        34     FTS4500001A

Here is the way to get a DataFrame with the sample above:

import pandas as pd

# Column labels for the combined frame
keys = ('Name', 'Points', 'String')
names = pd.Series(('John', 'Richard', 'John', 'Richard', 'John'))
points = pd.Series((24, 35, 29, 35, 34))
strings = pd.Series(('FTS8500001A', 'FTS6700001B', 'FTS2500001A', 'FTS3800001B', 'FTS4500001A'))
# Place the three Series side by side and label each column with its key
df = pd.concat((names, points, strings), axis=1, keys=keys)

I want to select every row that meets the following criteria: Name=Richard and Points=35. For those rows I want to read the 4th and 5th characters of the String column (the two digits just after FTS).

The output I want is the numbers 67 and 38.

I've tried several ways to achieve this, but without success. Can you please help?

Thank you very much.
Eduardo

Answer 1

Accepted answer by EdChum

Use a boolean mask to filter your df, then call str and slice the string:

In [77]:
df.loc[(df['Name'] == 'Richard') & (df['Points']==35),'String'].str[3:5]

Out[77]:
1    67
3    38
Name: String, dtype: object
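
Note that the result above has dtype object, so the values are the strings '67' and '38' rather than numbers. The following is a minimal self-contained sketch of the same approach with an extra cast to integers; the variable names mask, codes and numbers are illustrative and not part of the original answer.

```python
import pandas as pd

# Rebuild the sample data from the question
df = pd.DataFrame({
    "Name": ["John", "Richard", "John", "Richard", "John"],
    "Points": [24, 35, 29, 35, 34],
    "String": ["FTS8500001A", "FTS6700001B", "FTS2500001A",
               "FTS3800001B", "FTS4500001A"],
})

# Boolean mask: both conditions must hold. The parentheses are required
# because & binds more tightly than ==.
mask = (df["Name"] == "Richard") & (df["Points"] == 35)

# Positions 3 and 4 (0-based) are the 4th and 5th characters
codes = df.loc[mask, "String"].str[3:5]

# .str slicing returns strings; cast if integers are needed
numbers = codes.astype(int)
print(numbers.tolist())
```

The original row labels (1 and 3) are preserved in both codes and numbers, so the values can still be aligned with the source rows.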

Answer 2

Answered by firelynx

Pandas string methods

You can build a mask for each of your criteria, combine them, and then use the pandas string methods:

mask_richard = df.Name == 'Richard'
mask_points = df.Points == 35
df[mask_richard & mask_points].String.str[3:5]

1    67
3    38
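
Slicing with str[3:5] is one of several pandas string methods that could be used here. As a hedged sketch, the block below shows two alternatives on the same filtered rows: str.slice, which is the method form of the same slice, and str.extract with a regular expression. The pattern assumes every value starts with the literal prefix FTS followed by two digits, which holds for the sample data but is an assumption about real data; the names selected, by_slice and by_regex are illustrative.

```python
import pandas as pd

# Rebuild the sample data from the question
df = pd.DataFrame({
    "Name": ["John", "Richard", "John", "Richard", "John"],
    "Points": [24, 35, 29, 35, 34],
    "String": ["FTS8500001A", "FTS6700001B", "FTS2500001A",
               "FTS3800001B", "FTS4500001A"],
})

# Keep only the rows matching both criteria
selected = df[(df["Name"] == "Richard") & (df["Points"] == 35)]

# Method form of str[3:5]: start at position 3, stop before position 5
by_slice = selected["String"].str.slice(3, 5)

# Regex form: capture the two digits right after the assumed "FTS" prefix.
# expand=False returns a Series because there is a single capture group.
by_regex = selected["String"].str.extract(r"^FTS(\d{2})", expand=False)

print(by_slice.tolist())
print(by_regex.tolist())
```

The positional slice is simpler when the layout is fixed-width; the regex version yields NaN for values that do not match the pattern, which can make malformed rows easier to spot.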
