Python 熊猫从字符串中提取数字

Question

提问by Dance Party

Given the following data frame:

给定以下数据框：

import pandas as pd
import numpy as np
df = pd.DataFrame({'A':['1a',np.nan,'10a','100b','0b'],
                   })
df

    A
0   1a
1   NaN
2   10a
3   100b
4   0b

I'd like to extract the numbers from each cell (where they exist). The desired result is:

我想从每个单元格（它们存在的地方）中提取数字。想要的结果是：

I know it can be done with str.extract, but I'm not sure how.

我知道它可以用来完成str.extract，但我不确定如何。

Answer 1

回答by Jon Clements

Give it a regex capture group:

给它一个正则表达式捕获组：

df.A.str.extract('(\d+)')

Gives you:

给你：

0      1
1    NaN
2     10
3    100
4      0
Name: A, dtype: object

Answer 2

回答by Taming

To answer @Steven G 's question in the comment above, this should work:

要在上面的评论中回答@Steven G 的问题，这应该有效：

df.A.str.extract('(^\d*)')

Python 熊猫从字符串中提取数字

提问by Dance Party

回答by Jon Clements

回答by Taming

相关推荐

最近更新

标签

Python 熊猫从字符串中提取数字

提问by Dance Party

回答by Jon Clements

回答by Taming

相关推荐

python请求带有标题和参数的POST

Python 如何在 keras 中实现自定义指标？

如何使用自制软件在 macOS 中安装以前版本的 Python 3？

Python 获取命令行参数作为字符串

相关推荐

最近更新

标签