pandas 替换熊猫数据框中的字符串

Question

提问by DJK

I have a dataframe with multiple columns. I want to look at one column and if any of the strings in the column contain @, I want to replace them with another string. How would I go about doing this?

我有一个包含多列的数据框。我想查看一列，如果该列中的任何字符串包含@，我想用另一个字符串替换它们。我该怎么做呢？

Answer 1

回答by Bill Harper

A dataframe in pandas is composed of columns which are series - Panda docs link

Pandas 中的数据框由系列的列组成 - Panda docs link

I'm going to use regex, because it's useful and everyone needs practice, myself included! Panda docs for text manipulation

我将使用正则表达式，因为它很有用，每个人都需要练习，包括我自己！用于文本操作的 Panda 文档

Note the str.replace. The regexstring you want is this (it worked for me): '.*@+.*' which says "any character (.) zero or more times (*), followed by an @ 1 or more times (+) followed by any character (.) zero or more times (*)

注意 str.replace。您想要的正则表达式字符串是这样的（它对我有用）：'.*@+.*' 表示“任何字符 (.) 零次或多次 (*)，然后是 @ 1 次或多次 (+)通过任何字符 (.) 零次或多次 (*)

df['column'] = df['column'].str.replace('.*@+.*', 'replacement')

Should work, where 'replacement' is whatever string you want to put in.

应该可行，其中“替换”是您要放入的任何字符串。

Answer 2

回答by ranlot

Assuming you called your dataframe df, you can do:

假设您调用了 dataframe df，您可以执行以下操作：

pd.DataFrame(map(lambda col: map(lambda x: 'anotherString' if '@' in x else x, df[col]), df.columns)).transpose()

Answer 3

回答by Ezer K

My suggestion:

我的建议：

df['col'] = ['new string' if '@' in x else x for x in df['col']]

not sure which is faster.

不确定哪个更快。

pandas 替换熊猫数据框中的字符串

提问by DJK

回答by Bill Harper

回答by ranlot

回答by Ezer K

相关推荐

最近更新

标签

pandas 替换熊猫数据框中的字符串

提问by DJK

回答by Bill Harper

回答by ranlot

回答by Ezer K

相关推荐

pandas 聚合数据并获得总和和计数

pandas 选择特定列仅形成 Python 中的数据框

Pandas：保存到 excel 编码问题

时间序列分析 - 不均匀间隔的措施 - pandas + statsmodels

相关推荐

最近更新

标签