pandas: remove a character from an entire DataFrame

Question

Asked by MJB

A common operation I need to do with pandas is to read a table from an Excel file and then remove semicolons from all the fields. The columns often contain mixed data types, and I run into an AttributeError when trying something like this:

for col in cols_to_check:
    df[col] = df[col].map(lambda x: x.replace(';',''))

AttributeError: 'float' object has no attribute 'replace'

When I wrap the value in str() before replacing, I run into problems with Unicode characters, e.g.:

for col in cols_to_check:
    df[col] = df[col].map(lambda x: str(x).replace(';',''))

UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 3: ordinal not in range(128)

In Excel this is a very simple operation: all it takes is replacing ; with an empty string. How can I do the same in pandas for an entire DataFrame, regardless of data types? Or am I missing something?

Answer 1

Answered by jezrael

You can use DataFrame.replace with regex=True. To restrict the replacement to certain columns, select that subset of columns first:

import pandas as pd

df = pd.DataFrame({'A':[1,2,3],
                   'B':[4,5,6],
                   'C':['f;','d:','sda;sd'],
                   'D':['s','d;','d;p'],
                   'E':[5,3,6],
                   'F':[7,4,3]})

print (df)
   A  B       C    D  E  F
0  1  4      f;    s  5  7
1  2  5      d:   d;  3  4
2  3  6  sda;sd  d;p  6  3

cols_to_check = ['C','D', 'E']

print (df[cols_to_check])
        C    D  E
0      f;    s  5
1      d:   d;  3
2  sda;sd  d;p  6

df[cols_to_check] = df[cols_to_check].replace({';':''}, regex=True)
print (df)
   A  B      C   D  E  F
0  1  4      f   s  5  7
1  2  5     d:   d  3  4
2  3  6  sdasd  dp  6  3
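
The example above only has clean string and integer columns, so here is a minimal sketch of the same call on the kind of mixed data described in the question. The frame and its column names (name, value, mixed) are made up for illustration, and the sketch assumes Python 3. With regex=True the replacement is applied only to string values, so floats and NaN pass through untouched and no str() conversion is needed. The guarded map at the end is an alternative for a single column, not part of the original answer.

```python
import numpy as np
import pandas as pd

# Hypothetical mixed-type data, standing in for a sheet read from Excel
df = pd.DataFrame({
    'name': ['caf\u00e9;', 'a;b', 'plain'],   # strings, one with a non-ASCII character
    'value': [1.5, np.nan, 3.0],              # purely numeric column with a missing value
    'mixed': ['x;', 2.0, 'y;z'],              # strings and floats in the same column
})

# Whole-frame replacement: only string values are rewritten,
# numeric values and NaN are left as they are
cleaned = df.replace({';': ''}, regex=True)

# Alternative for one column: guard the str method with a type check
def strip_semicolons(x):
    """Remove semicolons from strings; return any other value unchanged."""
    return x.replace(';', '') if isinstance(x, str) else x

guarded = df['mixed'].map(strip_semicolons)

print(cleaned)
print(guarded.tolist())
```

Because regex=True treats the key as a regular expression, a character with a special meaning in regex (for example . or |) would need to be escaped, e.g. with re.escape. The semicolon has no special meaning, so it can be used as is.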
