pandas: how to apply a function to multiple columns in a pandas dataframe at one time

Question

Asked by yoshiserry

I frequently deal with data that is poorly formatted (i.e., number fields are not consistent, etc.).

There may be other ways that I am not aware of, but the way I format a single column in a dataframe is by using a function and mapping the column to that function.

format = df.column_name.map(format_number)

Question 1: What if I have a dataframe with 50 columns and want to apply that formatting to multiple columns, e.g. columns 1, 3, 5, 7, 9?

Can you do something like:

format = df.1,3,5,9.map(format_number)

That way I could format all my number columns in one line?

Answer 1

Answered by BrenBarn

You can do df[['Col1', 'Col2', 'Col3']].applymap(format_number). Note, though, that this will return new columns; it won't modify the existing DataFrame. If you want to put the values back in the original, you'll have to do df[['Col1', 'Col2', 'Col3']] = df[['Col1', 'Col2', 'Col3']].applymap(format_number).
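
As a minimal sketch of the approach above, assuming a hypothetical format_number that strips commas and whitespace before converting to float (the question never shows its own implementation), and made-up sample data. DataFrame.applymap was deprecated in pandas 2.1 in favor of DataFrame.map, so the sketch uses whichever one the installed version provides:

```python
import pandas as pd


def format_number(value):
    """Hypothetical formatter: drop commas/whitespace, convert to float."""
    return float(str(value).replace(",", "").strip())


# Made-up sample data; "Name" is a column we do NOT want to format.
df = pd.DataFrame({
    "Col1": ["1,000", " 2.5", "3"],
    "Col2": ["10", "20,000", "30"],
    "Col3": ["0.1", "0.2", "0.3"],
    "Name": ["a", "b", "c"],
})

cols = ["Col1", "Col2", "Col3"]
subset = df[cols]

# DataFrame.map exists in pandas >= 2.1; older versions only have applymap.
elementwise = subset.map if hasattr(subset, "map") else subset.applymap

# The element-wise call returns a new DataFrame, so assign it back.
df[cols] = elementwise(format_number)

print(df)
```

To select columns by position rather than by name, as the question asks, something like cols = df.columns[[1, 3, 5, 7, 9]] should work; note that positions are zero-based.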

Answer 2

Answered by EdChum

You could use apply like this:

df.apply(lambda row: format_number(row), axis=1)

You would need to specify the columns in your format_number function, though:

def format_number(row):
    row['Col1'] = doSomething(row['Col1'])
    row['Col2'] = doSomething(row['Col2'])
    row['Col3'] = doSomething(row['Col3'])
    # return the row so that apply can build its result from it
    return row

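This is not as elegant as @BrenBarn's answer, but it has the advantage that the dataframe is modified in place, so you don't need to assign the columns back again. (Caveat: the pandas documentation describes mutating the object passed to an apply function as unsupported, so if the change does not show up in df, assign the result of df.apply back instead.)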
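
A runnable sketch of the row-wise variant, under a few assumptions: do_something is a hypothetical stand-in for doSomething (here it rounds to one decimal place), the sample data is made up, and the function works on a copy of each row and returns it rather than relying on in-place mutation:

```python
import pandas as pd


def do_something(value):
    """Hypothetical stand-in for doSomething: round to one decimal place."""
    return round(float(value), 1)


def format_number(row):
    # Work on a copy so the object pandas passes in is never mutated.
    row = row.copy()
    for col in ("Col1", "Col2", "Col3"):
        row[col] = do_something(row[col])
    return row


# Made-up sample data.
df = pd.DataFrame({
    "Col1": [1.234, 5.678],
    "Col2": [2.345, 6.789],
    "Col3": [3.456, 7.891],
})

# axis=1 passes each row to format_number as a Series.
result = df.apply(format_number, axis=1)

print(result)
```

Here result holds the formatted values and df is left untouched; writing df = df.apply(format_number, axis=1) would replace the original. Row-wise apply calls the function once per row, so on large frames the column-subset approach from the first answer is usually the faster choice.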
