为 Pandas 中的多列赋值

Question

提问by SpanishBoy

I have follow simple DataFrame - df:

我遵循简单的 DataFrame - df：

Once I try to create a new columns and assign some values for them, as example below:

一旦我尝试创建一个新列并为它们分配一些值，如下例所示：

df['col2', 'col3'] = [(2,3), (2,3), (2,3)]

df['col2', 'col3'] = [(2,3), (2,3), (2,3)]

I got following structure

我得到了以下结构

   0 (col2, col3)
0  1    (2, 3)
1  2    (2, 3)
2  3    (2, 3)

However, I am looking a way to get as here:

但是，我正在寻找一种方法来达到这里：

   0 col2, col3
0  1    2,   3
1  2    2,   3
2  3    2,   3

Answer 1

回答by SpanishBoy

Looks like solution is simple:

看起来解决方案很简单：

df['col2'], df['col3'] = zip(*[(2,3), (2,3), (2,3)])

Answer 2

回答by jpp

There is a convenient solution to joining multiple series to a dataframe via a list of tuples. You can construct a dataframe from your list of tuples beforeassignment:

通过元组列表将多个系列连接到数据帧有一个方便的解决方案。您可以在分配之前从元组列表中构造一个数据框：

df = pd.DataFrame({0: [1, 2, 3]})
df[['col2', 'col3']] = pd.DataFrame([(2,3), (2,3), (2,3)])

print(df)

   0  col2  col3
0  1     2     3
1  2     2     3
2  3     2     3

This is convenient, for example, when you wish to join an arbitrary number of series.

这很方便，例如，当您希望加入任意数量的系列时。

Answer 3

回答by as - if

alternatively assigncan be used

或者assign可以使用

df.assign(col2 = 2, col3= 3)

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.assign.html

Answer 4

回答by elPastor

I ran across this issue when trying to apply multiple scalar values to multiple new columns and couldn't find a better way. If I'm missing something blatantly obvious, let me know, but df[['b','c']] = 0doesn't work. but here's the simplified code:

我在尝试将多个标量值应用于多个新列时遇到了这个问题，但找不到更好的方法。如果我遗漏了一些明显明显的内容，请告诉我，但df[['b','c']] = 0不起作用。但这是简化的代码：

# Create the "current" dataframe
df = pd.DataFrame({'a':[1,2]})

# List of columns I want to add
col_list = ['b','c']

# Quickly create key : scalar value dictionary
scalar_dict = { c : 0 for c in col_list }

# Create the dataframe for those columns - key here is setting the index = df.index
df[col_list] = pd.DataFrame(scalar_dict, index = df.index)

Or, what appears to be slightly faster is to use .assign():

或者，看起来稍微快一点的是使用.assign()：

df = df.assign(**scalar_dict)

为 Pandas 中的多列赋值

提问by SpanishBoy

回答by SpanishBoy

回答by jpp

回答by as - if

回答by elPastor

相关推荐

最近更新

标签

为 Pandas 中的多列赋值

提问by SpanishBoy

回答by SpanishBoy

回答by jpp

回答by as - if

回答by elPastor

相关推荐

pandas Python - 从数据框熊猫中检索过去 30 天的数据

pandas Groupby 并滞后数据帧的所有列？

pandas 在熊猫中找不到 Numexpr

将 Pandas 数据帧拆分为多个行数相同的数据帧

相关推荐

最近更新

标签