pandas 按总和分组作为新列名

Question

提问by Adam

I am doing function where I am grouping by ID and summing the $ value associated with those IDs with this code for python:

我正在执行按 ID 分组的函数，并将与这些 ID 关联的 $ 值与此 Python 代码相加：

df = df.groupby([' Id'], as_index=False, sort=False)[["Amount"]].sum();

but it doesnt rename the column. As such I tried doing this :

但它不会重命名列。因此，我尝试这样做：

`df = df.groupby([' Id'], as_index=False, sort=False)`[["Amount"]].sum();.reset_index(name ='Total Amount')

but it gave me error that TypeError: reset_index() got an unexpected keyword argument 'name'

但它给我的错误是 TypeError: reset_index() 得到了一个意外的关键字参数 'name'

So I tried doing this finally following this post:Python Pandas Create New Column with Groupby().Sum()

所以我最终在这篇文章之后尝试这样做：Python Pandas Create New Column with Groupby().Sum()

df = df.groupby(['Id'])[["Amount"]].transform('sum');

but it still didnt work.

但它仍然没有工作。

What am I doing wrong?

我究竟做错了什么？

Answer 1

回答by jezrael

I think you need remove parameter as_index=Falseand use Series.reset_index, because this parameter return dfand then DataFrame.reset_indexwith parameter namefailed:

我认为你需要删除参数as_index=False并使用Series.reset_index，因为这个参数返回df然后DataFrame.reset_index参数name失败：

df = df.groupby('Id', sort=False)["Amount"].sum().reset_index(name ='Total Amount')

Or renamecolumn first:

或rename列第一：

d = {'Amount':'Total Amount'}
df = df.rename(columns=d).groupby('Id', sort=False, as_index=False)["Total Amount"].sum()

Sample:

样本：

df = pd.DataFrame({'Id':[1,2,2],'Amount':[10, 30,50]})
print (df)
   Amount  Id
0      10   1
1      30   2
2      50   2

df1 = df.groupby('Id', sort=False)["Amount"].sum().reset_index(name ='Total Amount')
print (df1)
   Id  Total Amount
0   1            10
1   2            80

d = {'Amount':'Total Amount'}
df1 = df.rename(columns=d).groupby('Id', sort=False, as_index=False)["Total Amount"].sum()
print (df1)
   Id  Total Amount
0   1            10
1   2            80

But if need new column with sumin original dfuse transformand assign output to new column:

但是，如果需要sum原始df使用的transform新列并将输出分配给新列：

df['Total Amount'] = df.groupby('Id', sort=False)["Amount"].transform('sum')
print (df)
   Amount  Id  Total Amount
0      10   1            10
1      30   2            80
2      50   2            80

pandas 按总和分组作为新列名

提问by Adam

回答by jezrael

相关推荐

最近更新

标签

pandas 按总和分组作为新列名

提问by Adam

回答by jezrael

相关推荐

pandas 如何在熊猫数据框中按组进行 t 检验？

pandas 如何使用pandas groupby()的split-apply-combine模式同时规范化多列

Python Pandas：字符串到日期时间

Pandas：拆分一个字符串然后创建一个新列？

相关推荐

最近更新

标签