Python 总结 Pandas DataFrame 中的列值

Question

提问by Nyxynyx

In a pandas DataFrame, is it possible to collapse columns which have identical values, and sum up the values in another column?

在 Pandas DataFrame 中，是否可以折叠具有相同值的列，并对另一列中的值求和？

Code

代码

data = {"score":{"0":9.397,"1":9.397,"2":9.397995,"3":9.397996,"4":9.3999},"type":{"0":"advanced","1":"advanced","2":"advanced","3":"newbie","4":"expert"},"count":{"0":394.18930604,"1":143.14226729,"2":9.64172783,"3":0.1,"4":19.65413734}}
df = pd.DataFrame(data)
df

Output

输出

     count       score       type
0    394.189306  9.397000    advanced
1    143.142267  9.397000    advanced
2    9.641728    9.397995    advanced
3    0.100000    9.397996    newbie
4    19.654137   9.399900    expert

In the example above, the first two rows have the same scoreand type, so these rows should be merged together and their scores added up.

在上面的示例中，前两行具有相同的score和type，因此应将这些行合并在一起并将它们的分数相加。

Desired Output

期望输出

     count       score       type
0    537.331573  9.397000    advanced
1    9.641728    9.397995    advanced
2    0.100000    9.397996    newbie
3    19.654137   9.399900    expert

Answer 1

采纳答案by DSM

This is a job for groupby:

这是一份工作groupby：

>>> df.groupby(["score", "type"]).sum()
                        count
score    type                
9.397000 advanced  537.331573
9.397995 advanced    9.641728
9.397996 newbie      0.100000
9.399900 expert     19.6541374
>>> df.groupby(["score", "type"], as_index=False).sum()
      score      type       count
0  9.397000  advanced  537.331573
1  9.397995  advanced    9.641728
2  9.397996    newbie    0.100000
3  9.399900    expert   19.654137

Python 总结 Pandas DataFrame 中的列值

提问by Nyxynyx

采纳答案by DSM

相关推荐

最近更新

标签

Python 总结 Pandas DataFrame 中的列值

提问by Nyxynyx

采纳答案by DSM

相关推荐

如何永远运行 Python 程序？

Python Matplotlib pyplot 轴格式化程序

Python 无法比较天真和有意识的 datetime.now() <= challenge.datetime_end

Python Mac + virtualenv + pip + postgresql = 错误：找不到 pg_config 可执行文件

相关推荐

最近更新

标签