Python: How to count distinct values in a column of a pandas groupby object?

Question

Asked by Roman

I have a pandas data frame and group it by two columns (for example col1 and col2). For fixed values of col1 and col2 (i.e. for a group) I can have several different values in col3. I would like to count the number of distinct values in the third column.

For example, if I have this as my input:

I would like to have this table (data frame) as the output:

Answer 1

Accepted answer by Roman

df.groupby(['col1','col2'])['col3'].nunique().reset_index()
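To see what this one-liner returns, here is a minimal sketch. The sample values are hypothetical (the question's input table did not survive on this page); only the column names col1, col2 and col3 come from the question.

```python
import pandas as pd

# Hypothetical sample data: the original input table is not shown above.
df = pd.DataFrame({
    "col1": [1, 1, 1, 2, 2, 2],
    "col2": [1, 1, 2, 1, 1, 1],
    "col3": [10, 10, 20, 30, 40, 40],
})

# Group by the two key columns, count distinct col3 values per group,
# then turn the group keys back into ordinary columns.
result = df.groupby(["col1", "col2"])["col3"].nunique().reset_index()
print(result)
```

After reset_index(), the col3 column holds the distinct count for each (col1, col2) pair rather than the original values, so renaming it may make the result easier to read.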

Answer 2

Answer by Jeff

In [17]: df
Out[17]: 
    0  1  2
0   1  1  1
1   1  1  1
2   1  1  2
3   1  2  3
4   1  2  3
5   1  2  3
6   2  1  1
7   2  1  2
8   2  1  3
9   2  2  3
10  2  2  3
11  2  2  3

In [19]: df.groupby([0,1])[2].apply(lambda x: len(x.unique()))
Out[19]: 
0  1
1  1    2
   2    1
2  1    3
   2    1
dtype: int64
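
The session above can be reproduced as a standalone script. This is a sketch that rebuilds the same 12-row frame and compares the apply/lambda approach with the built-in nunique(); the variable names via_apply and via_nunique are only illustrative.

```python
import pandas as pd

# Rebuild the frame shown in the session (integer column labels 0, 1, 2).
df = pd.DataFrame({
    0: [1] * 6 + [2] * 6,
    1: [1, 1, 1, 2, 2, 2] * 2,
    2: [1, 1, 2, 3, 3, 3, 1, 2, 3, 3, 3, 3],
})

# Approach from this answer: length of the unique values in each group.
via_apply = df.groupby([0, 1])[2].apply(lambda x: len(x.unique()))

# Built-in equivalent for this data.
via_nunique = df.groupby([0, 1])[2].nunique()

print(via_apply)
print(via_nunique)
```

The two approaches agree here because the data has no missing values. By default nunique() ignores NaN (dropna=True), whereas len(x.unique()) counts NaN as a value, so the results can differ on data with gaps.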
