Adding a Filter to a Pandas Pivot Table

Question

Asked by progster

I would like to add a filtering condition to a pivot table, like this:

(Select the values of v2 equal to 'A')

pd.pivot_table(df,index=['v1'],columns=['v2'=='A'],values=['v3'],aggfunc='count')

Is that possible?

Answer 1

Answered by Josh Janjua

This is an extension of Grr's answer.

Using their suggestion:

pd.pivot_table(df[df.v3 == some_value], index='v1', columns='A', values='v3', aggfunc='count')

Produces an error:

"TypeError: pivot_table() got multiple values for argument 'values'"

I made a slight tweak, and it works for me:

df[df.v3 == some_value].pivot_table(index='v1', columns='A', values='v3', aggfunc='count')

To add multiple filters, combine the conditions with the & and | operators, and wrap each condition in its own set of () to set the precedence. Using the keywords and / or instead results in an error.

df[(df.v3 == some_value) & (df.v4 == some_value)].pivot_table(index='v1', columns='A', values='v3', aggfunc='count')
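Here is a small sketch of the multiple-filter case, in case it helps. The sample data and the filter values (1 and 10) are made up for illustration; only the column names v1, A, v3 and v4 come from the snippets above.

```python
import pandas as pd

# Hypothetical sample data; only the column names mirror the answer above.
df = pd.DataFrame({
    "v1": ["x", "x", "y", "y", "y"],
    "A":  ["p", "q", "p", "p", "q"],
    "v3": [1, 1, 1, 2, 1],
    "v4": [10, 10, 10, 10, 20],
})

# Each comparison gets its own parentheses because & binds tighter than ==.
mask = (df.v3 == 1) & (df.v4 == 10)
result = df[mask].pivot_table(index="v1", columns="A", values="v3", aggfunc="count")

# Python's `and` asks for the truth value of a whole Series, which pandas refuses.
try:
    (df.v3 == 1) and (df.v4 == 10)
except ValueError as exc:
    error_message = str(exc)
```

The (y, q) cell comes out as NaN because no row with that combination survives the filter.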

Answer 2

Answered by Grr

If you want to filter by columns you could just pass a single column name, or list of names. For example:

pd.pivot_table(df, index='v1', columns='A', values='v3', aggfunc='count')
pd.pivot_table(df, index='v1', columns=['A', 'B', 'C'], values='v3', aggfunc='count')

If you want to filter by values you would just filter the DataFrame. For example:

pd.pivot_table(df[df.v3 == some_value], index='v1', columns='A', values='v3', aggfunc='count')
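Applied to the case in the question (keep only the rows where v2 equals 'A'), this might look roughly like the following. The data is invented for illustration; only the column names v1, v2 and v3 come from the question.

```python
import pandas as pd

# Hypothetical data using the question's column names.
df = pd.DataFrame({
    "v1": ["x", "x", "y", "y"],
    "v2": ["A", "B", "A", "A"],
    "v3": [5, 6, 7, 8],
})

# Keep only rows where v2 equals 'A', then pivot what is left.
filtered = df[df["v2"] == "A"]
table = pd.pivot_table(filtered, index="v1", columns="v2", values="v3", aggfunc="count")
```

Since the filter runs before the pivot, 'A' is the only column left in the resulting table.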

Answer 3

Answered by Vishnu Dhas

You can also use a where condition here:

df.where(df.v3 == some_value).pivot_table(index='v1', columns='A', values='v3', aggfunc='count')
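For comparison, a rough sketch of how where differs from boolean indexing, using invented data and a made-up filter value of 1. DataFrame.where keeps the original shape and fills the non-matching rows with NaN instead of dropping them; in this small example the pivot counts come out the same either way, since missing values are not counted.

```python
import pandas as pd

# Hypothetical sample data for illustration only.
df = pd.DataFrame({
    "v1": ["x", "x", "y"],
    "A":  ["p", "q", "p"],
    "v3": [1, 2, 1],
})

masked = df.where(df.v3 == 1)   # same shape as df; the non-matching row becomes NaN
filtered = df[df.v3 == 1]       # the non-matching row is dropped

via_where = masked.pivot_table(index="v1", columns="A", values="v3", aggfunc="count")
via_mask = filtered.pivot_table(index="v1", columns="A", values="v3", aggfunc="count")
```

Note that the condition is passed to where directly as a boolean Series, not wrapped in a list.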
