Pandas pivot_table: preserving order

Question

Asked by Rahul Ranjan

>>> df
   A   B   C      D
0  foo one small  1
1  foo one large  2
2  foo one large  2
3  foo two small  3
4  foo two small  3
5  bar one large  4
6  bar one small  5
7  bar two small  6
8  bar two large  7
>>> table = pivot_table(df, values='D', index=['A', 'B'],
...                     columns=['C'], aggfunc=np.sum)
>>> table
          small  large
foo  one  1      4
     two  6      NaN
bar  one  5      4
     two  6      7

I want the output to look like the table above, but I get sorted output instead: bar comes before foo, and so on.

Answer 1

Accepted answer by student

When you create a pivot_table, the index is automatically sorted alphabetically. This applies not only to foo and bar: you may notice that small and large are sorted as well. If you want foo on top, you need to sort again using sortlevel. To get the output shown in the question, you may need to sort on both A and C.

table.sortlevel(["A","B"], ascending= [False,True], sort_remaining=False, inplace=True)
table.sortlevel(["C"], axis=1, ascending=False,  sort_remaining=False, inplace=True)
print(table)

Output:

C        small  large
A   B                
foo one  1.0    4.0  
    two  6.0    NaN   
bar one  5.0    4.0  
    two  6.0    7.0
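
Note that DataFrame.sortlevel was deprecated and later removed from pandas (in 1.0, to my knowledge), so the calls above fail on current releases. Below is a minimal sketch of the same idea using sort_index. It assumes the question's data rebuilt by hand as df, and it uses the string "sum" in place of np.sum.

```python
import pandas as pd

# The question's data, rebuilt by hand (assumed to match the printed frame).
df = pd.DataFrame({
    "A": ["foo"] * 5 + ["bar"] * 4,
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small",
          "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

table = pd.pivot_table(df, values="D", index=["A", "B"],
                       columns=["C"], aggfunc="sum")

# Rows: A descending (foo before bar), B ascending (one before two).
table = table.sort_index(level=["A", "B"], ascending=[False, True])
# Columns: descending, so that small comes before large.
table = table.sort_index(axis=1, ascending=False)
print(table)
```

This only works because reverse alphabetical order happens to match the desired order for A and C in this data.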

Update:

To remove the index names A, B and C:

table.columns.name = None
table.index.names = (None, None)
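
Sorting restores the original order here only because it happens to be reverse alphabetical. A more general sketch, assuming you want rows and columns in order of first appearance in df, records that order up front and reindexes the pivoted table. The names row_order and col_order are made up for illustration.

```python
import pandas as pd

# The question's data, rebuilt by hand (assumed to match the printed frame).
df = pd.DataFrame({
    "A": ["foo"] * 5 + ["bar"] * 4,
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small",
          "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

# Order of first appearance of each (A, B) pair and of each C value.
row_order = pd.MultiIndex.from_frame(df[["A", "B"]].drop_duplicates())
col_order = df["C"].unique()

table = pd.pivot_table(df, values="D", index=["A", "B"],
                       columns="C", aggfunc="sum")

# pivot_table sorts its output; reindex puts it back in the recorded order.
table = table.reindex(index=row_order, columns=col_order)
print(table)
```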

Answer 2

Answered by ayhan

I think pivot_table doesn't have an option for sorting, but groupby does:

df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().unstack('C')
Out: 
C        small  large
A   B                
foo one    1.0    4.0
    two    6.0    NaN
bar one    5.0    4.0
    two    6.0    7.0

Pass the grouping columns to groupby, then call unstack on the ones you want to appear as column values.

If you don't want the index names, rename them to None:

df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().rename_axis([None, None, None]).unstack(level=2)
Out: 
         small  large
foo one    1.0    4.0
    two    6.0    NaN
bar one    5.0    4.0
    two    6.0    7.0
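
Newer pandas releases (1.3 and later, as far as I know) added a sort parameter to pivot_table itself. Below is a minimal sketch, assuming such a version is installed and the question's data is rebuilt by hand as df. Check the resulting row and column order on your own pandas version, since it depends on how the library groups and unstacks internally.

```python
import pandas as pd

# The question's data, rebuilt by hand (assumed to match the printed frame).
df = pd.DataFrame({
    "A": ["foo"] * 5 + ["bar"] * 4,
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small",
          "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

# sort=False asks pivot_table not to sort the result (pandas >= 1.3 assumed).
table = pd.pivot_table(df, values="D", index=["A", "B"],
                       columns="C", aggfunc="sum", sort=False)
print(table)
```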
