Python Pandas：从多级列索引中删除一个级别？

Question

提问by David Wolever

If I've got a multi-level column index:

如果我有一个多级列索引：

>>> cols = pd.MultiIndex.from_tuples([("a", "b"), ("a", "c")])
>>> pd.DataFrame([[1,2], [3,4]], columns=cols)

    a
   ---+--
    b | c
--+---+--
0 | 1 | 2
1 | 3 | 4

How can I drop the "a" level of that index, so I end up with:

如何删除该索引的“a”级别，所以我最终得到：

    b | c
--+---+--
0 | 1 | 2
1 | 3 | 4

Answer 1

回答by DSM

You can use MultiIndex.droplevel:

您可以使用MultiIndex.droplevel：

>>> cols = pd.MultiIndex.from_tuples([("a", "b"), ("a", "c")])
>>> df = pd.DataFrame([[1,2], [3,4]], columns=cols)
>>> df
   a   
   b  c
0  1  2
1  3  4

[2 rows x 2 columns]
>>> df.columns = df.columns.droplevel()
>>> df
   b  c
0  1  2
1  3  4

[2 rows x 2 columns]

Answer 2

回答by sedeh

You could also achieve that by renaming the columns:

您还可以通过重命名列来实现：

df.columns = ['a', 'b']

This involves a manual step but could be an option especially if you would eventually rename your data frame.

这涉及一个手动步骤，但可能是一个选项，特别是如果您最终要重命名您的数据框。

Answer 3

回答by spacetyper

Another way to do this is to reassign dfbased on a cross section of df, using the .xsmethod.

另一种方法是使用.xs方法df基于的横截面重新分配。df

>>> df

    a
    b   c
0   1   2
1   3   4

>>> df = df.xs('a', axis=1, drop_level=True)

    # 'a' : key on which to get cross section
    # axis=1 : get cross section of column
    # drop_level=True : returns cross section without the multilevel index

>>> df

    b   c
0   1   2
1   3   4

Answer 4

回答by Mint

Another way to drop the index is to use a list comprehension:

另一种删除索引的方法是使用列表理解：

df.columns = [col[1] for col in df.columns]

   b  c
0  1  2
1  3  4

This strategy is also useful if you want to combine the names from both levels like in the example below where the bottom level contains two 'y's:

如果您想将两个级别的名称组合在一起，则此策略也很有用，如下例所示，其中底层包含两个 'y'：

cols = pd.MultiIndex.from_tuples([("A", "x"), ("A", "y"), ("B", "y")])
df = pd.DataFrame([[1,2, 8 ], [3,4, 9]], columns=cols)

   A     B
   x  y  y
0  1  2  8
1  3  4  9

Dropping the top level would leave two columns with the index 'y'. That can be avoided by joining the names with the list comprehension.

删除顶层会留下两列索引为“y”的列。这可以通过将名称与列表推导结合来避免。

df.columns = ['_'.join(col) for col in df.columns]

    A_x A_y B_y
0   1   2   8
1   3   4   9

That's a problem I had after doing a groupby and it took a while to find this other questionthat solved it. I adapted that solution to the specific case here.

这是我在进行 groupby 后遇到的一个问题，我花了一段时间才找到解决它的另一个问题。我在此处针对特定情况调整了该解决方案。

Answer 5

回答by dhFrank

I have struggled with this problem since I don't know why my droplevel() function does not work. Work through several and learn that ‘a' in your table is columns name and ‘b', ‘c' are index. Do like this will help

我一直在努力解决这个问题，因为我不知道为什么我的 droplevel() 函数不起作用。通过几个工作并了解表中的“a”是列名，而“b”、“c”是索引。这样做会有所帮助

df.columns.name = None
df.reset_index() #make index become label

Answer 6

回答by YOBEN_S

A small trick using sumwith level=1(work when level=1 is all unique)

使用sumlevel=1 的一个小技巧（当 level=1 都是唯一的时工作）

df.sum(level=1,axis=1)
Out[202]: 
   b  c
0  1  2
1  3  4

回答by jxc

As of Pandas 0.24.0, we can now use DataFrame.droplevel():

从 Pandas 0.24.0 开始，我们现在可以使用DataFrame.droplevel()：

cols = pd.MultiIndex.from_tuples([("a", "b"), ("a", "c")])
df = pd.DataFrame([[1,2], [3,4]], columns=cols)

df.droplevel(0, axis=1) 

#   b  c
#0  1  2
#1  3  4

This is very useful if you want to keep your DataFrame method-chain rolling.

如果您想保持 DataFrame 方法链滚动，这非常有用。

Answer 8

回答by Shubham Joshi

One line super simple answer:- df.columns=[df.columns.get_level_values(0)[i] + '_' + df.columns.get_level_values(1)[i] for i in range(0,len(df.columns.get_level_values(0)))]

一行超级简单的答案：- df.columns=[df.columns.get_level_values(0)[i] + '_' + df.columns.get_level_values(1)[i] for i in range(0,len(df. columns.get_level_values(0)))]

this will give you a data frame with:-

这将为您提供一个数据框：-

a_b b_c 0 1 2 1 3 4

Python Pandas：从多级列索引中删除一个级别？

提问by David Wolever

回答by DSM

回答by sedeh

回答by spacetyper

回答by Mint

回答by dhFrank

回答by YOBEN_S

回答by jxc

回答by Shubham Joshi

相关推荐

最近更新

标签

Python Pandas：从多级列索引中删除一个级别？

提问by David Wolever

回答by DSM

回答by sedeh

回答by spacetyper

回答by Mint

回答by dhFrank

回答by YOBEN_S

回答by jxc

回答by Shubham Joshi

相关推荐

Python 在 PyCharm 中重命名文件

Python 如何从安卓平板电脑访问我的 127.0.0.1:8000

Python 如何在一个带有美丽汤的div中选择一类div？

Python - 四舍五入到最接近的十

相关推荐

最近更新

标签