pandas 根据列名拆分pandas数据框

Question

提问by Segmented

Is there a way to split a pandas data frame based on the column name? As an example consider the data frame has the following columns df = ['A_x', 'B_x', 'C_x', 'A_y', 'B_y', 'C_y']and I want to create two data frames X = ['A_x', 'B_x', 'C_x']and Y = ['A_y', 'B_y', 'C_y'].

有没有办法根据列名拆分Pandas数据框？例如，考虑数据框具有以下列df = ['A_x', 'B_x', 'C_x', 'A_y', 'B_y', 'C_y']，我想创建两个数据框X = ['A_x', 'B_x', 'C_x']和Y = ['A_y', 'B_y', 'C_y'].

I know there is a possibility to do this:

我知道有可能这样做：

d = {'A': df.A_x, 'B': df.B_x, 'C': df.B_x}
X = pd.DataFrame (data=d)

but this would not be ideal as in my case I have 2200 columns in df. Is there a more elegant solution?

但这并不理想，因为在我的情况下，我在df. 有没有更优雅的解决方案？

Answer 1

回答by unutbu

You could use df.filter(regex=...):

你可以使用df.filter(regex=...)：

import numpy as np
import pandas as pd
df = pd.DataFrame(np.random.randn(2, 10),
                  columns='Time A_x A_y A_z B_x B_y B_z C_x C_y C-Z'.split())
X = df.filter(regex='_x')
Y = df.filter(regex='_y')

yields

产量

In [15]: X
Out[15]: 
        A_x       B_x       C_x
0 -0.706589  1.031368 -0.950931
1  0.727826  0.879408 -0.049865

In [16]: Y
Out[16]: 
        A_y       B_y       C_y
0 -0.663647  0.635540 -0.532605
1  0.326718  0.189333 -0.803648

pandas 根据列名拆分pandas数据框

提问by Segmented

回答by unutbu

相关推荐

最近更新

标签

pandas 根据列名拆分pandas数据框

提问by Segmented

回答by unutbu

相关推荐

pandas 如何使用pandas从csv读取特定的列索引

Pandas DataFrame.groupby 包括索引

pandas 在应用函数pandas python中包含组名

从 Pandas DataFrame 中的列创建一个元组

相关推荐

最近更新

标签