根据包含 Pandas 中特定字符串的列名选择列

Question

提问by Eric B

I created a dataframe using the following:

我使用以下内容创建了一个数据框：

df = pd.DataFrame(np.random.rand(10, 3), columns=['alp1', 'alp2', 'bet1'])

I'd like to get a dataframe containing every columns from dfthat have alpin their names. This is only a light version of my problem, so my real dataframe will have more columns.

我想获得含有从每列的数据帧df具有alp在他们的名字。这只是我的问题的一个简单版本，所以我的真实数据框会有更多的列。

Answer 1

回答by MaxU

alternative methods:

替代方法：

In [13]: df.loc[:, df.columns.str.startswith('alp')]
Out[13]:
       alp1      alp2
0  0.357564  0.108907
1  0.341087  0.198098
2  0.416215  0.644166
3  0.814056  0.121044
4  0.382681  0.110829
5  0.130343  0.219829
6  0.110049  0.681618
7  0.949599  0.089632
8  0.047945  0.855116
9  0.561441  0.291182

In [14]: df.loc[:, df.columns.str.contains('alp')]
Out[14]:
       alp1      alp2
0  0.357564  0.108907
1  0.341087  0.198098
2  0.416215  0.644166
3  0.814056  0.121044
4  0.382681  0.110829
5  0.130343  0.219829
6  0.110049  0.681618
7  0.949599  0.089632
8  0.047945  0.855116
9  0.561441  0.291182

Answer 2

回答by piRSquared

option 1
Full numpy+ pd.DataFrame

选项 1
完整numpy+pd.DataFrame

m = np.core.defchararray.find(df.columns.values.astype(str), 'alp') >= 0
pd.DataFrame(df.values[:, m], df.index, df.columns[m])

       alp1      alp2
0  0.819189  0.356867
1  0.900406  0.968947
2  0.201382  0.658768
3  0.700727  0.946509
4  0.176423  0.290426
5  0.132773  0.378251
6  0.749374  0.983251
7  0.768689  0.415869
8  0.292140  0.457596
9  0.214937  0.976780

option 2
numpy+ loc

选项 2
numpy+loc

m = np.core.defchararray.find(df.columns.values.astype(str), 'alp') >= 0
df.loc[:, m]

       alp1      alp2
0  0.819189  0.356867
1  0.900406  0.968947
2  0.201382  0.658768
3  0.700727  0.946509
4  0.176423  0.290426
5  0.132773  0.378251
6  0.749374  0.983251
7  0.768689  0.415869
8  0.292140  0.457596
9  0.214937  0.976780

timing
numpyis faster

时间
numpy更快

Answer 3

回答by CONvid19

You've several options, here's a couple:

您有多种选择，这里有几个：

1 - filterwith like:

1 -filter与like：

df.filter(like='alp')

2 - filterwith regex:

2 -filter与regex：

df.filter(regex='alp')

Answer 4

回答by Harvey

In case @Pedro answer doesn't work here is official way of doing it for pandas 0.25

如果@Pedro 的回答在这里不起作用，这是为Pandas 0.25 做的官方方法

Sample dataframe:

示例数据框：

>>> df = pd.DataFrame(np.array(([1, 2, 3], [4, 5, 6])),
...                   index=['mouse', 'rabbit'],
...                   columns=['one', 'two', 'three'])

         one two three
mouse     1   2   3
rabbit    4   5   6

         one two three
mouse     1   2   3
rabbit    4   5   6

Select columns by name

按名称选择列

df.filter(items=['one', 'three'])
         one  three
mouse     1      3
rabbit    4      6

Select columns by regular expression

通过正则表达式选择列

df.filter(regex='e$', axis=1) #ending with *e*, for checking containing just use it without *$* in the end
         one  three
mouse     1      3
rabbit    4      6

Select rows containing 'bbi'

选择包含 'bbi' 的行

df.filter(like='bbi', axis=0)
         one  two  three
rabbit    4    5      6

根据包含 Pandas 中特定字符串的列名选择列

提问by Eric B

回答by MaxU

回答by piRSquared

回答by CONvid19

回答by Harvey

Select columns by name

按名称选择列

Select columns by regular expression

通过正则表达式选择列

Select rows containing 'bbi'

选择包含 'bbi' 的行

相关推荐

最近更新

标签

根据包含 Pandas 中特定字符串的列名选择列

提问by Eric B

回答by MaxU

回答by piRSquared

回答by CONvid19

回答by Harvey

Select columns by name

按名称选择列

Select columns by regular expression

通过正则表达式选择列

Select rows containing 'bbi'

选择包含 'bbi' 的行

相关推荐

pandas：列的长度必须与键的长度相同

pandas 类型错误：输入类型不支持 ufunc 'isnan'，-seaborn 热图

pandas 如何在python中绘制数据透视图？

pandas 使用 json_normalize 压平嵌套的 json

相关推荐

最近更新

标签