Pandas Dataframe：获取最大元素的索引

Question

提问by JNevens

I am looking for a way to get both the index and the column of the maximum element in a Pandas DataFrame. Thus far, this is my code:

我正在寻找一种方法来获取 Pandas DataFrame 中最大元素的索引和列。到目前为止，这是我的代码：

idx = range(0, 50, 5)
col = range(0, 50, 5)
scores = pd.DataFrame(np.zeros((len(idx), len(col))), index=idx, columns=col, dtype=float)
scores.loc[11, 16] = 5 #Assign a random element

This gives me the following DataFrame:

这给了我以下数据帧：

  | 1   6   11  16  21  26  31  36  41  46
------------------------------------------
1 | 0   0   0   0   0   0   0   0   0   0
6 | 0   0   0   0   0   0   0   0   0   0
11| 0   0   0   5   0   0   0   0   0   0
16| 0   0   0   0   0   0   0   0   0   0
21| 0   0   0   0   0   0   0   0   0   0
26| 0   0   0   0   0   0   0   0   0   0
31| 0   0   0   0   0   0   0   0   0   0
36| 0   0   0   0   0   0   0   0   0   0
41| 0   0   0   0   0   0   0   0   0   0
46| 0   0   0   0   0   0   0   0   0   0

After that, I use the unstackmethod:

之后，我使用以下unstack方法：

unstacked = scores.unstack().copy()
unstacked.sort(ascending=False)

This gives me:

这给了我：

16  11    5
46  46    0
16  31    0
11  31    0
    36    0
...

How can I get the index and column of the maximum value? I would like to get something along the lines of an array or tuple containing (16, 11).

如何获取最大值的索引和列？我想得到一些类似于包含(16, 11).

Answer 1

回答by fixxxer

You are looking for idxmax:

您正在寻找idxmax：

In [1332]: x
Out[1332]: 
   1  6  11  16  21  26  31  36  41  46
0  0  0   0   0   0   0   0   0   0   0
1  0  0   0   0   0   0   0   0   0   0
2  0  0   5   0   0   0   0   0   0   0
3  0  0   0   0   0   0   0   0   0   0
4  0  0   0   0   0   0   0   0   0   0
5  0  0   0   0   0   0   0   0   0   0
6  0  0   0   0   0   0   0   0   0   0
7  0  0   0   0   0   0   0   0   0   0
8  0  0   0   0   0   0   0   0   0   0
9  0  0   0   0   0   0   0   0   0   0

Row of the max value:

最大值的行：

In [1337]: max(x.idxmax())
Out[1337]: 2

Column of the max value (too many maxs):

最大值的列（太多maxs）：

In [1359]: x.max()[x.max() == x.max(axis=1).max()].index
Out[1359]: Index([u'11'], dtype='object')

Answer 2

回答by mik

x.max()[x.max() == x.max(axis=1).max()].index

This works to get the column but max(x.idxmax())only returns the numerical maximum of the indices themselves, not the index of the maximum value in the table (just got lucky in this example because everything else is 0's). An alternative is:

这适用于获取列，但max(x.idxmax())只返回索引本身的数值最大值，而不是表中最大值的索引（在本例中很幸运，因为其他一切都是 0）。另一种选择是：

s = x.max()[x.max() == x.max(index=1).max()].index
s = str(s[0])
max_index = x.idxmax()[s]

Pandas Dataframe：获取最大元素的索引

提问by JNevens

回答by fixxxer

回答by mik

相关推荐

最近更新

标签

Pandas Dataframe：获取最大元素的索引

提问by JNevens

回答by fixxxer

回答by mik

相关推荐

使用 Python Pandas 使用每日数据的月平均值

Pandas 堆叠条形图和分组条形图

使用 Pandas 聚合所有数据帧行对组合

pandas 熊猫数据框有条件的 .mean() 取决于特定列中的值

相关推荐

最近更新

标签