Calculating arbitrary percentiles on a Pandas GroupBy

Question

Asked by Alex Rothberg

Currently there is a median method on Pandas GroupBy objects.

Is there a way to calculate an arbitrary percentile (see: http://docs.scipy.org/doc/numpy-dev/reference/generated/numpy.percentile.html) on the groupings?

The median would be the percentile calculated with q=50.

Answer 1

Answered by TomAugspurger

You want the quantile method:

In [47]: df
Out[47]: 
           A         B    C
0   0.719391  0.091693  one
1   0.951499  0.837160  one
2   0.975212  0.224855  one
3   0.807620  0.031284  one
4   0.633190  0.342889  one
5   0.075102  0.899291  one
6   0.502843  0.773424  one
7   0.032285  0.242476  one
8   0.794938  0.607745  one
9   0.620387  0.574222  one
10  0.446639  0.549749  two
11  0.664324  0.134041  two
12  0.622217  0.505057  two
13  0.670338  0.990870  two
14  0.281431  0.016245  two
15  0.675756  0.185967  two
16  0.145147  0.045686  two
17  0.404413  0.191482  two
18  0.949130  0.943509  two
19  0.164642  0.157013  two

In [48]: df.groupby('C').quantile(.95)
Out[48]: 
            A         B
C                      
one  0.964541  0.871332
two  0.826112  0.969558
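
For anyone who wants to try this without the original data, here is a minimal sketch. The frame is made-up random data with the same column names (an assumption, not the frame shown above), so the numbers will differ. Note that quantile takes a fraction between 0 and 1, whereas numpy.percentile takes a value between 0 and 100.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the frame above: random values, same column names.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "A": rng.random(20),
    "B": rng.random(20),
    "C": ["one"] * 10 + ["two"] * 10,
})

grouped = df.groupby("C")

# quantile takes a fraction in [0, 1], so the 95th percentile is q=0.95.
p95 = grouped.quantile(0.95)

# q=0.5 gives the same values as the built-in median.
assert np.allclose(grouped.quantile(0.5), grouped.median())

# A list of fractions returns one row per (group, quantile) pair.
several = grouped.quantile([0.25, 0.5, 0.95])
print(several)
```

Passing a list is convenient when several percentiles are needed at once, since the result comes back indexed by both the group and the quantile.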

Answer 2

Answered by Anshuman Goel

I found another useful solution here.

If I have to use groupby, another approach can be:

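import numpy as np

def percentile(n):
    # Build an aggregation function that computes the n-th percentile (n in 0..100).
    def percentile_(x):
        return np.percentile(x, n)
    # agg uses the function name as the label of the output column.
    percentile_.__name__ = 'percentile_%s' % n
    return percentile_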

Using the call below, I am able to achieve the same result as the solution given by @TomAugspurger:

df.groupby('C').agg([percentile(50), percentile(95)])
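
As a rough check of that claim, the sketch below runs both approaches on made-up random data (the frame is an assumption, not the one from the first answer) and compares the 95th percentile columns. The two should agree because numpy.percentile and quantile both use linear interpolation by default.

```python
import numpy as np
import pandas as pd

def percentile(n):
    # Same factory as above: returns a named function computing the n-th percentile.
    def percentile_(x):
        return np.percentile(x, n)
    percentile_.__name__ = "percentile_%s" % n
    return percentile_

# Hypothetical data with the same column names as the first answer.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "A": rng.random(20),
    "B": rng.random(20),
    "C": ["one"] * 10 + ["two"] * 10,
})

# agg with a list of functions yields MultiIndex columns such as ("A", "percentile_95").
result = df.groupby("C").agg([percentile(50), percentile(95)])

# Compare against the built-in quantile method (fraction, not percent).
expected = df.groupby("C").quantile(0.95)
same_a = np.allclose(result[("A", "percentile_95")], expected["A"])
same_b = np.allclose(result[("B", "percentile_95")], expected["B"])
print(result)
print(same_a, same_b)
```

The agg form is mainly useful when percentiles need to sit next to other aggregations in the same call, since each function becomes its own labelled column.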
