使用 python pandas 计算增量平均值

Question

提问by Jmc

I'd like to generate a series that's the incremental mean of a timeseries. Meaning that, starting from the first date (index 0), the mean stored in row x is the average of values [0:x]

我想生成一个序列，它是时间序列的增量平均值。这意味着，从第一个日期（索引 0）开始，存储在 x 行中的平均值是值 [0:x] 的平均值

data
index   value   mean          formula
0       4
1       5
2       6
3       7       5.5           average(0-3)
4       4       5.2           average(0-4)
5       5       5.166666667   average(0-5)
6       6       5.285714286   average(0-6)
7       7       5.5           average(0-7)

I'm hoping there's a way to do this without looping to take advantage of pandas.

我希望有一种方法可以在不循环利用Pandas的情况下做到这一点。

Answer 1

回答by jpobst

Here's an update for newer versions of Pandas (starting with 0.18.0)

这是 Pandas 新版本的更新（从 0.18.0 开始）

df['value'].expanding().mean()

or

或者

s.expanding().mean()

Answer 2

回答by Andy Hayden

As @TomAugspurger points out, you can use expanding_mean:

正如@TomAugspurger 指出的那样，您可以使用expanding_mean：

In [11]: s = pd.Series([4, 5, 6, 7, 4, 5, 6, 7])

In [12]: pd.expanding_mean(s, 4)
Out[12]: 
0         NaN
1         NaN
2         NaN
3    5.500000
4    5.200000
5    5.166667
6    5.285714
7    5.500000
dtype: float64

Answer 3

回答by patricksurry

Another approach is to use cumsum(), and divide by the cumulative number of items, for example:

另一种方法是使用 cumsum()，并除以项目的累积数量，例如：

In [1]:
    s = pd.Series([4, 5, 6, 7, 4, 5, 6, 7])
    s.cumsum() / pd.Series(np.arange(1, len(s)+1), s.index)

Out[1]:
0    4.000000
1    4.500000
2    5.000000
3    5.500000
4    5.200000
5    5.166667
6    5.285714
7    5.500000
dtype: float64

使用 python pandas 计算增量平均值

提问by Jmc

回答by jpobst

回答by Andy Hayden

回答by patricksurry

相关推荐

最近更新

标签

使用 python pandas 计算增量平均值

提问by Jmc

回答by jpobst

回答by Andy Hayden

回答by patricksurry

相关推荐

pandas 使用多索引在熊猫中添加小计列

pandas 使用pandas，计算Cramér的系数矩阵

pandas 根据列名称创建 DataFrame 的子集

pandas 将python pandas数据帧写入csv文件时出错

相关推荐

最近更新

标签