在 Pandas 中将一个时间序列插入另一个时间序列

Question

提问by elfnor

I have one set of values measured at regular times. Say:

我有一组定期测量的值。说：

import pandas as pd
import numpy as np
rng = pd.date_range('2013-01-01', periods=12, freq='H')
data = pd.Series(np.random.randn(len(rng)), index=rng)

And another set of more arbitrary times, for example, (in reality these times are not a regular sequence)

例如，另一组更随意的时间（实际上这些时间不是规则序列）

ts_rng = pd.date_range('2013-01-01 01:11:21', periods=7, freq='87Min')
ts = pd.Series(index=ts_rng)

I want to know the value of data interpolated at the times in ts.
I can do this in numpy:

我想知道在 ts 时间内插值的数据的值。
我可以在 numpy 中做到这一点：

x = np.asarray(ts_rng,dtype=np.float64)
xp = np.asarray(data.index,dtype=np.float64)
fp = np.asarray(data)
ts[:] = np.interp(x,xp,fp)

But I feel pandas has this functionality somewhere in resample, reindexetc. but I can't quite get it.

但我觉得 Pandas 在等的某个地方有这个功能 resample，reindex但我不太明白。

Answer 1

回答by Viktor Kerkez

You can concatenate the two time series and sort by index. Since the values in the second series are NaNyou can interpolateand the just select out the values that represent the points from the second series:

您可以连接两个时间序列并按索引排序。由于第二个系列中的值是NaN可以的interpolate，只需选择代表第二个系列中的点的值：

 pd.concat([data, ts]).sort_index().interpolate().reindex(ts.index)

or

或者

 pd.concat([data, ts]).sort_index().interpolate()[ts.index]

Answer 2

回答by tschm

Assume you would like to evaluate a time series ts on a different datetime_index. This index and the index of ts may overlap. I recommend to use the following groupby trick. This essentially gets rid of dubious double stamps. I then forward interpolate but feel free to apply more fancy methods

假设您想在不同的 datetime_index 上评估时间序列 ts。这个索引和 ts 的索引可能会重叠。我建议使用以下 groupby 技巧。这基本上摆脱了可疑的双重邮票。然后我向前插值但可以随意应用更多花哨的方法

def interpolate(ts, datetime_index):
    x = pd.concat([ts, pd.Series(index=datetime_index)])
    return x.groupby(x.index).first().sort_index().fillna(method="ffill")[datetime_index]

Answer 3

回答by ashkan

Here's a clean one liner:

这是一个干净的单衬：

ts = np.interp( ts_rng.asi8 ,data.index.asi8, data[0] )

在 Pandas 中将一个时间序列插入另一个时间序列

提问by elfnor

回答by Viktor Kerkez

回答by tschm

回答by ashkan

相关推荐

最近更新

标签

在 Pandas 中将一个时间序列插入另一个时间序列

提问by elfnor

回答by Viktor Kerkez

回答by tschm

回答by ashkan

相关推荐

pandas 熊猫：用一些 numpy 数组填充一列

如何在 pandas 的 crosstab/pivot_table 中使用两个不同的函数？

如何访问 Pandas DataFrame 中嵌入的 json 对象？

如何使用 Python Pandas 创建“yyyymmdd”格式的日期字符串列表？

相关推荐

最近更新

标签