首页 > 解决方案 > 如何将 Pandas DataFrame 日期索引从每日更改为每小时

问题描述

我有一个带有每天粒度的日期时间索引的熊猫数据框,我想在一天中的每个小时重复每一行,以便我的数据框现在每小时粒度。

2010-01-01  |  150
2010-01-02  |  200

2010-01-01 00:00:00 |  150
2010-01-01 01:00:00 |  150
2010-01-01 02:00:00 |  150
.
.
.
2010-01-01 23:00:00 |  150
2010-01-02 00:00:00 |  200
2010-01-02 01:00:00 |  200
.
.
.
2010-01-02 23:00:00 |  200

标签: pythonpandasdatetime

解决方案


DatetimeIndex必要时先创建:

df['date'] = pd.to_datetime(df['date'])
df = df.set_index('date')

创建所有可能的小时日期时间date_range,然后使用DataFrame.reindex

rng = pd.date_range(df.index.min(), df.index.max() + pd.Timedelta(23, 'H'), freq='H')
df2 = df.reindex(rng, method='ffill')

print (df2)
                       A
2010-01-01 00:00:00  150
2010-01-01 01:00:00  150
2010-01-01 02:00:00  150
2010-01-01 03:00:00  150
2010-01-01 04:00:00  150
2010-01-01 05:00:00  150
2010-01-01 06:00:00  150
2010-01-01 07:00:00  150
2010-01-01 08:00:00  150
2010-01-01 09:00:00  150
2010-01-01 10:00:00  150
2010-01-01 11:00:00  150
2010-01-01 12:00:00  150
2010-01-01 13:00:00  150
2010-01-01 14:00:00  150
2010-01-01 15:00:00  150
2010-01-01 16:00:00  150
2010-01-01 17:00:00  150
2010-01-01 18:00:00  150
2010-01-01 19:00:00  150
2010-01-01 20:00:00  150
2010-01-01 21:00:00  150
2010-01-01 22:00:00  150
2010-01-01 23:00:00  150
2010-01-02 00:00:00  200
2010-01-02 01:00:00  200
2010-01-02 02:00:00  200
2010-01-02 03:00:00  200
2010-01-02 04:00:00  200
2010-01-02 05:00:00  200
2010-01-02 06:00:00  200
2010-01-02 07:00:00  200
2010-01-02 08:00:00  200
2010-01-02 09:00:00  200
2010-01-02 10:00:00  200
2010-01-02 11:00:00  200
2010-01-02 12:00:00  200
2010-01-02 13:00:00  200
2010-01-02 14:00:00  200
2010-01-02 15:00:00  200
2010-01-02 16:00:00  200
2010-01-02 17:00:00  200
2010-01-02 18:00:00  200
2010-01-02 19:00:00  200
2010-01-02 20:00:00  200
2010-01-02 21:00:00  200
2010-01-02 22:00:00  200
2010-01-02 23:00:00  200

推荐阅读