我有数据集:
data = {'host': ['A','A','B','B'],'TS': ['1','2','3','7','9','11','8','14','16','18'],'Predict' : ['None','None','134','127','121','124']}
我想按非None值系列划分数据集并获取该系列的时差。
我具有时差功能。并尝试提取系列的索引,但不知道如何使用
def timediffs(series):
series['tdiff'] = series['ts'].diff().fillna(0.0)
return series
predict_index = df.index.where(df['Predict'].notna()).to_series().bfill()
最后,我想获得像这样的数据集:
new_data = {'host': ['A','19'],'124'],'Time_diff' : ['0','1','0',],'New_predict' : ['134','124',]
}
new_df = pd.DataFrame(new_data)