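So basically I have a data frame called df. It has several columns, but the two I want to focus on are DT (a POSIXct variable) and CITY (a character variable). Each city has a different number of entries in the DT column. I want to create a third column holding, for each city, the interval between consecutive DT values in chronological order. Each city therefore has its own independent sequence of dates and date intervals.
I created a subset of df to give a clearer picture of what I am working with.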
DT <- as.POSIXct(c("2019-11-02 20:00:00 CET","2019-11-02 19:00:00 CET","2019-11-02 20:00:00 CET","2019-11-03 19:30:00 CET","2019-11-04 19:00:00 CET","2019-11-05 19:30:00 CET","2019-11-05 19:00:00 CET","2019-11-05 20:00:00 CET","2019-11-06 19:30:00 CET","2019-11-06 20:30:00 CET","2019-11-06 19:00:00 CET","2019-11-08 19:30:00 CET","2019-11-08 20:30:00 CET","2019-11-08 20:00:00 CET","2019-11-08 19:00:00 CET","2019-11-09 20:00:00 CET","2019-11-10 21:30:00 CET","2019-11-10 19:30:00 CET","2019-11-10 18:00:00 CET","2019-11-10 21:00:00 CET","2019-11-11 19:30:00 CET","2019-11-11 22:30:00 CET","2019-11-12 21:00:00 CET","2019-11-12 19:00:00 CET" ))
CITY <- c("TOR","ORL","WAS","DAL","CLE","ATL","TOR","CLE")
df <- data.frame(DT,CITY)
library(dplyr)  # provides %>% and arrange()
df <- df %>% arrange(CITY)
df
The third column I create below is the result I want; the first two columns are what I currently have.
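# Illustrative values only, not computed from the data above; NA marks a city's first entry
days_since_last_entry <- c(NA,1,2,NA,3,4,2)
# head() keeps as many rows as there are illustrative values so the columns line up
wanted <- data.frame(head(df, length(days_since_last_entry)), days_since_last_entry)
wanted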
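For what it's worth, here is a rough sketch of the kind of computation I have in mind. It is not a tested solution for the real data: it assumes dplyr is available, that the gap should be expressed in days, and it uses a small made-up data frame (`toy`, with `tz = "UTC"` chosen only to avoid daylight-saving effects) rather than the data above.

```r
library(dplyr)

# Made-up example data (hypothetical, not the data from the question)
toy <- data.frame(
  DT = as.POSIXct(c("2019-11-02 20:00:00", "2019-11-05 19:00:00",
                    "2019-11-03 20:00:00", "2019-11-08 19:00:00"),
                  tz = "UTC"),
  CITY = c("TOR", "TOR", "ORL", "ORL")
)

toy <- toy %>%
  arrange(CITY, DT) %>%   # chronological order within each city
  group_by(CITY) %>%      # compute gaps separately per city
  mutate(days_since_last_entry =
           as.numeric(difftime(DT, lag(DT), units = "days"))) %>%
  ungroup()

toy
```

The first row of each city comes out as NA because there is no earlier entry to compare against, which matches the shape of the result I want.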
Any help would be greatly appreciated.