I have the following df:
df <- data.frame(geo_num = c(11, 12, 22, 41, 42, 43, 77, 71), cust_id = c("A", "A", "B", "C", "C", "C", "D", "D"), sales = c(2, 3, 2, 1, 2, 4, 6, 3))
> df
  geo_num cust_id sales
1      11       A     2
2      12       A     3
3      22       B     2
4      41       C     1
5      42       C     2
6      43       C     4
7      77       D     6
8      71       D     3
I need to create a new column 'geo_num_new' in which every row of a 'cust_id' group holds the first 'geo_num' value of that group, like this:
> df_new
  geo_num cust_id sales geo_num_new
1      11       A     2          11
2      12       A     3          11
3      22       B     2          22
4      41       C     1          41
5      42       C     2          41
6      43       C     4          41
7      77       D     6          77
8      71       D     3          77
Thanks.
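One possible approach, sketched in base R under the assumption that "first" means the first value in row order within each 'cust_id' group (not the smallest; for group D that is 77, not 71). It uses ave(), which applies a function per group and returns a vector of the same length as its input. The data frame is rebuilt here from the printed output above so the snippet runs on its own.

```r
# Data frame rebuilt from the printed output in the question.
df <- data.frame(
  geo_num = c(11, 12, 22, 41, 42, 43, 77, 71),
  cust_id = c("A", "A", "B", "C", "C", "C", "D", "D"),
  sales   = c(2, 3, 2, 1, 2, 4, 6, 3)
)

# Work on a copy so the original df stays unchanged.
df_new <- df

# For each cust_id group, take the first geo_num in row order and
# repeat it for every row of that group.
df_new$geo_num_new <- ave(df$geo_num, df$cust_id, FUN = function(x) x[1])

print(df_new)
```

If dplyr is available, grouping with group_by(cust_id) and then calling mutate(geo_num_new = first(geo_num)) should give the same column; the base R version is shown only because it needs no extra packages.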