Hi all,

I have the prediction for my test set which are forecasted Value for "4/1/2020" 
for each match of "id" and "Group". I would like to add a fourth row to each 
group by (Group,id) in my train set and the values for this row should come 
from test set :

my train set: 

     structure(list(Date = c("1/1/2020", "2/1/2020", "3/1/2020", "1/1/2020", 
     "2/1/2020", "3/1/2020", "1/1/2020", "2/1/2020", "3/1/2020", ""
     ), Value = c(3.5, 2.7, 4, 2.5, 3.7, 0, 3, 0, 1, NA), Group = c("A", 
    "A", "A", "B", "B", "B", "C", "C", "C", ""), id = c(1L, 1L, 1L, 
    101L, 101L, 101L, 100L, 100L, 100L, NA)), class = "data.frame", row.names = 
c(NA, 
    -10L))

test set:

    structure(list(Date = c("4/1/2020", "4/1/2020", "4/1/2020", ""
      ), Value = c(3.5, 2.5, 3, NA), Group = c("A", "B", "C", ""), 
    id = c(1L, 101L, 100L, NA), value = c(0.2, 0.7, 0.9, NA)), class = 
"data.frame", row.names = c(NA, 
     -4L))structure(list(Date = c("4/1/2020", "4/1/2020", "4/1/2020", ""
    ), Value = c(3.5, 2.5, 3, NA), Group = c("A", "B", "C", ""), 
    id = c(1L, 101L, 100L, NA)), class = "data.frame", row.names = c(NA, 
    -4L))

desired output: 

    structure(list(Date = c("1/1/2020", "2/1/2020", "3/1/2020", "4/1/2020", 
    "1/1/2020", "2/1/2020", "3/1/2020", "4/1/2020", "1/1/2020", "2/1/2020", 
    "3/1/2020", "4/1/2020"), Value = c(3.5, 2.7, 4, 0.2, 2.5, 3.7, 
     0, 0.7, 3, 0, 1, 0.9), Group = c("A", "A", "A", "A", "B", "B", 
    "B", "B", "C", "C", "C", "C"), id = c(1L, 1L, 1L, 1L, 101L, 101L, 
    101L, 101L, 100L, 100L, 100L, 100L)), class = "data.frame", row.names = 
c(NA, 
   -12L))

Data is dummy and I have milions of records in original data set. 

Thanks for any help.
Elahe 
 

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to