Hi,
   is there a way to take a dataset and extract numeric columns and create 
interaction columns from it automatically?

   For e.g.  there are 5 columns of data: A,B,C,D,E.

   CDE are numeric.

   Can someone provide code to automatically create more columns such as:

   1) C*D, C*E, C*D*E, (C+E)/(D+.01 (to avoid divide by zero), (D+E)/(C+.01 (to 
avoid divide by zero), (C+D)/(E+.01 (to avoid divide by zero))

?

I know in glm multiplying can create terms but i want the columns to be part of 
the data set so that i can feed this into Random forest to pick out predictive 
interaction terms as regression cannot reliably handle correlated interaction 
terms.

if anyone has some simple code that can do this that would be helpful.

thanks
Dhruv
    

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to