Hi Neha, This looks like a feature engineering task. I think VectorSlicer can help with your case. Please refer to http://spark.apache.org/docs/latest/ml-features.html#vectorslicer .
Regards, Yuhao 2016-06-01 21:18 GMT+08:00 Neha Mehta <nehamehta...@gmail.com>: > Hi, > > I am performing Regression using Random Forest. In my input vector, I want > the algorithm to ignore certain columns/features while training the > classifier and also while prediction. These are basically Id columns. I > checked the documentation and could not find any information on the same. > > Request help with the same. > > Thanks & Regards, > Neha >