Hi Neha,

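This looks like a feature engineering task. I think VectorSlicer can help with your case: it takes a feature vector and outputs a new vector containing only the indices (or named features) you select, so the ID features can be dropped before both training and prediction. Please refer to http://spark.apache.org/docs/latest/ml-features.html#vectorslicer .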


2016-06-01 21:18 GMT+08:00 Neha Mehta <nehamehta.nm@gmail.com>:


I am performing regression using Random Forest. In my input vector, I want the algorithm to ignore certain columns/features, both while training the model and during prediction. These are basically ID columns. I checked the documentation and could not find any information on this.

Request help with the same.

Thanks & Regards,