spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yuhao Yang <hhb...@gmail.com>
Subject Re: Ignore features in Random Forest
Date Thu, 02 Jun 2016 06:22:14 GMT
Hi Neha,

This looks like a feature engineering task. I think VectorSlicer can help
with your case. Please refer to
http://spark.apache.org/docs/latest/ml-features.html#vectorslicer .

Regards,
Yuhao

2016-06-01 21:18 GMT+08:00 Neha Mehta <nehamehta.nm@gmail.com>:

> Hi,
>
> I am performing Regression using Random Forest. In my input vector, I want
> the algorithm to ignore certain columns/features while training the
> classifier and also while prediction. These are basically Id columns. I
> checked the documentation and could not find any information on the same.
>
> Request help with the same.
>
> Thanks & Regards,
> Neha
>

Mime
View raw message