spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ulanov, Alexander" <alexander.ula...@hp.com>
Subject RE: Prediction using Classification with text attributes in Apache Spark MLLib
Date Tue, 24 Jun 2014 11:49:32 GMT
Hi Imk,

There is a number of libraries and scripts to convert text to libsvm format, if you just type
" libsvm format converter" in search engine. Unfortunately I cannot recommend a specific one,
except the one that is built in Weka. I use it for test purposes, and for big experiments
it is easier to write your own converter. Format is simple enough. However, I hope that such
tool will be implemented in Spark MLLib someday, because it will benefit from parallel processing.

Best regards, Alexander

-----Original Message-----
From: lmk [mailto:lakshmi.muralikrishnan@gmail.com] 
Sent: Tuesday, June 24, 2014 3:41 PM
To: user@spark.incubator.apache.org
Subject: RE: Prediction using Classification with text attributes in Apache Spark MLLib

Hi Alexander,
Thanks for your prompt response. Earlier I was executing this Prediction using Weka only.
But now we are moving to a huge dataset and hence to Apache Spark MLLib. Is there any other
way to convert to libSVM format? Or is there any other simpler algorithm that I can use in
mllib?

Thanks,
lmk



--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Prediction-using-Classification-with-text-attributes-in-Apache-Spark-MLLib-tp8166p8168.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Mime
View raw message