spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Xiangrui Meng <men...@gmail.com>
Subject Re: Model characterization
Date Tue, 04 Nov 2014 05:03:02 GMT
We recently added metrics for regression:
https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/evaluation/RegressionMetrics.scala
and you can use
https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/evaluation/BinaryClassificationMetrics.scala
for ROC if it is a binary classification problem.

 -Xiangrui

On Mon, Nov 3, 2014 at 12:52 PM, Sameer Tilak <sstilak@live.com> wrote:
> Hi All,
>
> I have been using LinearRegression model of MLLib and very pleased with its
> scalability and robustness. Right now, we are just calculating MSE of our
> model. We would like to characterize the performance of our model. I was
> wondering adding support for computing things such as Confidence Interval
> etc. are  they something that are on the roadmap? Graphical things such as
> ROC curves etc. will that be supported by MLLib/other parts of the
> ecosystem? or is this something for which other statistical packages are
> recommended?

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message