spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Owen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-1406) PMML model evaluation support via MLib
Date Thu, 03 Apr 2014 12:00:19 GMT

    [ https://issues.apache.org/jira/browse/SPARK-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13958742#comment-13958742
] 

Sean Owen commented on SPARK-1406:
----------------------------------

Yes, I was going to say, there's already a pretty good implementation of this. Why not simply
call pmml-evaluator from within a Spark-based app to do scoring? does it really need any particular
support in MLlib?

MLlib does not create PMML right now, which would seem like to something to tackle before
scoring them anyway.

I have a meta-concern about piling on scope at such an early stage.

> PMML model evaluation support via MLib
> --------------------------------------
>
>                 Key: SPARK-1406
>                 URL: https://issues.apache.org/jira/browse/SPARK-1406
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Thomas Darimont
>
> It would be useful if spark would provide support the evaluation of PMML models (http://www.dmg.org/v4-2/GeneralStructure.html).
> This would allow to use analytical models that were created with a statistical modeling
tool like R, SAS, SPSS, etc. with Spark (MLib) which would perform the actual model evaluation
for a given input tuple. The PMML model would then just contain the "parameterization" of
an analytical model.
> Other projects like JPMML-Evaluator do a similar thing.
> https://github.com/jpmml/jpmml/tree/master/pmml-evaluator



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message