Alexander Ulanov created SPARK-5362:
---------------------------------------
Summary: Gradient and Optimizer to support generic output (instead of label)
and data batches
Key: SPARK-5362
URL: https://issues.apache.org/jira/browse/SPARK-5362
Project: Spark
Issue Type: Improvement
Components: MLlib
Affects Versions: 1.2.0
Reporter: Alexander Ulanov
Fix For: 1.3.0
Currently, Gradient and Optimizer interfaces support data in form of RDD[Double, Vector] which
refers to label and features. This limits its application to classification problems. For
example, artificial neural network demands Vector as output (instead of label: Double). Moreover,
current interface does not support data batches. I propose to replace label: Double with output:
Vector. It enables passing generic output instead of label and also passing data and output
batches stored in corresponding vectors.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org
|