spark-issues mailing list archives

From "Xiangrui Meng (JIRA)" <>
Subject [jira] [Created] (SPARK-1359) SGD implementation is not efficient
Date Mon, 31 Mar 2014 08:18:14 GMT
Xiangrui Meng created SPARK-1359:

             Summary: SGD implementation is not efficient
                 Key: SPARK-1359
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
    Affects Versions: 0.9.0
            Reporter: Xiangrui Meng

The SGD implementation samples a mini-batch to compute the stochastic gradient. This is
inefficient because examples are exposed only through an iterator interface, so every
example must be scanned to obtain a sample.
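
The cost described above can be illustrated with a minimal sketch. This is not the MLlib code; it is a plain-Python illustration that assumes Bernoulli sampling over the iterator, and every name in it (CountingIterator, sample_from_iterator, sample_from_indexed) is hypothetical. It counts how many examples are consumed to draw a mini-batch through an iterator and contrasts that with sampling from an indexable collection.

```python
import random


class CountingIterator:
    """Hypothetical wrapper that counts how many elements are consumed."""

    def __init__(self, data):
        self._it = iter(data)
        self.consumed = 0

    def __iter__(self):
        return self

    def __next__(self):
        value = next(self._it)  # raises StopIteration when exhausted
        self.consumed += 1
        return value


def sample_from_iterator(examples, fraction, rng):
    # Assumed Bernoulli sampling: keep each example with probability
    # `fraction`. The iterator must be drained to the end to do this.
    return [x for x in examples if rng.random() < fraction]


def sample_from_indexed(examples, batch_size, rng):
    # With random access, only `batch_size` elements are read.
    indices = rng.sample(range(len(examples)), batch_size)
    return [examples[i] for i in indices]


rng = random.Random(0)
data = list(range(10_000))

stream = CountingIterator(data)
batch = sample_from_iterator(stream, 0.01, rng)
# All 10,000 examples were scanned to draw a batch of roughly 100.
print(stream.consumed, len(batch))

batch_indexed = sample_from_indexed(data, 100, rng)
# Only 100 elements were read here.
print(len(batch_indexed))
```

Under these assumptions, the iterator-based path costs time proportional to the full data size on every SGD step, regardless of how small the mini-batch is.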

