spark-issues mailing list archives

From "Reza Zadeh (JIRA)" <>
Subject [jira] [Commented] (SPARK-3434) Distributed block matrix
Date Fri, 17 Oct 2014 21:21:34 GMT


Reza Zadeh commented on SPARK-3434:

Thanks Shivaram! As discussed over the phone, we will use your design and build upon it, so
that you can focus on the linear algebraic operations such as TSQR.

> Distributed block matrix
> ------------------------
>                 Key: SPARK-3434
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Xiangrui Meng
>            Assignee: Shivaram Venkataraman
> This JIRA is for discussing distributed matrices stored as block sub-matrices. The main
> challenge is choosing a partitioning scheme that allows adding linear algebra operations in the future, for example:
> 1. matrix multiplication
> 2. matrix factorization (QR, LU, ...)
> Let's discuss the partitioning and storage and how they fit into the above use cases.
> Questions:
> 1. Should it be backed by a single RDD that contains all of the sub-matrices, or by many
> RDDs, each containing only one sub-matrix?
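
To make the "single RDD of sub-matrices" option concrete, here is a minimal sketch in plain Python. It is not Spark code and not the design agreed on in this thread. A dict keyed by (block_row, block_col) stands in for one keyed RDD, and every function name below is hypothetical. Multiplication pairs block A(i, k) with block B(k, j) on the shared index k, which is the grouping a distributed join would have to perform.

```python
from collections import defaultdict


def to_blocks(matrix, block_size):
    """Split a dense list-of-lists matrix into {(block_row, block_col): block}."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    blocks = {}
    for r0 in range(0, n_rows, block_size):
        for c0 in range(0, n_cols, block_size):
            blocks[(r0 // block_size, c0 // block_size)] = [
                row[c0:c0 + block_size] for row in matrix[r0:r0 + block_size]
            ]
    return blocks


def local_multiply(a, b):
    """Dense product of two small in-memory blocks."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def local_add(a, b):
    """Element-wise sum of two blocks of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def block_multiply(a_blocks, b_blocks):
    """Multiply two block collections by pairing A(i, k) with B(k, j)."""
    # Group B's blocks by row index k: the key a join would match on.
    b_by_row = defaultdict(list)
    for (k, j), block in b_blocks.items():
        b_by_row[k].append((j, block))

    result = {}
    for (i, k), a_block in a_blocks.items():
        for j, b_block in b_by_row[k]:
            partial = local_multiply(a_block, b_block)
            # Sum partial products that land on the same output block.
            if (i, j) in result:
                result[(i, j)] = local_add(result[(i, j)], partial)
            else:
                result[(i, j)] = partial
    return result


def from_blocks(blocks):
    """Reassemble a dense matrix from a complete grid of blocks."""
    n_block_rows = max(i for i, _ in blocks) + 1
    n_block_cols = max(j for _, j in blocks) + 1
    rows = []
    for i in range(n_block_rows):
        for r in range(len(blocks[(i, 0)])):
            row = []
            for j in range(n_block_cols):
                row.extend(blocks[(i, j)][r])
            rows.append(row)
    return rows


A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
B = [[1, 0, 2], [0, 1, 0], [3, 0, 1]]
C = from_blocks(block_multiply(to_blocks(A, 2), to_blocks(B, 2)))
```

With a block size of 2 the 3x3 inputs split into uneven blocks (2x2, 2x1, 1x2, 1x1), which shows that edge blocks need no special handling as long as both operands use the same block size along the shared dimension.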
