spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Owen (JIRA)" <>
Subject [jira] [Commented] (SPARK-1357) [MLLIB] Annotate developer and experimental API's
Date Wed, 09 Apr 2014 20:45:18 GMT


Sean Owen commented on SPARK-1357:

Yeah I think it's reasonable to say that the core ALS API is only in terms of numeric IDs
and leave a higher-level translation to the caller. Longs give that much more space to hash

The "cost" in terms of memory of something like a String is just a reference, so roughly the
same as a Double anyway. I think the more important question is whether Double is too hacky
API-wise as a representation of fundamentally non-numeric data. That's up for debate, but
yeah the question here is more about reserving the right to change.

I'll submit a PR that marks the items I mention as experimental, for consideration. See if
it seems reasonable.

> [MLLIB] Annotate developer and experimental API's
> -------------------------------------------------
>                 Key: SPARK-1357
>                 URL:
>             Project: Spark
>          Issue Type: Sub-task
>          Components: MLlib
>    Affects Versions: 1.0.0
>            Reporter: Patrick Wendell
>            Assignee: Xiangrui Meng
>             Fix For: 1.0.0

This message was sent by Atlassian JIRA

View raw message