spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Pentreath (JIRA)" <>
Subject [jira] [Commented] (SPARK-2336) Approximate k-NN Models for MLLib
Date Fri, 24 Feb 2017 08:11:44 GMT


Nick Pentreath commented on SPARK-2336:

I think it's safe to say that this now lives in a Spark package (that seems reasonable actively
maintained which is great) so is anyone wants this that is where to look. I further think
it's safe to say this is not going to be prioritised for MLlib, so shall we close this ticket?

> Approximate k-NN Models for MLLib
> ---------------------------------
>                 Key: SPARK-2336
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Brian Gawalt
>            Priority: Minor
>              Labels: clustering, features
> After tackling the general k-Nearest Neighbor model as per
, there's an opportunity to also offer approximate k-Nearest Neighbor. A promising approach
would involve building a kd-tree variant within from each partition, a la
> This could offer a simple non-linear ML model that can label new data with much lower
latency than the plain-vanilla kNN versions.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message