spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "DEVAN M.S." <msdeva...@gmail.com>
Subject Re: KNN for large data set
Date Thu, 22 Jan 2015 13:29:16 GMT
Thanks Xiangrui Meng will try this.

And, found this https://github.com/kaushikranjan/knnJoin also.
Will this work with double data ? Can we find out z value of
*Vector(10.3,4.5,3,5)* ?






On Thu, Jan 22, 2015 at 12:25 AM, Xiangrui Meng <mengxr@gmail.com> wrote:

> For large datasets, you need hashing in order to compute k-nearest
> neighbors locally. You can start with LSH + k-nearest in Google
> scholar: http://scholar.google.com/scholar?q=lsh+k+nearest -Xiangrui
>
> On Tue, Jan 20, 2015 at 9:55 PM, DEVAN M.S. <msdevanms@gmail.com> wrote:
> > Hi all,
> >
> > Please help me to find out best way for K-nearest neighbor using spark
> for
> > large data sets.
> >
>

Mime
View raw message