mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shashikant Kore <>
Subject Re: [Canopy] Picking t1 and t2 was Re: [jira] Commented: (MAHOUT-121) Speed up distance calculations for sparse vectors
Date Thu, 18 Jun 2009 07:10:42 GMT
I have  verifed the results only by "laugh-test" method.  Many of the
clusters were excellent. There were some false-positives though, which
were farther from the cetroid. It might be because I used 4
iterations. Higher number of iterations probably will give better

Right now, I don't have any visualization tools to make a confident
statement about quality of clusters.  I will report back when I have
something concrete.


On Thu, Jun 18, 2009 at 12:16 AM, Ted Dunning<> wrote:
> Shashi,
> What were the results for k-means?
> (I have zero experience with canopy, but have generally had mildly useful
> results using k-means clustering.
> On Wed, Jun 17, 2009 at 7:34 AM, Shashikant Kore <>wrote:
>> I ran Canopy and then K-Means on 50k doc vectors
> --
> Ted Dunning, CTO
> DeepDyve

View raw message