mahout-user mailing list archives

From Pat Ferrel <pat.fer...@gmail.com>
Subject Re: Mahout K-Means - Quality of the clusters
Date Mon, 19 May 2014 21:50:47 GMT
Yep, the clue is "--evaluate=null" in the console output. Try "-e true". I think I ran into that
a long time ago; it should really be fixed.

Try looking here for more explanation of cluster dump: https://mahout.apache.org/users/clustering/cluster-dumper.html

The docs are being greatly improved, so there's a chance you’ll find answers there.
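
For reference, the MSE asked about at the start of the thread is just the mean squared Euclidean distance from each point to its assigned centroid. Here is a minimal sketch in plain Python, independent of Mahout. The function name `cluster_mse` and the toy data are made up for illustration, and it assumes you have already exported the points, the centroids, and the point-to-cluster assignments yourself:

```python
def cluster_mse(points, centroids, assignments):
    """Mean squared Euclidean distance from each point to its assigned centroid.

    points      -- list of equal-length numeric tuples
    centroids   -- list of centroid tuples, indexed by cluster id
    assignments -- assignments[i] is the cluster id of points[i]
    """
    if not points:
        raise ValueError("need at least one point")
    total = 0.0
    for point, cluster_id in zip(points, assignments):
        centroid = centroids[cluster_id]
        # Squared Euclidean distance between the point and its centroid.
        total += sum((p - c) ** 2 for p, c in zip(point, centroid))
    return total / len(points)


# Hypothetical toy data: two clusters with two points each.
points = [(1.0, 1.0), (1.0, 3.0), (9.0, 9.0), (11.0, 9.0)]
centroids = [(1.0, 2.0), (10.0, 9.0)]
assignments = [0, 0, 1, 1]

mse = cluster_mse(points, centroids, assignments)
print(mse)  # every point is at squared distance 1 from its centroid, so this prints 1.0
```

A lower value means tighter clusters, but it only makes sense to compare runs that use the same data and the same k.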

On May 19, 2014, at 2:34 PM, David Noel <david.i.noel@gmail.com> wrote:

It works for me with just -e. Maybe try that or --evaluate true?

On 5/19/14, hiroshi leon <hiroshi_8712@hotmail.com> wrote:
> Thanks Pat,
> 
> But how exactly can I run clusterdump using the --evaluate (-e) parameter?
> When I try to run it, for example:
> 
> ./mahout clusterdump -i /user/Data-output/clusters-1-final -o analyze.txt
> --evaluate
> 
> I get a Java NullPointerException:
> 
> 14/05/19 15:02:03 INFO common.AbstractJob: Command line arguments:
> {--dictionaryType=[text],
> --distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure],
> --endPhase=[2147483647], --evaluate=null,
> --input=[/user/Data-output/clusters-1-final], --output=[analyze.txt],
> --outputFormat=[TEXT], --startPhase=[0], --tempDir=[temp]}
> Exception in thread "main" java.lang.NullPointerException
> 
> Do I have to put a parameter to evaluate? As input for clusterdump I am
> using the output with the clusters after running mahout K-Means.
> 
>> Subject: Re: Mahout K-Means - Quality of the clusters
>> From: pat.ferrel@gmail.com
>> Date: Sat, 17 May 2014 09:43:59 -0700
>> To: user@mahout.apache.org
>> 
>> mahout clusterdump --evaluate …
>> 
>> provides some stats
>> 
>> On May 15, 2014, at 10:23 PM, hiroshi leon <hiroshi_8712@hotmail.com>
>> wrote:
>> 
>> Hello everybody,
>> 
>> Do you know how I can get the MSE of the clusters in Mahout K-Means?
>> I would like to check the quality of the clusters. Thanks!
>> 
> 

