mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From hiroshi leon <hiroshi_8...@hotmail.com>
Subject RE: Mahout K-Means - Quality of the clusters
Date Mon, 19 May 2014 13:05:42 GMT
Thanks Pat,

But how exactly can I run clusterdump using the -evaluate (-e) parameter?
When i try to run it for example:

./mahout clusterdump -i /user/Data-output/clusters-1-final -o analyze.txt --evaluate

I get a Java null pointer Exception

14/05/19 15:02:03 INFO common.AbstractJob: Command line arguments: {--dictionaryType=[text],
--distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure], --endPhase=[2147483647],
--evaluate=null, --input=[/user/Data-output/clusters-1-final], --output=[analyze.txt], --outputFormat=[TEXT],
--startPhase=[0], --tempDir=[temp]}
Exception in thread "main" java.lang.NullPointerException

Do I have to put a parameter to evaluate? As input for clusterdump I am using the output with
the clusters after running mahout K-Means.

> Subject: Re: Mahout K-Means - Quality of the clusters
> From: pat.ferrel@gmail.com
> Date: Sat, 17 May 2014 09:43:59 -0700
> To: user@mahout.apache.org
> 
> mahout  clusterdump —evaluate …
> 
> provides some stats
> 
> On May 15, 2014, at 10:23 PM, hiroshi leon <hiroshi_8712@hotmail.com> wrote:
> 
> Hello everybody,
> 
> Do you know how can I get the MSE of the clusters in mahout K-Means? 
> I would like to check the quality of the clusters. Thanks!
> 
> 		 	   		  
> 
 		 	   		  
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message