mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeff Eastman <j...@windwardsolutions.com>
Subject Re: mahout guide or tutorial or how to for test and run kmean on hadoop
Date Sat, 28 Aug 2010 15:21:09 GMT
  Try naming the input *directory* not the particular input file.

On 8/27/10 7:51 PM, Valerio wrote:
>
> Thanks but I need more information about the command to convert a text in a
> WritableVector and than to understand and run this file in a right file for the
> kmeans.
> I did some tempts and now i have got this result:
> Thanks for your explanation about VectorWritable.
> Actually I can modify weka folder to call the right clustering and I tried
> hadoop with the wordcount and it works.
> I read on the Mahout's quisckstart page about VectorWritable
> but I didn't how to create them, I tried this:
>
> bin/mahout kmeans --input
> /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/part-00000 --k 3
> --output /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/output-kmeans
> --clusters
> /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/output-kmeans/clusters
>
>
> but i receveid this message:
>
>
> no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
> 28-ago-2010 4.21.44 org.apache.hadoop.util.NativeCodeLoader<clinit>
> AVVERTENZA: Unable to load native-hadoop library for your platform... using
> builtin-java classes where applicable
> 28-ago-2010 4.21.44 org.apache.hadoop.io.compress.CodecPool getCompressor
> INFO: Got brand-new compressor
> 28-ago-2010 4.21.44 org.slf4j.impl.JCLLoggerAdapter error
> GRAVE: MahoutDriver failed with args: [--input,
> /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/part-00000, --k, 3,
> --output, /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/output-kmeans,
> --clusters,
> /home/vuvvo/Scaricati/reuters21578/prova/prova2/vectors/output-kmeans/clusters,
> null]
> Index: 0, Size: 0
> Exception in thread "main" java.lang.IndexOutOfBoundsException: Index: 0, Size:
> 0
> 	at java.util.ArrayList.rangeCheck(ArrayList.java:571)
> 	at java.util.ArrayList.get(ArrayList.java:349)
> 	at
> org.apache.mahout.clustering.kmeans.RandomSeedGenerator.buildRandom(
> RandomSeedGenerator.java:113)
> 	at org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:164)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.
> invoke(NativeMethodAccessorImpl.java:57)
> 	at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.
> java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:616)
> 	at
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.
> java:68)
> 	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
> 	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)
>
> where I wrong?
>
>
>
>
>
>
>
>
>
>


Mime
  • Unnamed multipart/mixed (inline, None, 0 bytes)
View raw message