mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jake Mannix <jake.man...@gmail.com>
Subject Re: Dealing with kmean and meanshift output
Date Fri, 09 Apr 2010 19:06:21 GMT
We shouldn't need this, but can you try setting $HADOOP_CONF_DIR to
$HADOOP_HOME/conf and re-run?  We should be automatically doing this
if HADOOP_CONF_DIR is not set.

  -jake

On Fri, Apr 9, 2010 at 12:01 PM, adam35413 <adam.hammer@gmail.com> wrote:

>
> I took a look at the code, and the only thing that seemed to be required
> was
> the Sequence file.  I pulled the part-00000 file from output/clusterPoints/
> folder off of my Hadoop cluster, and tried the following command:
>
> bin/mahout clusterdump --seqFileDir part-00000 --output testFile.txt
>
> This resulted in the following error:
>
> no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
> Apr 9, 2010 3:00:25 PM org.slf4j.impl.JCLLoggerAdapter error
> SEVERE: MahoutDriver failed with args: [--seqFileDir, part-00000, --output,
> testFile.txt, null]
> null
> Exception in thread "main" java.lang.NullPointerException
>        at
>
> org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:121)
>        at
>
> org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:302)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
>
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
>
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at
>
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>        at
> org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>        at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)
>
> Strange, since $HADOOP_HOME is actually set.  Thoughts?
>
>
>
> Jeff Eastman wrote:
> >
> > The dictionary file contains a list (not sure how its delimited) of
> > element names for the input Vectors and is optional. See the new code in
> > trunk/utils in TestClusterDumper for some examples. I need to write test
> > sfor meanshift and also fuzzy kmeans to make sure they work but I
> > imagine they do. I also need to write tests that include the points, but
> > that appears to be done in memory so it likely won't scale to your
> > 5-node data set.
> >
> > Jeff
> >
> > adam35413 wrote:
> >> I have been able to successfully run the kmean and meanshift examples on
> >> a
> >> 5-node Hadoop cluster.  However, when it comes to dealing with the
> >> output, I
> >> am a bit confused.  I found the following page:
> >> http://cwiki.apache.org/MAHOUT/viewing-results.html, but when I went to
> >> track down the dictionary file I was unable to find it.  Do I need to
> >> generate the dictionary file separately or manually?
> >>
> >> Thanks!
> >>
> >
> >
> >
> --
> View this message in context:
> http://n3.nabble.com/Dealing-with-kmean-and-meanshift-output-tp708824p709022.html
> Sent from the Mahout User List mailing list archive at Nabble.com.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message