mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Necati Demir <nde...@demir.web.tr>
Subject "File does not exist: hdfs://..." problem on Amazon
Date Mon, 10 Sep 2012 20:33:30 GMT
Hello,

I use following command to seq2sparse on Amazon.

*elastic-mapreduce --jar s3://XXX/mahout-examples-0.8-SNAPSHOT-job.jar
--main-class org.apache.mahout.driver.MahoutDriver --arg seq2sparse --arg
--input --arg s3://XXX/data.seqdir --arg -o --arg s3://XXX/data.vectors
--arg --weight --arg tfidf --arg --maxDFPercent --arg 80 --arg
--maxNGramSize --arg 3 --arg --overwrite --arg --namedVector --arg
--logNormalize --arg --numReducers --arg 1 -j JOB_ID*

But the job never completes and i have the following error. Any suggestion
to solve this issue?

*attempt_201209101934_0008_r_000000_1: SLF4J: Class path contains multiple
SLF4J bindings.*
*attempt_201209101934_0008_r_000000_1: SLF4J: Found binding in
[jar:file:/home/hadoop/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
*
*attempt_201209101934_0008_r_000000_1: SLF4J: Found binding in
[jar:file:/mnt3/var/lib/hadoop/mapred/taskTracker/hadoop/jobcache/job_201209101934_0008/jars/job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
*
*attempt_201209101934_0008_r_000000_1: SLF4J: See
http://www.slf4j.org/codes.html#multiple_bindings for an explanation.*
*java.lang.IllegalStateException: /data.vectors/dictionary.file-0*
*        at
org.apache.mahout.common.iterator.sequencefile.SequenceFileIterable.iterator(SequenceFileIterable.java:63)
*
*        at
org.apache.mahout.vectorizer.term.TFPartialVectorReducer.setup(TFPartialVectorReducer.java:130)
*
*        at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)*
*        at
org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:663)*
*        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:426)*
*        at org.apache.hadoop.mapred.Child$4.run(Child.java:255)*
*        at java.security.AccessController.doPrivileged(Native Method)*
*        at javax.security.auth.Subject.doAs(Subject.java:396)*
*        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1132)
*
*        at org.apache.hadoop.mapred.Child.main(Child.java:249)*
*Caused by: java.io.FileNotFoundException: File does not exist: hdfs://
10.140.5.86:9000/data.vectors/dictionary.file-0*
*        at
org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:517)
*
*        at org.apache.hadoop.fs.FileSystem.getLength(FileSystem.java:798)*
*        at
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1538)*
*        at
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1533)*
*        at
org.apache.mahout.common.iterator.sequencefile.SequenceFileIterator.<init>(SequenceFileIterator.java:58)
*
*        at
org.apache.mahout.common.iterator.sequencefile.SequenceFileIterable.iterator(SequenceFileIterable.java:61)
*
*        ... 9 more*
*
*
*attempt_201209101934_0008_r_000000_2: SLF4J: Class path contains multiple
SLF4J bindings.*
*attempt_201209101934_0008_r_000000_2: SLF4J: Found binding in
[jar:file:/home/hadoop/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
*
*attempt_201209101934_0008_r_000000_2: SLF4J: Found binding in
[jar:file:/mnt3/var/lib/hadoop/mapred/taskTracker/hadoop/jobcache/job_201209101934_0008/jars/job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
*
*attempt_201209101934_0008_r_000000_2: SLF4J: See
http://www.slf4j.org/codes.html#multiple_bindings for an explanation.*
*Exception in thread "main" java.lang.IllegalStateException: Job failed!*
*        at
org.apache.mahout.vectorizer.DictionaryVectorizer.makePartialVectors(DictionaryVectorizer.java:321)
*
*        at
org.apache.mahout.vectorizer.DictionaryVectorizer.createTermFrequencyVectors(DictionaryVectorizer.java:191)
*
*        at
org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:272)
*
*        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)*
*        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)*
*        at
org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.main(SparseVectorsFromSequenceFiles.java:55)
*
*        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)*
*        at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
*
*        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
*
*        at java.lang.reflect.Method.invoke(Method.java:597)*
*        at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
*
*        at
org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)*
*        at
org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)*
*        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)*
*        at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
*
*        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
*
*        at java.lang.reflect.Method.invoke(Method.java:597)*
*        at org.apache.hadoop.util.RunJar.main(RunJar.java:187)*




-- 
Necati DEMÄ°R
--------------------

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message