mahout-user mailing list archives

From Dmitriy Lyubimov <dlie...@gmail.com>
Subject Re: SSVD fails on seq2sparse output.
Date Thu, 15 Nov 2012 15:29:28 GMT
The actual problem is in the logs of the failed task on the backend; the
driver doesn't report it. Can you pull them?
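
Once the log of a failed task attempt is in hand, the root cause is usually the first exception header or the last "Caused by:" line. A minimal sketch in Python, assuming the log has been saved as plain text; the function name `find_exceptions` is hypothetical and not part of Mahout or Hadoop:

```python
import re

# Matches a fully qualified Java exception/error class at the start of a
# trace or after "Caused by:", with an optional ": message" tail.
EXC_HEADER = re.compile(
    r"((?:[\w$]+\.)+[\w$]*(?:Exception|Error))(?::\s*(.*))?$"
)


def find_exceptions(log_text):
    """Return (class_name, message) pairs for each exception header found.

    Stack frame lines (those starting with "at ") are skipped, so only the
    headers that name the failure are returned, in order of appearance.
    """
    found = []
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        if line.startswith("at "):
            continue  # a stack frame, not an exception header
        match = EXC_HEADER.search(line)
        if match:
            found.append((match.group(1), match.group(2) or ""))
    return found
```

Calling `find_exceptions(open(path).read())` on the saved log (where `path` is wherever you stored it) lists the exceptions in order; the last entry is typically the innermost cause.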
 On Nov 15, 2012 3:44 AM, "Abramov Pavel" <p.abramov@rambler-co.ru> wrote:

> Hello!
>
> I am trying to compute a reduced-rank matrix for a recommendation system
> over users (20*10^6) and their ratings (~150 000 items).
>
> How can I avoid this exception in the Q job?
> =======================
> Exception in thread "main" java.io.IOException: Q job unsuccessful.
> at org.apache.mahout.math.hadoop.stochasticsvd.QJob.run(QJob.java:230)
> at org.apache.mahout.math.hadoop.stochasticsvd.SSVDSolver.run(SSVDSolver.java:377)
> at org.apache.mahout.math.hadoop.stochasticsvd.SSVDCli.run(SSVDCli.java:141)
> =======================
>
> CLI parameters are:
> =======================
> mahout ssvd \
> -i /rec/sparse/tfidf-vectors/ \
> -o /rec/ssvd \
> -k 100 \
> --reduceTasks 100 \
> -ow
> =======================
> Using Mahout 0.7 on Hadoop with ~50 nodes (400 Mappers/300 reducers).
>
> My input for SSVD is seq2sparse output: 200 sequence files with "Key class:
> class org.apache.hadoop.io.Text Value Class: class
> org.apache.mahout.math.VectorWritable", 8 GB in total.
>
> Many thanks in advance; any suggestion is highly appreciated. I don't know
> what to do: CF produces inaccurate results for my tasks, so SVD is the only
> hope ))
>
> Regards,
> Pavel
