mahout-user mailing list archives

From Amit Krishna Joshi
Subject Increase timeout for running PFPGrowth
Date Mon, 22 Oct 2012 17:08:38 GMT

I am running PFP on several datasets and it works well for smaller ones (<
However, for the larger ones, I keep getting the following timeout message:

Task attempt_201210140938_0105_r_000000_0 failed to report status for 600
seconds. Killing!

Is there a way I can increase the timeout?

I even tried passing these parameters, but in vain:
-D mapred.task.timeout=18000000 -D

My input params are:  -s 10000 -g 1000  -tc 8  -k 50 -method mapreduce

Also, please suggest what the optimum values of g and k would be.
The number of features is more than a million.

