nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From viz <>
Subject setting number of reduce outputs problem
Date Sat, 12 Jan 2008 00:05:11 GMT

In our hadoop cluster I use a configuration (set in hadoop-site.xml) to have
mapred.reduce.tasks=2 by default.
However, I have few jobs were I need exactly one output from reduce (i.e.
just part-00000). I thought its staightforward:

JobConf job = new NutchJob(getConf());

But it seem any settings done this way are just ignored. Is that ok? Even
official examples say it should work. Could it be we misconfigured something
Or is there any other way to get one data file as output? 

View this message in context:
Sent from the Nutch - Dev mailing list archive at

View raw message