nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lewis John McGibbney <>
Subject Re: [DISCUSS] Replacing MapReduce with Tez
Date Tue, 22 Dec 2020 04:19:09 GMT
Hi dev@,
I've documented my Tez journey so far at
Things are getting quite interesting. 
Please share any experiences using Nutch on Tez or improvements to the documentation especially
any experiments you can document.
Thank you

On 2020/12/10 07:46:30, lewis john mcgibbney <> wrote: 
> Hi dev@,
> A while ago I had thought about bringing this topic up... I then got
> busy... for ages. I'll therefore get straight to the point.
> Has anyone on the dev@ team had an experience using Apache Tez -
> Tez promises multiple improvements over MapReduce. Naturally I wondered
> whether the Nutch project is at a stage of maturity now that we would look
> to leverage something more performant than legacy MapReduce.
> Were we to consider evolving Nutch by re-architecting it to use Tez as the
> processing engine, this would be a significant work effort.
> I just wanted to throw this out there for some blue-sky feedback.
> Thanks
> lewismc
> -- 

View raw message