nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lewis John McGibbney <lewi...@apache.org>
Subject Re: [DISCUSS] Replacing MapReduce with Tez
Date Tue, 22 Dec 2020 04:19:09 GMT
Hi dev@,
I've documented my Tez journey so far at https://cwiki.apache.org/confluence/display/NUTCH/Running+Nutch+on+Tez
Things are getting quite interesting. 
Please share any experiences using Nutch on Tez or improvements to the documentation especially
any experiments you can document.
Thank you

On 2020/12/10 07:46:30, lewis john mcgibbney <lewismc@apache.org> wrote: 
> Hi dev@,
> A while ago I had thought about bringing this topic up... I then got
> busy... for ages. I'll therefore get straight to the point.
> Has anyone on the dev@ team had an experience using Apache Tez -
> tez.apache.org?
> Tez promises multiple improvements over MapReduce. Naturally I wondered
> whether the Nutch project is at a stage of maturity now that we would look
> to leverage something more performant than legacy MapReduce.
> Were we to consider evolving Nutch by re-architecting it to use Tez as the
> processing engine, this would be a significant work effort.
> I just wanted to throw this out there for some blue-sky feedback.
> Thanks
> lewismc
> 
> -- 
> http://home.apache.org/~lewismc/
> http://people.apache.org/keys/committer/lewismc
> 

Mime
View raw message