nutch-dev mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "MapReduce" by AndreRicardo
Date Wed, 15 Sep 2010 13:57:09 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "MapReduce" page has been changed by AndreRicardo.
http://wiki.apache.org/nutch/MapReduce?action=diff&rev1=6&rev2=7

--------------------------------------------------

  
   * In essence, it allows massive data sets to be processed in a distributed fashion by breaking
the processing into many small computations of two types:
    1. A Map operation that transforms the input into an intermediate representation.
-   2. A Reduce function that recombines the intermediate representation into the final output.
+   1. A Reduce function that recombines the intermediate representation into the final output.
  
-  * This processing model is ideal for the operations a search engine indexer like Nutch
or Google needs to perform - like computing inlinks for URLs, or building inverted indexes
- and it will [[http://wiki.apache.org/nutch-data/attachments/Presentations/attachments/mapred.pdf|"transform
Nutch"]] into a scalable, distributed search engine.
+  * This processing model is ideal for the operations a search engine indexer like Nutch
or Google needs to perform - like computing inlinks for URLs, or building inverted indexes
- and it will [[attachment:Presentations/mapred.pdf|"transform Nutch"]] into a scalable, distributed
search engine.
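
The two-step model described in the diff above can be illustrated with the inlink computation it mentions. What follows is only a minimal single-process sketch, not Nutch's or Hadoop's actual code: the names map_outlinks, reduce_inlinks and run_mapreduce, and the example URLs, are hypothetical and chosen for illustration. A real MapReduce job would distribute the map and reduce calls across many machines.

```python
from collections import defaultdict
from typing import Dict, Iterator, List, Tuple


def map_outlinks(page_url: str, outlinks: List[str]) -> Iterator[Tuple[str, str]]:
    """Map step (hypothetical name): emit one (target, source) pair per outlink."""
    for target in outlinks:
        yield target, page_url


def reduce_inlinks(target: str, sources: List[str]) -> Tuple[str, List[str]]:
    """Reduce step (hypothetical name): merge all sources seen for one target."""
    return target, sorted(set(sources))


def run_mapreduce(pages: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Run both steps in one process, grouping map output by key in between."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for url, outlinks in pages.items():
        for key, value in map_outlinks(url, outlinks):
            grouped[key].append(value)  # intermediate representation, keyed by target
    return dict(reduce_inlinks(key, values) for key, values in grouped.items())


# Made-up input: each page URL mapped to the URLs it links to.
pages = {
    "http://a.example/": ["http://b.example/", "http://c.example/"],
    "http://b.example/": ["http://c.example/"],
    "http://c.example/": ["http://a.example/"],
}

inlinks = run_mapreduce(pages)
for target, sources in sorted(inlinks.items()):
    print(target, "<-", sources)
```

Each map call looks at a single page and each reduce call at a single target URL, so neither needs to see the whole data set. That independence is what lets the work be split into many small computations.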
  
