Nutch is a mature, production-ready Web crawler. Nutch 1.x enables
fine-grained configuration and relies on Apache Hadoop™
data structures, which are well suited to batch processing.
The Nutch DOAP can be found at . An account of the CHANGES in this release is also available.
As usual in the 1.x series, release artifacts are available as both source and binary distributions, and through
Maven Central as a Maven dependency.
The release is available from our DOWNLOADS PAGE.