nutch-dev mailing list archives

From Markus Jelsma <>
Subject RE: [ANNOUNCE] Apache Nutch 1.12 Release
Date Tue, 21 Jun 2016 11:54:58 GMT
To those who upgrade,

The release announcement is missing some additional upgrade notes.  If you use the db.ignore.internal|external.links
parameters, read the points below.



Fellow committers, Nutch 1.12 contains a breaking change, NUTCH-2220. Please use the note below
in the release announcement and keep it at the top of the CHANGES.txt for the Nutch 1.12 release.

* replace your old conf/nutch-default.xml with the conf/nutch-default.xml from Nutch 1.12

* if you use LinkDB (e.g. invertlinks) and modified any of the parameters db.max.inlinks,
  db.max.anchor.length or db.ignore.internal.links, rename them to linkdb.max.inlinks,
  linkdb.max.anchor.length and linkdb.ignore.internal.links respectively

* db.ignore.internal.links and db.ignore.external.links now operate on the CrawlDB only

* linkdb.ignore.internal.links and linkdb.ignore.external.links now operate on the LinkDB
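
For those with many overrides, the renames listed above could be applied with a small script. What follows is only a sketch, not part of the release: it assumes your overrides live in a Hadoop-style configuration file such as conf/nutch-site.xml, and the function name and sample content are made up for illustration. Note that db.ignore.internal.links still exists in 1.12 with CrawlDB-only meaning, so only rename it if you meant it for the LinkDB. The standard parser used here also drops XML comments, so review the result before replacing your file.

```python
import xml.etree.ElementTree as ET

# Renames taken from the upgrade note above (NUTCH-2220).
RENAMES = {
    "db.max.inlinks": "linkdb.max.inlinks",
    "db.max.anchor.length": "linkdb.max.anchor.length",
    "db.ignore.internal.links": "linkdb.ignore.internal.links",
}


def rename_linkdb_properties(xml_text):
    """Rename the old LinkDB property names in a configuration document.

    Hypothetical helper: returns the rewritten XML text and the list of
    old names that were found and renamed.
    """
    root = ET.fromstring(xml_text)
    renamed = []
    for prop in root.iter("property"):
        name = prop.find("name")
        if name is None or not name.text:
            continue
        old = name.text.strip()
        if old in RENAMES:
            name.text = RENAMES[old]
            renamed.append(old)
    return ET.tostring(root, encoding="unicode"), renamed


# Illustrative input only; the values are not recommendations.
SAMPLE = """<configuration>
  <property><name>db.max.inlinks</name><value>5000</value></property>
  <property><name>db.ignore.external.links</name><value>true</value></property>
</configuration>"""

new_xml, renamed = rename_linkdb_properties(SAMPLE)
print(renamed)
print(new_xml)
```

db.ignore.external.links is deliberately left alone: the note above does not list it as renamed, and it keeps operating on the CrawlDB. If you also want external links ignored in the LinkDB, set linkdb.ignore.external.links yourself.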

-----Original message-----
> From:lewis john mcgibbney <>
> Sent: Monday 20th June 2016 4:01
> To:;;
> Subject: [ANNOUNCE] Apache Nutch 1.12 Release
> The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.12,
> we advise all current users and developers of the 1.X series to upgrade to this release.
> Nutch is a well matured, production ready Web crawler. Nutch 1.x enables
> fine grained configuration, relying on Apache Hadoop™
> data structures, which are great for batch processing.
> This release is the result of many months of work and over 40 issues
> addressed. For a complete overview of these issues please see the
> release report <>.
> As usual in the 1.X series, release artifacts are made available as both source and binary
> and also available within Maven Central <> as a Maven dependency.
> The release is available from our DOWNLOADS PAGE <>.

> The Nutch DOAP can be found at
> Lewis
> (On behalf of the Nutch PMC)
