nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Jelsma <markus.jel...@openindex.io>
Subject RE: [ANNOUNCE] Apache Nutch 1.12 Release
Date Tue, 21 Jun 2016 11:54:58 GMT
To those who upgrade,

The release announcement is missing some additional upgrade notes.  If you use the db.ignore.internal|external.links
parameters, read the points below.

Regards,
Markus

---------------------------------------------

Fellow committers, Nutch 1.12 contains a breaking change NUTCH-2220. Please use the note below
and
in the release announcement and keep it on top in this CHANGES.txt for the Nutch 1.12 release.

* replace your old conf/nutch-default.xml with the conf/nutch-default.xml from Nutch 1.12
release

* if you use LinkDB (e.g. invertlinks) and modified parameters db.max.inlinks and/or db.max.anchor.length
  and/or db.ignore.internal.links, rename those parameters to linkdb.max.inlinks and
  linkdb.max.anchor.length and linkdb.ignore.internal.links

* db.ignore.internal.links and db.ignore.external.links now operate on the CrawlDB only

* linkdb.ignore.internal.links and linkdb.ignore.external.links now operate on the LinkDB
only

 
-----Original message-----
> From:lewis john mcgibbney <lewismc@apache.org>
> Sent: Monday 20th June 2016 4:01
> To: user@nutch.apache.org; dev@nutch.apache.org; announce@apache.org
> Subject: [ANNOUNCE] Apache Nutch 1.12 Release
> 
> The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.12,
we advise all
 
> current users and developers of the 1.X series to upgrade to this release. 
> Nutch is a well matured, production ready Web crawler. Nutch 1.x enables 
 
>      fine grained configuration, relying on Apache Hadoop™ 
 
>      data structures, which are great for batch processing.
> This release is the result of many months of work and over 40 issues 
 
> addressed. For a complete overview of these issues please see the
 
> release report <https://s.apache.org/nutch1.12>. 
> As usual in the 1.X series, release artifacts are made available as both source and binary
and also available within
 
> Maven Central <http://search.maven.org/#search%7Cgav%7C1%7Cg%3A%22org.apache.nutch%22%20AND%20a%3A%22nutch%22>
as a Maven dependency.
 
> The release is available from our DOWNLOADS PAGE <http://nutch.apache.org/downloads.html>.

> The Nutch DOAP can be found at https://svn.apache.org/repos/asf/nutch/cms_site/trunk/content/doap.rdf
<https://svn.apache.org/repos/asf/nutch/cms_site/trunk/content/doap.rdf>
> Lewis
> (On behalf of the Nutch PMC)

Mime
View raw message