Markus Jelsma |
RE: [RELEASE] Apache Nutch 1.11 |
Tue, 08 Dec, 09:26 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 03:39 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 10 Dec, 00:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 10 Dec, 03:03 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #3327 |
Thu, 10 Dec, 05:16 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #3328 |
Thu, 10 Dec, 06:42 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Nutch2Tutorial" by LewisJohnMcgibbney |
Fri, 04 Dec, 05:23 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney |
Fri, 04 Dec, 07:54 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney |
Fri, 04 Dec, 07:59 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney |
Fri, 04 Dec, 08:25 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney |
Tue, 08 Dec, 00:50 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney |
Tue, 08 Dec, 00:59 |
Auro Miralles (JIRA) |
[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1 |
Thu, 24 Dec, 08:51 |
Chris A. Mattmann (JIRA) |
[jira] [Commented] (NUTCH-2172) Parsing whitespace not just tabs in contenttype-mapping.txt |
Wed, 02 Dec, 14:48 |
Chris A. Mattmann (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Mon, 14 Dec, 05:07 |
Chris A. Mattmann (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:39 |
David Johnson (JIRA) |
[jira] [Created] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 19:47 |
David Johnson (JIRA) |
[jira] [Issue Comment Deleted] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 20:48 |
David Johnson (JIRA) |
[jira] [Commented] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 20:48 |
David Johnson (JIRA) |
[jira] [Commented] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 20:48 |
David Johnson (JIRA) |
[jira] [Updated] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 21:44 |
David Johnson (JIRA) |
[jira] [Updated] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 21:48 |
David Johnson (JIRA) |
[jira] [Updated] (NUTCH-2179) Cleanup job for SOLR Performance Boost |
Tue, 01 Dec, 21:49 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Created] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 03 Dec, 16:11 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 03 Dec, 16:15 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Updated] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 03:53 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 17:14 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Comment Edited] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 17:14 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 19:44 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Updated] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 19:48 |
Harshavardhan Manjunatha (JIRA) |
[jira] [Comment Edited] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 19:52 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 13:58 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2107) plugin.xml to validate against plugin.dtd |
Tue, 01 Dec, 21:49 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2107) plugin.xml to validate against plugin.dtd |
Tue, 01 Dec, 22:05 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2176) Clean up of log4j.properties |
Wed, 02 Dec, 12:55 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2172) index-more: document format of contenttype-mapping.txt |
Sun, 06 Dec, 21:55 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2042) parse-html increase chunk size used to detect charset |
Tue, 08 Dec, 22:48 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2042) parse-html increase chunk size used to detect charset |
Tue, 08 Dec, 23:01 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 10 Dec, 05:17 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Thu, 10 Dec, 05:17 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2182) Make reverseUrlDirs file dumper option hash the URL for consistency |
Wed, 16 Dec, 23:00 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2189) Domain filter must deactivate if no rules are present |
Thu, 24 Dec, 13:55 |
Jon.P |
Deploy a Nutch crawler or use Webhose.io? |
Mon, 14 Dec, 08:40 |
Julien Nioche |
Re: [VOTE] Release Apache Nutch 1.11 RC#2 |
Sat, 05 Dec, 09:49 |
Julien Nioche |
Re: [RELEASE] Apache Nutch 1.11 |
Tue, 08 Dec, 09:20 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 10:43 |
Julien Nioche (JIRA) |
[jira] [Comment Edited] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 11:43 |
Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 11:48 |
Julien Nioche (JIRA) |
[jira] [Resolved] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 12:49 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2172) Parsing whitespace not just tabs in contenttype-mapping.txt |
Thu, 03 Dec, 08:25 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2172) Parsing whitespace not just tabs in contenttype-mapping.txt |
Thu, 03 Dec, 08:26 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2172) Parsing whitespace not just tabs in contenttype-mapping.txt |
Thu, 03 Dec, 23:13 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2128) Refactor configuration end point |
Fri, 04 Dec, 07:12 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2149) REST endpoint to read Nutch sequence files |
Fri, 04 Dec, 07:12 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2178) DeduplicationJob to optionall group on host or domain |
Fri, 04 Dec, 07:14 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-2181) Add Webpage for 3rd Party Connectors/Libraries to Apache Nutch |
Tue, 08 Dec, 01:50 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2181) Add Webpage for 3rd Party Connectors/Libraries to Apache Nutch |
Tue, 08 Dec, 01:51 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Wed, 09 Dec, 03:01 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Wed, 09 Dec, 03:04 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Wed, 09 Dec, 03:10 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Wed, 09 Dec, 17:20 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Thu, 10 Dec, 00:14 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-2180) FileDumper dumps data, but breaks midway on corrupt segments |
Thu, 10 Dec, 03:04 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-2183) Improvement to SegmentChecker for skipping non-segments present in segments directory |
Thu, 10 Dec, 03:06 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Sat, 12 Dec, 02:37 |
Lewis John McGibbney (JIRA) |
[jira] [Work started] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Sat, 12 Dec, 02:37 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Sat, 12 Dec, 02:37 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-2185) protocol-soda-consumer plugin |
Mon, 14 Dec, 00:01 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Mon, 14 Dec, 20:49 |
Lewis John McGibbney (JIRA) |
[jira] [Work stopped] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:13 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:15 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:15 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:16 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:19 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-2186) -addBinaryContent flag can cause "String length must be a multiple of four" error in IndexingJob |
Tue, 15 Dec, 22:19 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 15 Dec, 22:24 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 04:14 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 04:39 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 14:20 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 15:33 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1 |
Tue, 29 Dec, 21:26 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Tue, 29 Dec, 23:57 |
Lewis John Mcgibbney |
Dropping Nutch 1.11RC#1 Artifacts |
Fri, 04 Dec, 07:51 |
Lewis John Mcgibbney |
[VOTE] Release Apache Nutch 1.11 RC#2 |
Fri, 04 Dec, 18:03 |
Lewis John Mcgibbney |
[RESULT] WAS Re: [VOTE] Release Apache Nutch 1.11 RC#2 |
Tue, 08 Dec, 00:41 |
Lewis John Mcgibbney |
Fwd: ApacheCon NA 2015 Travel Assistance Applications now open! |
Tue, 08 Dec, 04:21 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-2177) Generator produces only one partition even in distributed mode |
Tue, 01 Dec, 11:53 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-2176) Clean up of log4j.properties |
Wed, 02 Dec, 12:41 |
Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-2176) Clean up of log4j.properties |
Wed, 02 Dec, 12:41 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters |
Tue, 08 Dec, 13:03 |
Markus Jelsma (JIRA) |
[jira] [Reopened] (NUTCH-1995) Add support for wildcard to http.robot.rules.whitelist |
Thu, 10 Dec, 15:36 |
Markus Jelsma (JIRA) |
[jira] [Comment Edited] (NUTCH-1995) Add support for wildcard to http.robot.rules.whitelist |
Thu, 10 Dec, 15:37 |
Markus Jelsma (JIRA) |
[jira] [Closed] (NUTCH-1995) Add support for wildcard to http.robot.rules.whitelist |
Thu, 10 Dec, 16:01 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters |
Thu, 10 Dec, 16:32 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 09:34 |
Markus Jelsma (JIRA) |
[jira] [Comment Edited] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 09:43 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb |
Wed, 16 Dec, 14:26 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-2188) While crawling with solr url (kerberos enabled) Error: org.apache.solr.common.SolrException: Unauthorized |
Thu, 17 Dec, 10:52 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-2188) While crawling with solr url (kerberos enabled) Error: org.apache.solr.common.SolrException: Unauthorized |
Fri, 18 Dec, 13:59 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-2189) Domain filter must deactivate if no rules are present |
Mon, 21 Dec, 12:34 |