Ferdy (JIRA) |
[jira] [Commented] (NUTCH-1039) Fetcher fails for pages without content-length header |
Thu, 01 Sep, 08:30 |
Julien Nioche (JIRA) |
[jira] [Assigned] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 01 Sep, 11:00 |
Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 01 Sep, 11:02 |
|
[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
|
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 01 Sep, 11:04 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 01 Sep, 17:41 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 01 Sep, 17:45 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Thu, 29 Sep, 04:04 |
Julien Nioche (JIRA) |
[jira] [Resolved] (NUTCH-1073) Rename parameters 'fetcher.threads.per.host.by.ip' and 'fetcher.threads.per.host' |
Thu, 01 Sep, 13:09 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1073) Rename parameters 'fetcher.threads.per.host.by.ip' and 'fetcher.threads.per.host' |
Thu, 01 Sep, 13:27 |
Julien Nioche (JIRA) |
[jira] [Resolved] (NUTCH-1096) Empty (not null) ContentLength results in failure of fetch |
Thu, 01 Sep, 15:16 |
|
[jira] [Commented] (NUTCH-1102) Fetcher, rely on fetcher.parse directive only |
|
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1102) Fetcher, rely on fetcher.parse directive only |
Thu, 01 Sep, 17:10 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1102) Fetcher, rely on fetcher.parse directive only |
Tue, 06 Sep, 12:16 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1102) Fetcher, rely on fetcher.parse directive only |
Tue, 06 Sep, 12:24 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-1103) Port protocol-sftp to 1.4 |
Thu, 01 Sep, 18:58 |
|
Re: Page deletion and tracking change between crawlings |
|
Julio Garcés Teuber |
Re: Page deletion and tracking change between crawlings |
Fri, 02 Sep, 13:06 |
Markus Jelsma |
Re: Page deletion and tracking change between crawlings |
Fri, 02 Sep, 13:25 |
Julio Garcés Teuber |
Re: Page deletion and tracking change between crawlings |
Fri, 02 Sep, 14:16 |
|
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
|
Ferdy (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Fri, 02 Sep, 13:34 |
Ferdy (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Fri, 02 Sep, 14:02 |
Ferdy (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Thu, 15 Sep, 14:22 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Tue, 27 Sep, 13:59 |
Ferdy (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Tue, 27 Sep, 14:11 |
Ferdy (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Tue, 27 Sep, 14:13 |
Markus Jelsma |
Protocol not found or MalformedUrl protocol-sftp |
Fri, 02 Sep, 13:57 |
Mattmann, Chris A (388J) |
Re: Protocol not found or MalformedUrl protocol-sftp |
Fri, 02 Sep, 19:49 |
Ferdy (JIRA) |
[jira] [Commented] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Fri, 02 Sep, 14:00 |
|
[jira] [Issue Comment Edited] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
|
Ferdy (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Fri, 02 Sep, 14:00 |
Ferdy (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1097) application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml |
Fri, 02 Sep, 14:06 |
|
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
|
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:18 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:21 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:42 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:43 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:46 |
|
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
|
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:47 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 23 Sep, 14:01 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 23 Sep, 19:16 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 30 Sep, 18:35 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 30 Sep, 18:45 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney |
Fri, 30 Sep, 19:19 |
|
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
|
Apache Wiki |
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:48 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:56 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
Thu, 08 Sep, 20:23 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
Thu, 15 Sep, 18:58 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney |
Thu, 15 Sep, 18:59 |
|
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
|
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:52 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:55 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Fri, 02 Sep, 20:13 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Thu, 08 Sep, 20:08 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Thu, 08 Sep, 20:21 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Sat, 10 Sep, 17:06 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Fri, 23 Sep, 18:05 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "OldHadoopTutorial" by LewisJohnMcgibbney |
Fri, 02 Sep, 19:58 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "NutchHadoopTutorial" by LewisJohnMcgibbney |
Fri, 02 Sep, 20:10 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1592 |
Sat, 03 Sep, 04:06 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1593 |
Sun, 04 Sep, 04:13 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1594 |
Mon, 05 Sep, 04:13 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1595 |
Tue, 06 Sep, 04:12 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1596 |
Wed, 07 Sep, 04:14 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1597 |
Thu, 08 Sep, 04:05 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1598 |
Fri, 09 Sep, 04:12 |
|
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
|
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 06 Sep, 08:24 |
Tim Pease (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Mon, 12 Sep, 17:50 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Mon, 12 Sep, 20:33 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 12:38 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 12:54 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 13:26 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 14:14 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 14:56 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 20 Sep, 15:02 |
|
[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
|
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 06 Sep, 10:34 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Mon, 12 Sep, 13:45 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 13 Sep, 10:52 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Fri, 16 Sep, 12:04 |
Markus Jelsma (JIRA) |
[jira] [Assigned] (NUTCH-1052) Multiple deletes of the same URL using SolrClean |
Tue, 06 Sep, 11:35 |
|
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
|
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Tue, 06 Sep, 11:53 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Tue, 06 Sep, 12:18 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Tue, 06 Sep, 12:26 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Wed, 14 Sep, 11:55 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Wed, 14 Sep, 12:05 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Wed, 14 Sep, 12:19 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher |
Tue, 20 Sep, 04:16 |
|
[jira] [Commented] (NUTCH-1101) Options to purge db_gone records in updatedb |
|
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1101) Options to purge db_gone records in updatedb |
Tue, 06 Sep, 11:55 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1101) Options to purge db_gone records in updatedb |
Tue, 06 Sep, 12:26 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1028) Log parser keys |
Tue, 06 Sep, 11:57 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-987) Support HTTP auth for Solr communication |
Tue, 06 Sep, 11:59 |
Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-987) Support HTTP auth for Solr communication |
Tue, 06 Sep, 11:59 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-1104) Port issues from 1.x to trunk |
Tue, 06 Sep, 11:59 |
Markus Jelsma (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-987) Support HTTP auth for Solr communication |
Tue, 06 Sep, 11:59 |
Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-1057) Make fetcher thread time out configurable |
Tue, 06 Sep, 11:59 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1057) Make fetcher thread time out configurable |
Tue, 06 Sep, 11:59 |
|
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
|
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Tue, 06 Sep, 12:00 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Fri, 09 Sep, 11:18 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Mon, 12 Sep, 12:17 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Wed, 14 Sep, 11:01 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Mon, 19 Sep, 14:54 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Mon, 19 Sep, 15:27 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Sat, 24 Sep, 11:29 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1104) Port issues from 1.x to trunk |
Sat, 24 Sep, 16:27 |
Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-1036) Solr jobs should increment counters in Reporter |
Tue, 06 Sep, 12:02 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1036) Solr jobs should increment counters in Reporter |
Tue, 06 Sep, 12:02 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1101) Options to purge db_gone records in updatedb |
Tue, 06 Sep, 12:56 |
Ferdy Galema |
exposing generator.max.num.segments in nutch-default.xml and to Crawl command |
Tue, 06 Sep, 15:07 |
Markus Jelsma |
Re: exposing generator.max.num.segments in nutch-default.xml and to Crawl command |
Tue, 06 Sep, 15:13 |
|
[jira] [Commented] (NUTCH-1074) topN is ignored with maxNumSegments |
|
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1074) topN is ignored with maxNumSegments |
Wed, 07 Sep, 09:53 |
Robert Thomson (JIRA) |
[jira] [Commented] (NUTCH-1074) topN is ignored with maxNumSegments |
Sun, 18 Sep, 06:51 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1074) topN is ignored with maxNumSegments |
Thu, 22 Sep, 19:33 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-1074) topN is ignored with maxNumSegments |
Sat, 24 Sep, 04:07 |
|
[jira] [Issue Comment Edited] (NUTCH-1074) topN is ignored with maxNumSegments |
|
Markus Jelsma (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1074) topN is ignored with maxNumSegments |
Wed, 07 Sep, 09:55 |
Markus Jelsma (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1074) topN is ignored with maxNumSegments |
Wed, 07 Sep, 09:57 |
Robert Thomson (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1074) topN is ignored with maxNumSegments |
Sun, 18 Sep, 07:56 |