Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #337 |
Sun, 01 Jul, 05:31 |
Jianyun He (JIRA) |
[jira] [Created] (NUTCH-1416) Can not update the index |
Sun, 01 Jul, 08:10 |
Jianyun He (JIRA) |
[jira] [Updated] (NUTCH-1416) Can not update the index |
Sun, 01 Jul, 12:54 |
michael F |
Add me to the Mailing list |
Sun, 01 Jul, 14:48 |
Alexander Aristov |
nucth and mahout integration |
Sun, 01 Jul, 19:02 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #297 |
Mon, 02 Jul, 04:20 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1885 |
Mon, 02 Jul, 04:32 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #338 |
Mon, 02 Jul, 05:02 |
Mathijs Homminga |
Re: nucth and mahout integration |
Mon, 02 Jul, 06:08 |
Julien Nioche |
Re: nucth and mahout integration |
Mon, 02 Jul, 09:13 |
Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-1087) Deprecate crawl command and replace with example script |
Mon, 02 Jul, 12:14 |
Julien Nioche (JIRA) |
[jira] [Assigned] (NUTCH-1087) Deprecate crawl command and replace with example script |
Mon, 02 Jul, 12:16 |
Lewis John Mcgibbney |
Re: Add me to the Mailing list |
Mon, 02 Jul, 12:22 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1415) release packages to contain top level folder apache-nutch-x.x |
Mon, 02 Jul, 12:29 |
Lewis John Mcgibbney |
Re: Nutch Author, Publication, and Religion Detection |
Mon, 02 Jul, 12:32 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-1417) Remove o.a.n.metadata.Office |
Mon, 02 Jul, 12:37 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1415) release packages to contain top level folder apache-nutch-x.x |
Mon, 02 Jul, 17:47 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Mon, 02 Jul, 17:49 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 1.5.1 Release Candidate |
Mon, 02 Jul, 18:01 |
Arijit Mukherjee (JIRA) |
[jira] [Created] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Mon, 02 Jul, 18:07 |
Ken Krugler (JIRA) |
[jira] [Commented] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Mon, 02 Jul, 18:35 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Mon, 02 Jul, 18:50 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Mon, 02 Jul, 20:26 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 01:24 |
Arijit Mukherjee (JIRA) |
[jira] [Commented] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Tue, 03 Jul, 05:51 |
Apache Jenkins Server |
Jenkins build is back to normal : nutch-trunk-maven #339 |
Tue, 03 Jul, 08:33 |
Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1415) release packages to contain top level folder apache-nutch-x.x |
Tue, 03 Jul, 09:05 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 11:00 |
Sebastian Nagel (JIRA) |
[jira] [Created] (NUTCH-1419) parsechecker and indexchecker to report protocol status |
Tue, 03 Jul, 12:47 |
JAB |
Re: Nutch Author, Publication, and Religion Detection |
Tue, 03 Jul, 13:33 |
Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1419) parsechecker and indexchecker to report protocol status |
Tue, 03 Jul, 13:41 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 14:26 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 14:49 |
Lewis John Mcgibbney |
Re: Nutch Author, Publication, and Religion Detection |
Tue, 03 Jul, 14:55 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 15:16 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 16:12 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 18:11 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 18:24 |
Lewis John Mcgibbney |
[VOTE] Apache Nutch 1.5.1 RC#3 |
Tue, 03 Jul, 18:42 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Tue, 03 Jul, 18:49 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Wed, 04 Jul, 06:18 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Wed, 04 Jul, 07:50 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Wed, 04 Jul, 08:34 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Wed, 04 Jul, 08:36 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Wed, 04 Jul, 10:27 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-1420) Get rid of the dreaded � |
Wed, 04 Jul, 12:58 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1420) Get rid of the dreaded � |
Wed, 04 Jul, 12:58 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 13:08 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Wed, 04 Jul, 13:24 |
Ferdy Galema (JIRA) |
[jira] [Comment Edited] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 13:42 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Wed, 04 Jul, 14:30 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Wed, 04 Jul, 14:36 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 14:40 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Wed, 04 Jul, 14:46 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Wed, 04 Jul, 14:50 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Wed, 04 Jul, 16:17 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Wed, 04 Jul, 18:11 |
Lewis John McGibbney (JIRA) |
[jira] [Reopened] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 20:33 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #299 |
Thu, 05 Jul, 04:04 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1887 |
Thu, 05 Jul, 04:06 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Thu, 05 Jul, 09:25 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1233) Rely on Tika for outlink extraction |
Thu, 05 Jul, 09:33 |
Sebastian Nagel (JIRA) |
[jira] [Created] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Thu, 05 Jul, 09:47 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Thu, 05 Jul, 09:51 |
Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Thu, 05 Jul, 09:59 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Thu, 05 Jul, 10:33 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1414) Date extraction parse filter |
Thu, 05 Jul, 15:59 |
Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Thu, 05 Jul, 16:59 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Thu, 05 Jul, 18:18 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #342 |
Fri, 06 Jul, 05:53 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #300 |
Fri, 06 Jul, 05:58 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1888 |
Fri, 06 Jul, 06:09 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Fri, 06 Jul, 10:19 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Fri, 06 Jul, 10:52 |
Sebastian Nagel (JIRA) |
[jira] [Created] (NUTCH-1422) reset signature for redirects |
Fri, 06 Jul, 14:05 |
Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1422) reset signature for redirects |
Fri, 06 Jul, 14:09 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1414) Date extraction parse filter |
Fri, 06 Jul, 15:11 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1414) Date extraction parse filter |
Fri, 06 Jul, 15:43 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "ContributorsGroup" by LewisJohnMcgibbney |
Fri, 06 Jul, 20:36 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "AdminGroup" by LewisJohnMcgibbney |
Fri, 06 Jul, 20:36 |
Alexander Kingson (JIRA) |
[jira] [Updated] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Fri, 06 Jul, 20:54 |
Alexander Kingson (JIRA) |
[jira] [Comment Edited] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Fri, 06 Jul, 20:56 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Fri, 06 Jul, 21:33 |
Apache Wiki |
[Nutch Wiki] Update of "Support" by subhankarray |
Sat, 07 Jul, 00:45 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #343 |
Sat, 07 Jul, 05:01 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Sat, 07 Jul, 20:58 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 1.5.1 RC#3 |
Sat, 07 Jul, 21:07 |
Lewis John Mcgibbney |
[ANNOUNCEMENT] Apache Nutch v2.0 Release |
Sat, 07 Jul, 22:37 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 1.5.1 RC#3 |
Sat, 07 Jul, 22:42 |
Lewis John Mcgibbney |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Sat, 07 Jul, 22:44 |
Mattmann, Chris A (388J) |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Sat, 07 Jul, 22:47 |
Julien Nioche |
Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 |
Sun, 08 Jul, 04:41 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #344 |
Sun, 08 Jul, 05:02 |
Julien Nioche |
Re: [VOTE] Apache Nutch 1.5.1 RC#3 |
Sun, 08 Jul, 07:26 |
Sebastian Nagel |
Re: [VOTE] Apache Nutch 1.5.1 RC#3 |
Sun, 08 Jul, 19:55 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #345 |
Mon, 09 Jul, 05:02 |
Julien Nioche |
[PROPOSAL] Rename branch nutchgora into 2.x |
Mon, 09 Jul, 10:37 |
Ferdy Galema |
Re: [PROPOSAL] Rename branch nutchgora into 2.x |
Mon, 09 Jul, 11:02 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Mon, 09 Jul, 11:33 |
Ferdy Galema (JIRA) |
[jira] [Resolved] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Mon, 09 Jul, 11:40 |