Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #117 |
Sun, 01 Jan, 04:15 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1711 |
Sun, 01 Jan, 04:16 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1210) DomainBlacklistFilter |
Mon, 02 Jan, 10:38 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1239) Webgraph should remove deleted pages from segment input |
Mon, 02 Jan, 11:44 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1240) Domain blacklist URL filter |
Mon, 02 Jan, 11:54 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1210) DomainBlacklistFilter |
Mon, 02 Jan, 11:56 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1212) ParseOutputFormat has redundant code |
Mon, 02 Jan, 11:58 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1017) Exception getting mime type by name |
Mon, 02 Jan, 12:00 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1041) Not reading mime-type correctly |
Mon, 02 Jan, 12:00 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1064) o.a.n.util.MimeUtil uses deprecated Tika methods |
Mon, 02 Jan, 12:02 |
Markus Jelsma (Closed) (JIRA) |
[jira] [Closed] (NUTCH-1106) Options to skip url's based on length |
Mon, 02 Jan, 12:08 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1106) Options to skip url's based on length |
Mon, 02 Jan, 12:08 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1138) remove LogUtil from trunk and nutch gora |
Mon, 02 Jan, 12:12 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1232) Remove host|site fields from index-basic |
Mon, 02 Jan, 12:12 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1239) Webgraph should remove deleted pages from segment input |
Mon, 02 Jan, 13:12 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1232) Remove host field from index-basic |
Mon, 02 Jan, 13:14 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1232) Remove host field from index-basic |
Mon, 02 Jan, 13:18 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1239) Webgraph should remove deleted pages from segment input |
Mon, 02 Jan, 14:06 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1232) Remove host field from index-basic |
Mon, 02 Jan, 14:06 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1713 |
Tue, 03 Jan, 04:10 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1232) Remove host field from index-basic |
Tue, 03 Jan, 04:12 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1239) Webgraph should remove deleted pages from segment input |
Tue, 03 Jan, 04:12 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1138) remove LogUtil from trunk and nutch gora |
Tue, 03 Jan, 13:11 |
Markus Jelsma |
What to do with items for which is no parser? |
Tue, 03 Jan, 17:18 |
Lewis John Mcgibbney |
Re: What to do with items for which is no parser? |
Tue, 03 Jan, 21:43 |
Markus Jelsma |
Re: What to do with items for which is no parser? |
Tue, 03 Jan, 22:12 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 04:20 |
Markus Jelsma (Created) (JIRA) |
[jira] [Created] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 07:59 |
Julien Nioche (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 09:11 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 09:39 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 09:52 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 09:57 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 10:01 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 10:11 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 14:14 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records |
Wed, 04 Jan, 14:30 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 16:45 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:12 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:16 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:28 |
Lewis John Mcgibbney |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:35 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:41 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 17:44 |
Lewis John Mcgibbney |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Wed, 04 Jan, 18:43 |
Edward Drapkin (Created) (JIRA) |
[jira] [Created] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment |
Wed, 04 Jan, 22:31 |
Edward Drapkin (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment |
Wed, 04 Jan, 22:33 |
X Yang (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1220) Upgrade Solr deps |
Thu, 05 Jan, 00:08 |
X Yang (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (NUTCH-1220) Upgrade Solr deps |
Thu, 05 Jan, 00:14 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1715 |
Thu, 05 Jan, 04:17 |
Julien Nioche (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1146) Get rid of _success files in webgraph code |
Thu, 05 Jan, 11:07 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 11:14 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 11:28 |
Julien Nioche (Created) (JIRA) |
[jira] [Created] (NUTCH-1243) Junit jar removed from lib |
Thu, 05 Jan, 11:31 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 11:35 |
Julien Nioche (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1243) Junit jar removed from lib |
Thu, 05 Jan, 11:41 |
Markus Jelsma |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 11:42 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1146) Get rid of _success files in webgraph code |
Thu, 05 Jan, 12:03 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1716 |
Thu, 05 Jan, 12:48 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1243) Junit jar removed from lib |
Thu, 05 Jan, 12:49 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1146) Get rid of _success files in webgraph code |
Thu, 05 Jan, 12:49 |
Lewis John Mcgibbney |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 12:50 |
Julien Nioche |
Re: Build failed in Jenkins: Nutch-trunk #1714 |
Thu, 05 Jan, 12:52 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Thu, 05 Jan, 13:11 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1243) Junit jar removed from lib |
Thu, 05 Jan, 13:13 |
Markus Jelsma (Created) (JIRA) |
[jira] [Created] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 14:11 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 14:15 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 14:47 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 14:57 |
Lewis John McGibbney (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1237) Improve javac arguements for more verbose output |
Thu, 05 Jan, 15:05 |
Lewis John McGibbney (Closed) (JIRA) |
[jira] [Closed] (NUTCH-1237) Improve javac arguements for more verbose output |
Thu, 05 Jan, 15:05 |
Lewis John McGibbney (Closed) (JIRA) |
[jira] [Closed] (NUTCH-1236) Add link to site documentation to download older versions of Nutch. |
Thu, 05 Jan, 15:07 |
Lewis John McGibbney (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1236) Add link to site documentation to download older versions of Nutch. |
Thu, 05 Jan, 15:07 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 15:19 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Thu, 05 Jan, 15:25 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1244) CrawlDBDumper to filter by regex |
Thu, 05 Jan, 15:48 |
Sebastian Nagel (Created) (JIRA) |
[jira] [Created] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Thu, 05 Jan, 16:23 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Thu, 05 Jan, 16:49 |
Ian Piper (Commented) (JIRA) |
[jira] [Commented] (NUTCH-827) HTTP POST Authentication |
Thu, 05 Jan, 17:11 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-926) Nutch follows wrong url in <META http-equiv="refresh" tag |
Thu, 05 Jan, 17:21 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora |
Thu, 05 Jan, 17:26 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 04:10 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 04:24 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-827) HTTP POST Authentication |
Fri, 06 Jan, 10:29 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Fri, 06 Jan, 10:29 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment |
Fri, 06 Jan, 10:29 |
Markus Jelsma |
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 14:03 |
Lewis John Mcgibbney |
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 14:14 |
Mattmann, Chris A (388J) |
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 14:19 |
Markus Jelsma |
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 14:29 |
Lewis John Mcgibbney |
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output |
Fri, 06 Jan, 14:30 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #124 |
Sun, 08 Jan, 04:16 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #125 |
Mon, 09 Jan, 04:11 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex |
Mon, 09 Jan, 14:42 |
Lewis John McGibbney (Updated) (JIRA) |
[jira] [Updated] (NUTCH-840) Port tests from parse-html to parse-tika |
Mon, 09 Jan, 14:52 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1139) Indexer to delete documents |
Mon, 09 Jan, 15:50 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex |
Mon, 09 Jan, 15:56 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1244) CrawlDBDumper to filter by regex |
Mon, 09 Jan, 16:02 |
Markus Jelsma |
edit wiki? |
Mon, 09 Jan, 16:04 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "AdminGroup" by LewisJohnMcgibbney |
Mon, 09 Jan, 16:10 |
Lewis John Mcgibbney |
Re: edit wiki? |
Mon, 09 Jan, 16:10 |