Markus Jelsma |
RE: Apache Nutch being used at National Snow and Ice Data Center: ESIP Federation |
Tue, 17 Jul, 22:27 |
Markus Jelsma |
RE: Apache Nutch being used at National Snow and Ice Data Center: ESIP Federation |
Wed, 18 Jul, 21:18 |
Alexander Aristov |
nucth and mahout integration |
Sun, 01 Jul, 19:02 |
Alexander Kingson (JIRA) |
[jira] [Updated] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Fri, 06 Jul, 20:54 |
Alexander Kingson (JIRA) |
[jira] [Comment Edited] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Fri, 06 Jul, 20:56 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #337 |
Sun, 01 Jul, 05:31 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #297 |
Mon, 02 Jul, 04:20 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1885 |
Mon, 02 Jul, 04:32 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #338 |
Mon, 02 Jul, 05:02 |
Apache Jenkins Server |
Jenkins build is back to normal : nutch-trunk-maven #339 |
Tue, 03 Jul, 08:33 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #299 |
Thu, 05 Jul, 04:04 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1887 |
Thu, 05 Jul, 04:06 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #342 |
Fri, 06 Jul, 05:53 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #300 |
Fri, 06 Jul, 05:58 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1888 |
Fri, 06 Jul, 06:09 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #343 |
Sat, 07 Jul, 05:01 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #344 |
Sun, 08 Jul, 05:02 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #345 |
Mon, 09 Jul, 05:02 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #346 |
Tue, 10 Jul, 10:05 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #304 |
Tue, 10 Jul, 10:35 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #347 |
Tue, 10 Jul, 17:45 |
Apache Jenkins Server |
Build failed in Jenkins: nutch-trunk-maven #348 |
Tue, 10 Jul, 22:16 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #305 |
Wed, 11 Jul, 04:07 |
Apache Jenkins Server |
Jenkins build is back to normal : nutch-trunk-maven #349 |
Wed, 11 Jul, 05:04 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #306 |
Thu, 12 Jul, 05:08 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #307 |
Fri, 13 Jul, 04:15 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #308 |
Sat, 14 Jul, 04:05 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1896 |
Sat, 14 Jul, 04:07 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #309 |
Sun, 15 Jul, 04:06 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1897 |
Sun, 15 Jul, 04:18 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #310 |
Mon, 16 Jul, 04:06 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #311 |
Tue, 17 Jul, 04:09 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #312 |
Wed, 18 Jul, 04:06 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #313 |
Thu, 19 Jul, 04:11 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #314 |
Fri, 20 Jul, 04:09 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1904 |
Sun, 22 Jul, 04:05 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1905 |
Mon, 23 Jul, 04:17 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1908 |
Thu, 26 Jul, 04:06 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1909 |
Fri, 27 Jul, 04:20 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "ContributorsGroup" by LewisJohnMcgibbney |
Fri, 06 Jul, 20:36 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "AdminGroup" by LewisJohnMcgibbney |
Fri, 06 Jul, 20:36 |
Apache Wiki |
[Nutch Wiki] Update of "Support" by subhankarray |
Sat, 07 Jul, 00:45 |
Apache Wiki |
[Nutch Wiki] Update of "FAQ" by JulienNioche |
Mon, 09 Jul, 15:35 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Tue, 10 Jul, 20:46 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Tue, 10 Jul, 20:46 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunNutchInEclipse" by SebastianNagel |
Sun, 22 Jul, 19:24 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "RunNutchInEclipse" by LewisJohnMcgibbney |
Tue, 24 Jul, 14:13 |
Apache Wiki |
[Nutch Wiki] Update of "Presentations" by JulienNioche |
Thu, 26 Jul, 15:00 |
Apache Wiki |
[Nutch Wiki] Update of "Presentations" by JulienNioche |
Thu, 26 Jul, 15:01 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Presentations" by JulienNioche |
Thu, 26 Jul, 15:02 |
Arijit Mukherjee (JIRA) |
[jira] [Created] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Mon, 02 Jul, 18:07 |
Arijit Mukherjee (JIRA) |
[jira] [Commented] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ |
Tue, 03 Jul, 05:51 |
Ferdy Galema |
Re: [PROPOSAL] Rename branch nutchgora into 2.x |
Mon, 09 Jul, 11:02 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Wed, 04 Jul, 08:34 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Wed, 04 Jul, 08:36 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 13:08 |
Ferdy Galema (JIRA) |
[jira] [Comment Edited] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 13:42 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Wed, 04 Jul, 14:40 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Mon, 09 Jul, 11:33 |
Ferdy Galema (JIRA) |
[jira] [Resolved] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Mon, 09 Jul, 11:40 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter |
Mon, 09 Jul, 11:42 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter |
Mon, 09 Jul, 11:44 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter |
Mon, 09 Jul, 11:44 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1424) fix fetcher timelimit logging |
Mon, 09 Jul, 11:48 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1424) fix fetcher timelimit logging |
Mon, 09 Jul, 11:50 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1424) fix fetcher timelimit logging |
Mon, 09 Jul, 11:50 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice |
Mon, 09 Jul, 11:54 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice |
Mon, 09 Jul, 11:54 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice |
Mon, 09 Jul, 11:54 |
Ferdy Galema (JIRA) |
[jira] [Resolved] (NUTCH-1025) Add option not to commit to Solr |
Mon, 09 Jul, 11:57 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size |
Mon, 09 Jul, 11:57 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1426) HostDb close() should close store instead of flush |
Mon, 09 Jul, 12:15 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1426) HostDb close() should close store instead of flush |
Mon, 09 Jul, 12:17 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1426) HostDb close() should close store instead of flush |
Mon, 09 Jul, 12:19 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Mon, 09 Jul, 12:46 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-628) Host database to keep track of host-level information |
Mon, 09 Jul, 13:54 |
Ferdy Galema (JIRA) |
[jira] [Resolved] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Mon, 09 Jul, 15:23 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1411) nutchgora fetcher.store.content does not work |
Mon, 09 Jul, 15:23 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1427) Reuse SelectorEntry in Generator. |
Tue, 10 Jul, 14:23 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1427) Reuse SelectorEntry in Generator. |
Tue, 10 Jul, 14:27 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1427) Reuse SelectorEntry in Generator. |
Tue, 10 Jul, 14:27 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled |
Tue, 10 Jul, 15:00 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled |
Tue, 10 Jul, 15:02 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled |
Tue, 10 Jul, 15:02 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling |
Tue, 10 Jul, 20:52 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator |
Wed, 18 Jul, 10:19 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator |
Wed, 18 Jul, 10:23 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator |
Wed, 18 Jul, 11:40 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Wed, 18 Jul, 14:02 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Wed, 18 Jul, 14:25 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Wed, 18 Jul, 14:25 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1432) property storage.schema does not work anymore, should be storage.schema.webpage and storage.schema.host |
Thu, 19 Jul, 07:35 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Thu, 19 Jul, 10:01 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Fri, 20 Jul, 08:31 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol |
Wed, 25 Jul, 12:51 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol |
Wed, 25 Jul, 12:51 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol |
Wed, 25 Jul, 12:53 |
Ferdy Galema (JIRA) |
[jira] [Reopened] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol |
Wed, 25 Jul, 12:53 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol |
Wed, 25 Jul, 12:53 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Wed, 25 Jul, 15:25 |