Markus Jelsma |
RE: Recent stackoverflow questions |
Tue, 05 Apr, 23:03 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Fri, 01 Apr, 23:14 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Fri, 01 Apr, 23:15 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Fri, 01 Apr, 23:22 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Sun, 03 Apr, 02:18 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Sun, 03 Apr, 02:41 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Mon, 04 Apr, 06:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_ |
Thu, 07 Apr, 15:16 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2248) CSS parser plugin |
Thu, 07 Apr, 21:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2248) CSS parser plugin |
Mon, 11 Apr, 17:11 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2248) CSS parser plugin |
Mon, 11 Apr, 17:12 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Mon, 11 Apr, 19:25 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Mon, 11 Apr, 19:25 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Mon, 11 Apr, 19:26 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Wed, 13 Apr, 18:29 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Thu, 14 Apr, 09:45 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Fri, 15 Apr, 03:20 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Fri, 15 Apr, 05:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Sun, 17 Apr, 22:32 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Sun, 17 Apr, 22:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Mon, 18 Apr, 07:48 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Mon, 18 Apr, 09:44 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Mon, 25 Apr, 13:24 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Wed, 27 Apr, 21:02 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #3358 |
Mon, 04 Apr, 07:57 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #1551 |
Thu, 07 Apr, 15:44 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #1552 |
Wed, 13 Apr, 18:45 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #3359 |
Sun, 17 Apr, 22:44 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #3360 |
Mon, 18 Apr, 10:56 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #3361 |
Wed, 20 Apr, 21:03 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #1553 |
Wed, 20 Apr, 21:28 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #1554 |
Fri, 29 Apr, 17:39 |
Apache Wiki |
[Nutch Wiki] Update of "AdvancedAjaxInteraction" by ChrisMattmann |
Wed, 13 Apr, 05:42 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "bin/nutch_generate" by SebastianNagel |
Tue, 26 Apr, 11:47 |
Bhavya Sanghavi |
Reg. License of Princeton WordNet |
Fri, 01 Apr, 23:07 |
Bhavya Sanghavi (JIRA) |
[jira] [Created] (NUTCH-2249) WordNet Integration for Cosine Similarity |
Tue, 12 Apr, 22:16 |
Bhavya Sanghavi (JIRA) |
[jira] [Commented] (NUTCH-2249) WordNet Integration for Cosine Similarity |
Tue, 12 Apr, 22:18 |
BlackIce |
Re: Maven Central Plugins |
Sun, 10 Apr, 14:26 |
Chris A. Mattmann (JIRA) |
[jira] [Updated] (NUTCH-2191) Add protocol-htmlunit |
Fri, 08 Apr, 16:13 |
Chris A. Mattmann (JIRA) |
[jira] [Assigned] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Fri, 15 Apr, 05:36 |
Chris A. Mattmann (JIRA) |
[jira] [Work started] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Fri, 15 Apr, 05:36 |
Chris A. Mattmann (JIRA) |
[jira] [Resolved] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Sun, 17 Apr, 22:33 |
Chris A. Mattmann (JIRA) |
[jira] [Updated] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Sun, 17 Apr, 22:33 |
Chris A. Mattmann (JIRA) |
[jira] [Resolved] (NUTCH-2191) Add protocol-htmlunit |
Sun, 17 Apr, 22:37 |
Federico Bonelli (JIRA) |
[jira] [Commented] (NUTCH-1785) Ability to index raw content |
Wed, 20 Apr, 08:09 |
Federico Bonelli (JIRA) |
[jira] [Created] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Thu, 21 Apr, 12:36 |
Federico Bonelli (JIRA) |
[jira] [Updated] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Thu, 21 Apr, 12:41 |
Federico Bonelli (JIRA) |
[jira] [Comment Edited] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Thu, 21 Apr, 12:42 |
Federico Bonelli (JIRA) |
[jira] [Commented] (NUTCH-1785) Ability to index raw content |
Thu, 21 Apr, 12:43 |
Federico Bonelli (JIRA) |
[jira] [Commented] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Tue, 26 Apr, 07:30 |
Furkan KAMACI |
GSoC Acceptance for Security Layer for NutchServer (NUTCH-1756) |
Mon, 25 Apr, 13:25 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model |
Mon, 04 Apr, 07:57 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_ |
Thu, 07 Apr, 15:45 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Wed, 13 Apr, 18:46 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Sun, 17 Apr, 22:45 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2250) CommonCrawlDumper : Invalid format + skipped parts |
Sun, 17 Apr, 22:45 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Mon, 18 Apr, 10:56 |
Hudson (JIRA) |
[jira] [Commented] (NUTCH-2254) Charset issues when using -addBinaryContent and -base64 options |
Wed, 27 Apr, 21:58 |
Jason Wang (JIRA) |
[jira] [Commented] (NUTCH-1824) protocol-http using proxy not working with https sites |
Fri, 29 Apr, 02:27 |
Jean Vence |
Adding a new field to Nutch + MongoDB datastore using plugin |
Wed, 13 Apr, 11:49 |
Joseph Naegele (JIRA) |
[jira] [Created] (NUTCH-2248) CSS parser plugin |
Thu, 07 Apr, 21:43 |
Julien Nioche (JIRA) |
[jira] [Created] (NUTCH-2255) WARCExporter to generate request records |
Wed, 27 Apr, 12:46 |
Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-2255) WARCExporter to generate request records |
Wed, 27 Apr, 12:50 |
Karanjeet Singh |
Re: Nutch: Tika Parser error while parsing an image |
Fri, 08 Apr, 10:18 |
Karanjeet Singh |
Re: Nutch: Tika Parser error while parsing an image |
Fri, 08 Apr, 23:19 |
Karanjeet Singh (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Mon, 18 Apr, 07:22 |
Karanjeet Singh (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Mon, 18 Apr, 07:52 |
Kim Whitehall (JIRA) |
[jira] [Created] (NUTCH-2252) Allow phantomjs as a browser for selenium options |
Sat, 16 Apr, 16:08 |
Kim Whitehall (JIRA) |
[jira] [Commented] (NUTCH-2252) Allow phantomjs as a browser for selenium options |
Sat, 16 Apr, 20:00 |
Leon Misakyan (JIRA) |
[jira] [Commented] (NUTCH-1604) ProtocolFactory not thread-safe |
Tue, 19 Apr, 17:09 |
Leon Misakyan (JIRA) |
[jira] [Created] (NUTCH-2253) ProtocolFactory still not thread-safe |
Wed, 20 Apr, 15:54 |
Leon Misakyan (JIRA) |
[jira] [Updated] (NUTCH-2253) ProtocolFactory still not thread-safe |
Wed, 20 Apr, 15:55 |
Leon Misakyan (JIRA) |
[jira] [Updated] (NUTCH-2253) ProtocolFactory still not thread-safe |
Wed, 20 Apr, 15:55 |
Leon Misakyan (JIRA) |
[jira] [Updated] (NUTCH-2253) ProtocolFactory still not thread-safe |
Wed, 20 Apr, 15:56 |
Leon Misakyan (JIRA) |
[jira] [Updated] (NUTCH-2253) ProtocolFactory still not thread-safe |
Thu, 21 Apr, 07:39 |
Leon Misakyan (JIRA) |
[jira] [Updated] (NUTCH-2253) ProtocolFactory still not thread-safe |
Thu, 21 Apr, 10:27 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit |
Fri, 01 Apr, 17:03 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_ |
Thu, 07 Apr, 15:19 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Wed, 13 Apr, 18:30 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Wed, 13 Apr, 18:30 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_ |
Wed, 13 Apr, 18:31 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1741) Support of Sitemaps in Nutch 2.x |
Wed, 13 Apr, 18:32 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-2238) Indexer for Elasticsearch 2.x |
Wed, 13 Apr, 18:33 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2217) Crawl pages with specified language |
Wed, 13 Apr, 18:38 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2188) While crawling with solr url (kerberos enabled) Error: org.apache.solr.common.SolrException: Unauthorized |
Wed, 13 Apr, 18:39 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2252) Allow phantomjs as a browser for selenium options |
Sat, 16 Apr, 21:05 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-2252) Allow phantomjs as a browser for selenium options |
Sat, 16 Apr, 21:05 |
Lewis John McGibbney (JIRA) |
[jira] [Assigned] (NUTCH-2252) Allow phantomjs as a browser for selenium options |
Sat, 16 Apr, 21:05 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1756) Security layer for NutchServer |
Wed, 27 Apr, 18:58 |
Lewis John Mcgibbney |
Maven Central Plugins |
Sun, 10 Apr, 13:54 |
Lewis John Mcgibbney |
Re: GSoC 2016: You are a mentor for Furkan KAMACI |
Fri, 22 Apr, 19:33 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-2247) Protocol resolver |
Wed, 06 Apr, 16:25 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-2247) Protocol resolver |
Wed, 06 Apr, 16:25 |
Mattmann, Chris A (3980) |
Re: Recent stackoverflow questions |
Tue, 05 Apr, 23:13 |
Mattmann, Chris A (3980) |
FW: Apache Tika used to parse the Panama papers! |
Tue, 05 Apr, 23:13 |
Mattmann, Chris A (3980) |
Re: Maven Central Plugins |
Sun, 10 Apr, 15:45 |
Mattmann, Chris A (3980) |
Re: Jenkins build failures after git migration |
Mon, 18 Apr, 15:56 |
Mattmann, Chris A (3980) |
Re: Jenkins build failures after git migration |
Thu, 21 Apr, 14:19 |
Mattmann, Chris A (3980) |
Need to update version control page and SVN docs to point to Git |
Fri, 29 Apr, 01:14 |
Sebastian Nagel |
Jenkins build failures after git migration |
Mon, 18 Apr, 11:40 |