Roannel Fernández Hernández (JIRA) | [jira] [Created] (NUTCH-2580) Improvements for Rabbitmq support | Mon, 21 May, 14:21
Roannel Fernández Hernández (JIRA) | [jira] [Updated] (NUTCH-2580) Improvements for Rabbitmq support | Mon, 21 May, 14:40
Roannel Fernández Hernández (JIRA) | [jira] [Updated] (NUTCH-2580) Improvements for Rabbitmq support | Thu, 24 May, 20:15
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size | Sun, 06 May, 10:28
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp | Wed, 09 May, 12:01
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp | Wed, 09 May, 12:53
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp | Wed, 09 May, 14:20
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size | Thu, 10 May, 09:41
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script | Thu, 10 May, 12:16
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 21:02
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Thu, 10 May, 21:36
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Fri, 11 May, 09:55
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2577) protocol-selenium can't handle https | Tue, 15 May, 15:19
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers. | Sun, 20 May, 23:52
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2581) Caching of redirected robots.txt may overwrite correct robots.txt rules | Tue, 22 May, 13:08
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2500) Add pull-reqest template to github | Wed, 23 May, 10:50
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2577) protocol-selenium can't handle https | Wed, 23 May, 11:14
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2577) protocol-selenium can't handle https | Wed, 23 May, 16:20
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2500) Add pull-reqest template to github | Wed, 23 May, 16:57
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2500) Add pull-reqest template to github | Thu, 24 May, 12:28
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp | Thu, 24 May, 13:21
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp | Thu, 24 May, 15:45
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2579) Fetcher to use parsed URL to call ProtocolFactory.getProtocol(url) | Thu, 24 May, 15:58
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2580) Improvements for Rabbitmq support | Thu, 24 May, 20:44
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2584) Upgrade parse-tika to use Tika 1.18 | Fri, 25 May, 13:19
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Tue, 29 May, 16:17
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Wed, 30 May, 10:39
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers. | Thu, 31 May, 15:06
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2580) Improvements for Rabbitmq support | Thu, 31 May, 15:33
ASF GitHub Bot (JIRA) | [jira] [Commented] (NUTCH-2590) SegmentReader -get fails | Thu, 31 May, 15:59
Cihad Guzel | A Hadoop documentation issue about Nutch | Tue, 29 May, 08:55
Gerard Bouchar (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 24 May, 12:28
Gerard Bouchar (JIRA) | [jira] [Commented] (NUTCH-2549) protocol-http does not behave the same as browsers | Thu, 24 May, 12:29
Gerard Bouchar (JIRA) | [jira] [Updated] (NUTCH-2549) protocol-http does not behave the same as browsers | Thu, 24 May, 12:29
Gerard Bouchar (JIRA) | [jira] [Created] (NUTCH-2586) Add a fallback mechanism for missing meta tags | Mon, 28 May, 14:30
Gerard Bouchar (JIRA) | [jira] [Created] (NUTCH-2587) Tests do not pass | Mon, 28 May, 15:16
Gerard Bouchar (JIRA) | [jira] [Closed] (NUTCH-2587) Tests do not pass | Tue, 29 May, 08:47
Gerard Bouchar (JIRA) | [jira] [Created] (NUTCH-2589) HTML redirections are not followed when using parse-tika | Tue, 29 May, 16:01
Gerard Bouchar (JIRA) | [jira] [Updated] (NUTCH-2589) HTML redirections are not followed when using parse-tika | Tue, 29 May, 16:02
Gerard Bouchar (JIRA) | [jira] [Updated] (NUTCH-2589) HTML redirections are not followed when using parse-tika | Tue, 29 May, 16:09
Gerard Bouchar (JIRA) | [jira] [Commented] (NUTCH-2589) HTML redirections are not followed when using parse-tika | Wed, 30 May, 12:54
Hudson (JIRA) | [jira] [Commented] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:57
Hudson (JIRA) | [jira] [Commented] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 12:11
Hudson (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 22:17
Hudson (JIRA) | [jira] [Commented] (NUTCH-2577) protocol-selenium can't handle https | Wed, 23 May, 16:54
Hudson (JIRA) | [jira] [Commented] (NUTCH-2500) Add pull-reqest template to github | Thu, 24 May, 12:45
Hudson (JIRA) | [jira] [Commented] (NUTCH-2500) Add pull-reqest template to github | Thu, 24 May, 12:54
Lewis John McGibbney (JIRA) | [jira] [Commented] (NUTCH-2512) Nutch 1.14 does not work under JDK9 | Tue, 22 May, 20:59
Markus Jelsma (JIRA) | [jira] [Created] (NUTCH-2585) NPE in TrieStringMatcher | Fri, 25 May, 14:33
Michael Coffey (JIRA) | [jira] [Commented] (NUTCH-2468) should filter out invalid URLs by default | Thu, 03 May, 18:51
Michael Coffey (JIRA) | [jira] [Comment Edited] (NUTCH-2468) should filter out invalid URLs by default | Thu, 03 May, 18:52
Omkar Reddy (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size | Sun, 06 May, 09:19
Omkar Reddy (JIRA) | [jira] [Commented] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 24 May, 12:34
Omkar Reddy (JIRA) | [jira] [Commented] (NUTCH-2557) protocol-http fails to follow redirections when an HTTP response body is invalid | Fri, 25 May, 11:32
Ralf (JIRA) | [jira] [Commented] (NUTCH-2290) Update licenses of bundled libraries | Tue, 22 May, 20:35
Ralf (JIRA) | [jira] [Commented] (NUTCH-2512) Nutch 1.14 does not work under JDK9 | Tue, 22 May, 20:48
Ralf (JIRA) | [jira] [Commented] (NUTCH-2290) Update licenses of bundled libraries | Thu, 24 May, 13:31
Ralf (JIRA) | [jira] [Created] (NUTCH-2583) Upgrading Nutch's dependencies | Thu, 24 May, 13:48
Ralf (JIRA) | [jira] [Updated] (NUTCH-2583) Upgrading Nutch's dependencies | Thu, 24 May, 13:56
Ralf (JIRA) | [jira] [Updated] (NUTCH-2583) Upgrading Nutch's dependencies | Thu, 24 May, 13:59
Ralf (JIRA) | [jira] [Updated] (NUTCH-2583) Upgrading Nutch's dependencies | Thu, 24 May, 14:05
Ralf (JIRA) | [jira] [Commented] (NUTCH-2584) Upgrade parse-tika to use Tika 1.18 | Thu, 24 May, 16:22
Ralf (JIRA) | [jira] [Commented] (NUTCH-2584) Upgrade parse-tika to use Tika 1.18 | Fri, 25 May, 17:24
Rich Bowen | ApacheCon North America 2018 schedule is now live. | Tue, 01 May, 12:36
Sebastian Nagel | Re: A Hadoop documentation issue about Nutch | Wed, 30 May, 11:06
Sebastian Nagel (JIRA) | [jira] [Assigned] (NUTCH-2513) ant eclipse protocol unsafe | Tue, 08 May, 11:23
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:28
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:28
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:28
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:33
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2513) ant eclipse target fails with "protocol switch unsafe" | Tue, 08 May, 11:33
Sebastian Nagel (JIRA) | [jira] [Created] (NUTCH-2576) HTTP protocol plugin based on okhttp | Wed, 09 May, 11:48
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2576) HTTP protocol plugin based on okhttp | Wed, 09 May, 11:52
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2514) Segmentation Fault issue while running crawl job. | Thu, 10 May, 11:00
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2161) Interrupted failed and/or killed tasks fail to clean up temp directories in HDFS | Thu, 10 May, 12:59
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 13:52
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 21:02
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 21:03
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 21:03
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2575) protocol-http does not respect the maximum content-size for chunked responses | Thu, 10 May, 21:04
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Thu, 10 May, 21:22
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Thu, 10 May, 21:36
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses | Thu, 10 May, 21:36
Sebastian Nagel (JIRA) | [jira] [Assigned] (NUTCH-2574) hostCount >= maxCount comparison wrong | Fri, 11 May, 14:03
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2574) hostCount >= maxCount comparison wrong | Fri, 11 May, 14:03
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2468) should filter out invalid URLs by default | Fri, 11 May, 14:37
Sebastian Nagel (JIRA) | [jira] [Created] (NUTCH-2578) Avoid lock by MimeUtil in constructor of protocol.Content | Thu, 17 May, 12:17
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2578) Avoid lock by MimeUtil in constructor of protocol.Content | Thu, 17 May, 14:49
Sebastian Nagel (JIRA) | [jira] [Updated] (NUTCH-2578) Avoid lock by MimeUtil in constructor of protocol.Content | Fri, 18 May, 15:09
Sebastian Nagel (JIRA) | [jira] [Created] (NUTCH-2579) Fetcher to use parsed URL to call ProtocolFactory.getProtocol(url) | Fri, 18 May, 16:17
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2578) Avoid lock by MimeUtil in constructor of protocol.Content | Fri, 18 May, 16:28
Sebastian Nagel (JIRA) | [jira] [Created] (NUTCH-2581) Caching of redirected robots.txt may overwrite correct robots.txt rules | Tue, 22 May, 13:04
Sebastian Nagel (JIRA) | [jira] [Assigned] (NUTCH-2581) Caching of redirected robots.txt may overwrite correct robots.txt rules | Tue, 22 May, 13:08
Sebastian Nagel (JIRA) | [jira] [Created] (NUTCH-2582) Set pool size of XML SAX parsers used for MIME detection in Tika 1.19 | Tue, 22 May, 15:56
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2582) Set pool size of XML SAX parsers used for MIME detection in Tika 1.19 | Tue, 22 May, 15:58
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2577) protocol-selenium can't handle https | Wed, 23 May, 16:22
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2273) Selenium and InteractiveSelenium Do Not Support HTTPS | Wed, 23 May, 16:23
Sebastian Nagel (JIRA) | [jira] [Resolved] (NUTCH-2310) Protocol-Selenium does not support HTTPS protocol | Wed, 23 May, 16:23
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2512) Nutch 1.14 does not work under JDK9 | Thu, 24 May, 09:21
Sebastian Nagel (JIRA) | [jira] [Commented] (NUTCH-2290) Update licenses of bundled libraries | Thu, 24 May, 11:42