|
[jira] [Assigned] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. |
|
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. |
Fri, 13 Mar, 08:34 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. |
Fri, 13 Mar, 08:34 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. |
Fri, 13 Mar, 08:34 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. |
Fri, 13 Mar, 09:09 |
|
[jira] [Commented] (NUTCH-2772) Debugging parse filter to show serialized DOM tree |
|
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2772) Debugging parse filter to show serialized DOM tree |
Fri, 13 Mar, 08:42 |
|
[jira] [Commented] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
|
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
Fri, 13 Mar, 08:47 |
Sebastian Nagel (Jira) |
[jira] [Work started] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
Fri, 13 Mar, 08:47 |
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
Fri, 13 Mar, 08:47 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
Fri, 13 Mar, 08:48 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override |
Fri, 13 Mar, 10:03 |
|
[jira] [Commented] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 |
|
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 |
Fri, 13 Mar, 09:09 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 |
Fri, 13 Mar, 09:10 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 |
Fri, 13 Mar, 10:03 |
|
[jira] [Assigned] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
|
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
Fri, 20 Mar, 10:21 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
Wed, 25 Mar, 09:43 |
|
[jira] [Created] (NUTCH-2776) Fetcher to temporarily deduplicate followed redirects |
|
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2776) Fetcher to temporarily deduplicate followed redirects |
Fri, 20 Mar, 18:53 |
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-2776) Fetcher to temporarily deduplicate followed redirects |
Fri, 20 Mar, 18:53 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2776) Fetcher to temporarily deduplicate followed redirects |
Fri, 20 Mar, 19:16 |
|
[jira] [Created] (NUTCH-2777) Upgrade to Hadoop 3.1 |
|
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Mon, 23 Mar, 06:59 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Assigned] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Tue, 24 Mar, 06:41 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Commented] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Tue, 24 Mar, 06:43 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Tue, 24 Mar, 08:01 |