nutch-dev mailing list archives: January 2018

Site index · List index
Message listThread · Author · Date
[jira] [Created] (NUTCH-2490) Sitemap processing: Sitemap index files not working
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2490) Sitemap processing: Sitemap index files not working Tue, 02 Jan, 22:51
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working Tue, 02 Jan, 22:55
Moreno Feltscher (JIRA)   [jira] [Updated] (NUTCH-2490) Sitemap processing: Sitemap index files not working Tue, 02 Jan, 23:41
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 01:16
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 09:48
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 17:42
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 17:42
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 17:42
Hudson (JIRA)   [jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working Wed, 03 Jan, 17:52
[jira] [Created] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 14:16
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 14:18
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 17:35
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 17:35
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 17:36
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 17:40
Hudson (JIRA)   [jira] [Commented] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script Wed, 03 Jan, 17:52
[jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Wed, 03 Jan, 15:45
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Wed, 03 Jan, 15:46
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Wed, 03 Jan, 15:47
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Wed, 03 Jan, 17:26
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Thu, 18 Jan, 17:53
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium Mon, 29 Jan, 16:05
[jira] [Commented] (NUTCH-2454) REST API fix for usage of hostdb in generator
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2454) REST API fix for usage of hostdb in generator Wed, 03 Jan, 17:31
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2454) REST API fix for usage of hostdb in generator Wed, 03 Jan, 17:32
Hudson (JIRA)   [jira] [Commented] (NUTCH-2454) REST API fix for usage of hostdb in generator Wed, 03 Jan, 17:52
[jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers.
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers. Wed, 03 Jan, 17:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers. Wed, 17 Jan, 16:16
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 03 Jan, 20:10
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 03 Jan, 20:10
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Fri, 05 Jan, 08:33
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Fri, 05 Jan, 08:33
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 12:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 12:38
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 13:05
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 13:08
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 15:47
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 16:07
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 16:20
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Mon, 08 Jan, 20:48
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 10:50
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 10:51
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 16:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 18:11
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Wed, 10 Jan, 18:13
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 09:30
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 09:31
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 09:31
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 21:21
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 21:21
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 21:22
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 21:22
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 21:48
Hudson (JIRA)   [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin Thu, 11 Jan, 22:14
[jira] [Created] (NUTCH-2492) Add more configuration parameters to crawl script
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2492) Add more configuration parameters to crawl script Wed, 03 Jan, 23:31
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Wed, 03 Jan, 23:34
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Thu, 04 Jan, 00:15
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 08:35
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 10:05
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 11:50
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 11:51
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 11:51
Hudson (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 12:54
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script Mon, 08 Jan, 14:09
[jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce Fri, 05 Jan, 17:37
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce Wed, 10 Jan, 01:51
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce Wed, 10 Jan, 01:52
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce Fri, 26 Jan, 16:25
[jira] [Resolved] (NUTCH-2467) Sitemap type field can be null
Sebastian Nagel (JIRA)   [jira] [Resolved] (NUTCH-2467) Sitemap type field can be null Sat, 06 Jan, 08:45
[jira] [Updated] (NUTCH-1807) avoid methods relying on system-specific default locale / charset
Sebastian Nagel (JIRA)   [jira] [Updated] (NUTCH-1807) avoid methods relying on system-specific default locale / charset Sun, 07 Jan, 21:07
Sebastian Nagel (JIRA)   [jira] [Commented] (NUTCH-1807) avoid methods relying on system-specific default locale / charset Sun, 07 Jan, 21:08
[jira] [Resolved] (NUTCH-2488) Please use SSL (https) for KEYS, sigs, hashes
Sebastian Nagel (JIRA)   [jira] [Resolved] (NUTCH-2488) Please use SSL (https) for KEYS, sigs, hashes Mon, 08 Jan, 12:25
Sebastian Nagel (JIRA)   [jira] [Assigned] (NUTCH-2488) Please use SSL (https) for KEYS, sigs, hashes Mon, 08 Jan, 12:26
Sebb (JIRA)   [jira] [Closed] (NUTCH-2488) Please use SSL (https) for KEYS, sigs, hashes Mon, 08 Jan, 13:28
[jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads
Jurian Broertjes (JIRA)   [jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads Mon, 08 Jan, 17:49
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads Mon, 15 Jan, 17:38
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2321) Indexing filter checker leaks threads Mon, 15 Jan, 17:39
Lewis John McGibbney (JIRA)   [jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads Mon, 15 Jan, 17:39
Hudson (JIRA)   [jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads Mon, 15 Jan, 17:54
[jira] [Created] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Tue, 09 Jan, 00:24
Moreno Feltscher (JIRA)   [jira] [Updated] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Tue, 09 Jan, 00:26
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Tue, 09 Jan, 00:32
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Tue, 09 Jan, 14:50
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Wed, 10 Jan, 15:51
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Wed, 10 Jan, 16:15
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Wed, 10 Jan, 16:16
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Wed, 10 Jan, 16:16
Hudson (JIRA)   [jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script Wed, 10 Jan, 16:54
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage Wed, 10 Jan, 01:44
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage Wed, 17 Jan, 15:17
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2441) ARG_SEGMENT usage Thu, 18 Jan, 17:52
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage Thu, 18 Jan, 17:52
Hudson (JIRA)   [jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage Thu, 18 Jan, 19:05
[jira] [Updated] (NUTCH-2324) Issue in setting default linkdb path
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2324) Issue in setting default linkdb path Wed, 10 Jan, 01:48
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2324) Issue in setting default linkdb path Wed, 10 Jan, 01:51
[jira] [Created] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3
Ashraful Islam (JIRA)   [jira] [Created] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Thu, 11 Jan, 10:18
Ashraful Islam (JIRA)   [jira] [Updated] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Thu, 11 Jan, 10:20
Ashraful Islam (JIRA)   [jira] [Updated] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Thu, 11 Jan, 10:27
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Thu, 11 Jan, 11:26
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Wed, 17 Jan, 10:27
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Thu, 18 Jan, 06:53
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 15:38
Sebastian Nagel (JIRA)   [jira] [Updated] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 15:39
Sebastian Nagel (JIRA)   [jira] [Assigned] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 15:40
Sebastian Nagel (JIRA)   [jira] [Resolved] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 15:42
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 15:42
Hudson (JIRA)   [jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3 Mon, 29 Jan, 16:10
[jira] [Created] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing Fri, 12 Jan, 23:23
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing Fri, 12 Jan, 23:24
Moreno Feltscher (JIRA)   [jira] [Assigned] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing Tue, 23 Jan, 17:56
[jira] [Created] (NUTCH-2496) Speed up link inversion step in crawling script
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2496) Speed up link inversion step in crawling script Fri, 12 Jan, 23:33
Moreno Feltscher (JIRA)   [jira] [Assigned] (NUTCH-2496) Speed up link inversion step in crawling script Fri, 12 Jan, 23:34
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script Fri, 12 Jan, 23:35
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script Sat, 13 Jan, 09:05
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script Mon, 15 Jan, 23:51
Markus Jelsma (JIRA)   [jira] [Comment Edited] (NUTCH-2496) Speed up link inversion step in crawling script Tue, 16 Jan, 10:53
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script Tue, 16 Jan, 10:53
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script Thu, 18 Jan, 00:35
[jira] [Created] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Sat, 13 Jan, 01:11
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Sat, 13 Jan, 01:12
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Mon, 15 Jan, 17:37
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Thu, 18 Jan, 17:48
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Thu, 18 Jan, 17:48
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Thu, 18 Jan, 17:49
Hudson (JIRA)   [jira] [Commented] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts Thu, 18 Jan, 19:05
[jira] [Created] (NUTCH-2498) Docker fiels are outdated
dhirajforyou (JIRA)   [jira] [Created] (NUTCH-2498) Docker fiels are outdated Sat, 13 Jan, 08:58
dhirajforyou (JIRA)   [jira] [Updated] (NUTCH-2498) Docker files are outdated Sat, 13 Jan, 09:00
dhirajforyou (JIRA)   [jira] [Updated] (NUTCH-2498) Docker files are outdated Sat, 13 Jan, 09:04
[jira] [Commented] (NUTCH-2461) Generate passes the data to when maxCount == 0
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2461) Generate passes the data to when maxCount == 0 Mon, 15 Jan, 17:40
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2461) Generate passes the data to when maxCount == 0 Mon, 15 Jan, 17:41
Hudson (JIRA)   [jira] [Commented] (NUTCH-2461) Generate passes the data to when maxCount == 0 Mon, 15 Jan, 17:54
[jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values
Moreno Feltscher (JIRA)   [jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 16 Jan, 21:43
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 16 Jan, 21:43
Moreno Feltscher (JIRA)   [jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 16 Jan, 21:43
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 16 Jan, 21:47
Moreno Feltscher (JIRA)   [jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 16 Jan, 21:47
Moreno Feltscher (JIRA)   [jira] [Assigned] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 23 Jan, 16:18
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 23 Jan, 17:59
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 23 Jan, 17:59
Hudson (JIRA)   [jira] [Commented] (NUTCH-2499) Elastic REST Indexer: Duplicate values Tue, 23 Jan, 18:58
[jira] [Commented] (NUTCH-2370) FileDumper: save JSON mapping file -> URL
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2370) FileDumper: save JSON mapping file -> URL Tue, 16 Jan, 22:00
[jira] [Created] (NUTCH-2500) Add pull-reqest template to github
Sebastian Nagel (JIRA)   [jira] [Created] (NUTCH-2500) Add pull-reqest template to github Wed, 17 Jan, 11:04
Sebastian Nagel (JIRA)   [jira] [Updated] (NUTCH-2500) Add pull-reqest template to github Wed, 17 Jan, 11:04
[jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects
Sebastian Nagel (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 17 Jan, 13:26
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 17 Jan, 15:39
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Tue, 23 Jan, 15:50
Markus Jelsma (JIRA)   [jira] [Updated] (NUTCH-2466) Sitemap processor to follow redirects Tue, 23 Jan, 15:50
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 12:26
Sebastian Nagel (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 12:35
Markus Jelsma (JIRA)   [jira] [Updated] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 12:57
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 12:57
Sebastian Nagel (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 13:29
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 13:55
Markus Jelsma (JIRA)   [jira] [Resolved] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 13:55
Hudson (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 14:55
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 22:46
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 22:58
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 23:05
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 23:08
Markus Jelsma (JIRA)   [jira] [Comment Edited] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 23:15
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects Wed, 31 Jan, 23:15
[jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics)
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) Wed, 17 Jan, 13:32
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) Wed, 17 Jan, 13:33
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Wed, 17 Jan, 13:36
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) Wed, 17 Jan, 13:36
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Wed, 17 Jan, 13:44
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Wed, 17 Jan, 13:44
Semyon Semyonov (JIRA)   [jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Fri, 19 Jan, 11:37
Semyon Semyonov (JIRA)   [jira] [Comment Edited] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Thu, 25 Jan, 16:20
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Mon, 29 Jan, 15:28
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Mon, 29 Jan, 15:43
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Mon, 29 Jan, 15:50
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions Mon, 29 Jan, 15:51
[jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay Thu, 18 Jan, 13:00
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay Thu, 18 Jan, 17:51
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay Thu, 25 Jan, 14:50
[jira] [Created] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Mon, 22 Jan, 22:31
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Tue, 23 Jan, 16:17
Moreno Feltscher (JIRA)   [jira] [Assigned] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Tue, 23 Jan, 17:55
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Fri, 26 Jan, 08:56
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Fri, 26 Jan, 08:56
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Fri, 26 Jan, 08:56
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Fri, 26 Jan, 08:56
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Mon, 29 Jan, 11:36
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Mon, 29 Jan, 12:33
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Mon, 29 Jan, 15:33
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script Wed, 31 Jan, 22:53
[jira] [Created] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 13:29
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 16:16
Moreno Feltscher (JIRA)   [jira] [Assigned] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 17:55
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 18:01
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 18:02
Hudson (JIRA)   [jira] [Commented] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering Tue, 23 Jan, 18:58
[jira] [Created] (NUTCH-2503) Add option to run tests for a single plugin
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 16:07
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 16:13
Moreno Feltscher (JIRA)   [jira] [Commented] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 16:16
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 17:52
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 17:53
Hudson (JIRA)   [jira] [Commented] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 18:58
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2503) Add option to run tests for a single plugin Tue, 23 Jan, 22:00
[jira] [Updated] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph Wed, 24 Jan, 20:55
Markus Jelsma (JIRA)   [jira] [Commented] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph Wed, 24 Jan, 21:53
Lewis John McGibbney (JIRA)   [jira] [Commented] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph Fri, 26 Jan, 17:32
[jira] [Created] (NUTCH-2504) Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate
Semyon Semyonov (JIRA)   [jira] [Created] (NUTCH-2504) Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate Thu, 25 Jan, 14:55
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2504) Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate Thu, 25 Jan, 15:07
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2504) Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate Thu, 25 Jan, 15:07
Semyon Semyonov (JIRA)   [jira] [Updated] (NUTCH-2504) Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate Thu, 25 Jan, 15:07
[jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Fri, 26 Jan, 04:21
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Fri, 26 Jan, 19:36
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Fri, 26 Jan, 20:03
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Fri, 26 Jan, 23:22
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Sat, 27 Jan, 00:31
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Sat, 27 Jan, 00:34
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2202) Integration of Anthelion (Focused Crawling Module) into Nutch Sun, 28 Jan, 22:04
Apache Wiki [Nutch Wiki] New attachment added to page Anthelion Fri, 26 Jan, 19:28
[jira] [Created] (NUTCH-2505) nutch does not delete the .locked file, when the generator partition got an exception
Ajoy Lian (JIRA)   [jira] [Created] (NUTCH-2505) nutch does not delete the .locked file, when the generator partition got an exception Sat, 27 Jan, 07:57
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2505) nutch does not delete the .locked file, when the generator partition got an exception Sat, 27 Jan, 08:15
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2505) nutch does not delete the .locked file, when the generator partition got an exception Mon, 29 Jan, 12:38
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2505) nutch does not delete the .locked file, when the generator partition got an exception Mon, 29 Jan, 15:58
[jira] [Created] (NUTCH-2506) host is not available for filtering on the JEXL indexing plugin
Jorge Luis Betancourt Gonzalez (JIRA)   [jira] [Created] (NUTCH-2506) host is not available for filtering on the JEXL indexing plugin Tue, 30 Jan, 12:56
Jorge Luis Betancourt Gonzalez (JIRA)   [jira] [Commented] (NUTCH-2506) host is not available for filtering on the JEXL indexing plugin Tue, 30 Jan, 14:47
[jira] [Created] (NUTCH-2507) NutchTutorial wiki pages as a lot of outdated command line calls when it starts with the solr interaction
artodeto (JIRA)   [jira] [Created] (NUTCH-2507) NutchTutorial wiki pages as a lot of outdated command line calls when it starts with the solr interaction Wed, 31 Jan, 11:15
[jira] [Created] (NUTCH-2508) Misleading documentation about http.proxy.exception.list
Moreno Feltscher (JIRA)   [jira] [Created] (NUTCH-2508) Misleading documentation about http.proxy.exception.list Wed, 31 Jan, 22:39
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2508) Misleading documentation about http.proxy.exception.list Wed, 31 Jan, 22:52
ASF GitHub Bot (JIRA)   [jira] [Commented] (NUTCH-2508) Misleading documentation about http.proxy.exception.list Wed, 31 Jan, 23:00
Lewis John McGibbney (JIRA)   [jira] [Updated] (NUTCH-2508) Misleading documentation about http.proxy.exception.list Wed, 31 Jan, 23:00
Lewis John McGibbney (JIRA)   [jira] [Resolved] (NUTCH-2508) Misleading documentation about http.proxy.exception.list Wed, 31 Jan, 23:00
Message listThread · Author · Date
Box list
Jul 201931
Jun 201910
May 201979
Apr 201977
Mar 201949
Feb 201971
Jan 2019156
Dec 201844
Nov 201891
Oct 2018245
Sep 201893
Aug 201881
Jul 2018166
Jun 2018360
May 2018136
Apr 2018232
Mar 2018272
Feb 201865
Jan 2018234
Dec 2017419
Nov 2017200
Oct 2017241
Sep 2017188
Aug 2017169
Jul 2017142
Jun 2017107
May 2017126
Apr 2017128
Mar 2017119
Feb 201773
Jan 2017162
Dec 201653
Nov 201636
Oct 201691
Sep 201658
Aug 2016296
Jul 2016152
Jun 2016200
May 2016224
Apr 2016153
Mar 2016218
Feb 2016461
Jan 2016240
Dec 2015171
Nov 2015204
Oct 2015412
Sep 2015458
Aug 2015259
Jul 2015304
Jun 2015446
May 2015319
Apr 2015463
Mar 2015384
Feb 2015530
Jan 2015258
Dec 2014162
Nov 2014165
Oct 2014249
Sep 2014376
Aug 2014136
Jul 2014219
Jun 2014355
May 2014378
Apr 2014332
Mar 2014248
Feb 2014168
Jan 2014471
Dec 2013186
Nov 2013177
Oct 2013182
Sep 2013158
Aug 2013182
Jul 2013240
Jun 2013321
May 2013288
Apr 2013437
Mar 2013521
Feb 2013201
Jan 2013560
Dec 2012176
Nov 2012251
Oct 2012200
Sep 2012219
Aug 2012230
Jul 2012301
Jun 2012391
May 2012317
Apr 2012352
Mar 2012297
Feb 2012395
Jan 2012298
Dec 2011318
Nov 2011524
Oct 2011483
Sep 2011605
Aug 2011528
Jul 2011635
Jun 2011418
May 2011176
Apr 2011453
Mar 2011139
Feb 201162
Jan 2011150
Dec 2010100
Nov 201096
Oct 2010177
Sep 2010143
Aug 2010289
Jul 2010364
Jun 2010246
May 201075
Apr 2010124
Mar 2010183
Feb 2010134
Jan 2010106
Dec 200998
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008158
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008190
Jan 2008155
Dec 200768
Nov 2007188
Oct 2007179
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510