ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Mon, 06 Apr, 16:03 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
Fri, 10 Apr, 11:42 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
Fri, 10 Apr, 11:45 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Fri, 10 Apr, 11:45 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Fri, 10 Apr, 11:46 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay |
Fri, 10 Apr, 12:00 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2777) Upgrade to Hadoop 3.1 |
Fri, 10 Apr, 12:56 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Assigned] (NUTCH-2765) Unify and cleanup X509TrustManager |
Fri, 10 Apr, 16:50 |
Rodrigo Pereira dos Santos (Jira) |
[jira] [Updated] (NUTCH-2651) Upgrade to Tika 1.19.1 (from 1.18) |
Sat, 11 Apr, 05:31 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Sat, 11 Apr, 13:52 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Assigned] (NUTCH-2755) Remove obsolete plugin indexer-elastic-rest |
Sun, 12 Apr, 07:01 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Mon, 13 Apr, 19:41 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Tue, 14 Apr, 06:25 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Tue, 14 Apr, 10:07 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Tue, 14 Apr, 10:07 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Tue, 14 Apr, 10:07 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Tue, 14 Apr, 10:22 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2778) indexer-elastic to properly log errors |
Wed, 15 Apr, 20:17 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2778) indexer-elastic to properly log errors |
Wed, 15 Apr, 20:27 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Wed, 15 Apr, 22:14 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Wed, 15 Apr, 22:16 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Wed, 15 Apr, 22:16 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Thu, 16 Apr, 11:13 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Thu, 16 Apr, 11:48 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Sun, 19 Apr, 09:33 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2757) indexer-elastic: add authentication options |
Sun, 19 Apr, 09:37 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2757) indexer-elastic: add authentication options |
Sun, 19 Apr, 10:01 |
GitBox |
[GitHub] [nutch] balashashanka opened a new pull request #510: NUTCH-2755: Remove obsolete plugin indexer-elastic-rest |
Mon, 20 Apr, 14:11 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2755) Remove obsolete plugin indexer-elastic-rest |
Mon, 20 Apr, 14:12 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Commented] (NUTCH-1103) Port protocol-sftp to 1.4 |
Mon, 20 Apr, 16:51 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2755) Remove obsolete plugin indexer-elastic-rest |
Tue, 21 Apr, 09:31 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2677) Update Jest client in indexer-elastic-rest plugin |
Tue, 21 Apr, 09:32 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2304) Fix Elasticsearch Rest Indexing Plugin's Dependencies |
Tue, 21 Apr, 09:33 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2304) Fix Elasticsearch Rest Indexing Plugin's Dependencies |
Tue, 21 Apr, 09:35 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-1337) WebGraph to follow redirects |
Tue, 21 Apr, 09:37 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2277) Adding goldstandard.txt default file in conf |
Tue, 21 Apr, 09:38 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2207) Remove class duplication and smarten-up scoring-similarity plugin |
Tue, 21 Apr, 09:38 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient |
Tue, 21 Apr, 09:38 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2278) Handle alpha-2 language codes consistently |
Tue, 21 Apr, 09:39 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2334) Extension point for schedulers |
Tue, 21 Apr, 09:40 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property. |
Tue, 21 Apr, 09:40 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2292) Mavenize the build for nutch-core and nutch-plugins |
Tue, 21 Apr, 09:40 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2417) Support for variable fetch delay via FreeGenerator |
Tue, 21 Apr, 09:42 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2755) Remove obsolete plugin indexer-elastic-rest |
Tue, 21 Apr, 10:00 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2779) Upgrade to Tika 1.24.1 |
Tue, 21 Apr, 12:05 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #511: NUTCH-2779 Upgrade to Tika 1.24.1 |
Tue, 21 Apr, 12:09 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2779) Upgrade to Tika 1.24.1 |
Tue, 21 Apr, 12:10 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Tue, 21 Apr, 12:13 |
Sebastian Nagel |
[DISCUSS] Release 1.17 ? |
Thu, 23 Apr, 06:27 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-1103) Port protocol-sftp to 1.4 |
Thu, 23 Apr, 06:50 |
Sebastian Nagel (Jira) |
[jira] [Comment Edited] (NUTCH-1103) Port protocol-sftp to 1.4 |
Thu, 23 Apr, 06:51 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2781) Increase default Java heap size |
Thu, 23 Apr, 07:19 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2779) Upgrade to Tika 1.24.1 |
Thu, 23 Apr, 07:32 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #512: NUTCH-2781 Increase default Java heap size |
Thu, 23 Apr, 08:22 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2781) Increase default Java heap size |
Thu, 23 Apr, 08:23 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #513: NUTCH-2501 allow to set Java heap size when using crawl script in distributed mode |
Thu, 23 Apr, 10:23 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script |
Thu, 23 Apr, 10:24 |
GitBox |
[GitHub] [nutch] sebastian-nagel commented on a change in pull request #279: NUTCH-2501: Take NUTCH_HEAPSIZE into account when crawling using crawl script |
Thu, 23 Apr, 10:27 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script |
Thu, 23 Apr, 10:28 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2501) allow to set Java heap size when using crawl script in distributed mode |
Thu, 23 Apr, 10:34 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2342) Inlinks are not being indexed as part of index-links plugin |
Thu, 23 Apr, 10:40 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2342) Inlinks are not being indexed as part of index-links plugin |
Thu, 23 Apr, 10:40 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2379) crawl script dedup's crawldb update is slow |
Thu, 23 Apr, 10:45 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Thu, 23 Apr, 10:57 |
Sebastian Nagel (Jira) |
[jira] [Comment Edited] (NUTCH-2681) ClassCastException - Apache Nutch 1.x, Selenium v2.48.2, firefox 31.4.0 |
Thu, 23 Apr, 11:02 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2681) ClassCastException - Apache Nutch 1.x, Selenium v2.48.2, firefox 31.4.0 |
Thu, 23 Apr, 11:02 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2681) ClassCastException - Apache Nutch 1.x, Selenium v2.48.2, firefox 31.4.0 |
Thu, 23 Apr, 11:02 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2385) 1.x Elasticsearch Indexer - path.home is not configured |
Thu, 23 Apr, 11:07 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2274) InteractiveSelenium Plugin's DefaultHandler Returns Null |
Thu, 23 Apr, 11:11 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-1194) Generator: CrawlDB lock should be released earlier |
Thu, 23 Apr, 13:56 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #514: NUTCH-1194 Generator: CrawlDB lock should be released earlier |
Thu, 23 Apr, 14:03 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-1194) Generator: CrawlDB lock should be released earlier |
Thu, 23 Apr, 14:04 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Assigned] (NUTCH-1103) Port protocol-sftp to 1.4 |
Thu, 23 Apr, 15:27 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2782) protocol-http / lib-http: support TLSv1.3 |
Fri, 24 Apr, 05:49 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2779) Upgrade to Tika 1.24.1 |
Fri, 24 Apr, 07:09 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2779) Upgrade to Tika 1.24.1 |
Fri, 24 Apr, 07:56 |
GitBox |
[GitHub] [nutch] balashashanka opened a new pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Fri, 24 Apr, 11:49 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Fri, 24 Apr, 11:50 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2783) Use (more) parametrized logging |
Fri, 24 Apr, 13:50 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #516: NUTCH-2783 Use (more) parametrized logging |
Fri, 24 Apr, 13:58 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2783) Use (more) parametrized logging |
Fri, 24 Apr, 13:59 |
GitBox |
[GitHub] [nutch] lewismc commented on pull request #516: NUTCH-2783 Use (more) parametrized logging |
Fri, 24 Apr, 15:44 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2783) Use (more) parametrized logging |
Fri, 24 Apr, 15:45 |
GitBox |
[GitHub] [nutch] sebastian-nagel commented on a change in pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 07:50 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 07:51 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #517: NUTCH-2495: Use -deleteGone instead of clean job in crawl script while indexing |
Mon, 27 Apr, 08:29 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing |
Mon, 27 Apr, 08:30 |
GitBox |
[GitHub] [nutch] sebastian-nagel commented on pull request #275: NUTCH-2495: Use -deleteGone instead of clean job in crawler script while indexing |
Mon, 27 Apr, 08:31 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing |
Mon, 27 Apr, 08:32 |
Sebastian Nagel (Jira) |
[jira] [Created] (NUTCH-2784) Add tool to list Nutch and Hadoop properties |
Mon, 27 Apr, 08:48 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #518: NUTCH-2784 Add tool to list Nutch and Hadoop properties |
Mon, 27 Apr, 08:52 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2784) Add tool to list Nutch and Hadoop properties |
Mon, 27 Apr, 08:53 |
GitBox |
[GitHub] [nutch] balashashanka commented on a change in pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 12:30 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 12:31 |
GitBox |
[GitHub] [nutch] sebastian-nagel commented on pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:10 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:11 |
GitBox |
[GitHub] [nutch] sebastian-nagel edited a comment on pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:11 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:12 |
GitBox |
[GitHub] [nutch] balashashanka commented on pull request #515: NUTCH-2780 : Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:26 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2780) Upgrade index-solr to use Solr 8.5.1 |
Mon, 27 Apr, 13:27 |