Markus Jelsma |
RE: [DISCUSS] Release 1.14? |
Fri, 08 Dec, 23:40 |
Markus Jelsma |
RE: [DISCUSS] Release 1.14? |
Tue, 12 Dec, 12:40 |
Markus Jelsma |
RE: [VOTE] Release Apache Nutch 1.14 RC#1 |
Tue, 19 Dec, 17:08 |
Markus Jelsma |
RE: [VOTE] Release Apache Nutch 1.14 RC#1 |
Tue, 19 Dec, 20:06 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Sat, 02 Dec, 17:04 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Sat, 02 Dec, 17:04 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Sat, 02 Dec, 17:08 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Sat, 02 Dec, 20:41 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Mon, 04 Dec, 06:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Mon, 04 Dec, 06:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Mon, 04 Dec, 06:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Mon, 04 Dec, 08:49 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2470) CrawlDbReader -stats to show quantiles of score |
Mon, 04 Dec, 20:28 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2456) Allow to index pages/URLs not contained in CrawlDb |
Tue, 05 Dec, 09:40 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2451) protocol-ftp to resolve relative URL when following redirects |
Tue, 05 Dec, 11:11 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2451) protocol-ftp to resolve relative URL when following redirects |
Tue, 05 Dec, 11:12 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2317) Plugin jars don't get added to classpath while running in local |
Tue, 05 Dec, 12:12 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2317) Plugin jars don't get added to classpath while running in local |
Tue, 05 Dec, 12:12 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2470) CrawlDbReader -stats to show quantiles of score |
Tue, 05 Dec, 12:24 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2399) indexer-elastic does not index multi-value fields (only the first value is indexed) |
Wed, 06 Dec, 11:49 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Thu, 07 Dec, 12:24 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Thu, 07 Dec, 15:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay |
Fri, 08 Dec, 16:06 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2474) CrawlDbReader -stats fails with ClassCastException |
Fri, 08 Dec, 21:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Tue, 12 Dec, 09:38 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2455) Speed up the merging of HostDb entries for variable fetch delay |
Wed, 13 Dec, 10:07 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage |
Wed, 13 Dec, 13:35 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Wed, 13 Dec, 20:19 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium |
Wed, 13 Dec, 20:23 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2450) Remove FixMe in ParseOutputFormat |
Wed, 13 Dec, 20:24 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8 |
Wed, 13 Dec, 20:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8 |
Wed, 13 Dec, 20:48 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8 |
Wed, 13 Dec, 20:48 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8 |
Wed, 13 Dec, 20:48 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language. |
Wed, 13 Dec, 20:54 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language. |
Wed, 13 Dec, 20:55 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language. |
Wed, 13 Dec, 20:55 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Wed, 13 Dec, 20:57 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Wed, 13 Dec, 21:02 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Wed, 13 Dec, 21:31 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Wed, 13 Dec, 21:32 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong dependency |
Wed, 13 Dec, 21:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Thu, 14 Dec, 12:04 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2474) CrawlDbReader -stats fails with ClassCastException |
Thu, 14 Dec, 15:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Thu, 14 Dec, 22:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Thu, 14 Dec, 22:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Thu, 14 Dec, 22:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Thu, 14 Dec, 22:36 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Fri, 15 Dec, 09:25 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2439) Upgrade to Apache Tika 1.17 |
Fri, 15 Dec, 12:54 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2439) Upgrade to Apache Tika 1.17 |
Fri, 15 Dec, 13:14 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2480) Upgrade crawler-commons dependency to 0.9 |
Fri, 15 Dec, 13:59 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2354) Upgrade Hadoop dependencies to 2.7.3 |
Fri, 15 Dec, 14:45 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2362) Upgrade MaxMind GeoIP version in index-geoip |
Fri, 15 Dec, 15:34 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2439) Upgrade to Apache Tika 1.17 |
Fri, 15 Dec, 17:18 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2480) Upgrade crawler-commons dependency to 0.9 |
Fri, 15 Dec, 19:37 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2354) Upgrade Hadoop dependencies to 2.7.4 |
Fri, 15 Dec, 19:38 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2362) Upgrade MaxMind GeoIP version in index-geoip |
Fri, 15 Dec, 20:49 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2450) Remove FixMe in ParseOutputFormat |
Sun, 17 Dec, 07:55 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Sun, 17 Dec, 10:45 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2365) HTTP Redirects to SubDomains don't get crawled |
Sun, 17 Dec, 10:51 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Sun, 17 Dec, 11:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2478) // is not a valid base URL |
Sun, 17 Dec, 11:34 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2477) Refactor *Checker classes to use base class for common code |
Sun, 17 Dec, 12:02 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2477) Refactor *Checker classes to use base class for common code |
Sun, 17 Dec, 13:16 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Sun, 17 Dec, 13:28 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2370) FileDumper: save JSON mapping file -> URL |
Sun, 17 Dec, 14:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2370) FileDumper: save JSON mapping file -> URL |
Sun, 17 Dec, 14:47 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2450) Remove FixMe in ParseOutputFormat |
Sun, 17 Dec, 15:40 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2483) Remove/replace indirect dependencies to org.json |
Sun, 17 Dec, 21:24 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Sun, 17 Dec, 21:32 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2359) Parsefilter-regex raises IndexOutOfBoundsException when rules are ill-formed |
Mon, 18 Dec, 14:23 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2359) Parsefilter-regex raises IndexOutOfBoundsException when rules are ill-formed |
Mon, 18 Dec, 14:23 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2295) Nutch master docker container broken |
Mon, 18 Dec, 15:35 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2415) Create a JEXL based IndexingFilter |
Mon, 18 Dec, 15:50 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2365) HTTP Redirects to SubDomains don't get crawled if db.ignore.external.links.mode == byDomain |
Mon, 18 Dec, 16:30 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2295) Nutch master docker container broken |
Mon, 18 Dec, 16:34 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2483) Remove/replace indirect dependencies to org.json |
Mon, 18 Dec, 17:13 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2450) Remove FixMe in ParseOutputFormat |
Tue, 19 Dec, 04:09 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2450) Remove FixMe in ParseOutputFormat |
Tue, 19 Dec, 04:09 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2486) Compiler Warning: Unchecked / unsafe operations in MimeTypeIndexingFilter |
Tue, 19 Dec, 14:54 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-2486) Compiler Warning: Unchecked / unsafe operations in MimeTypeIndexingFilter |
Tue, 19 Dec, 14:57 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 08:56 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:16 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:17 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:17 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:17 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:17 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:18 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 18:22 |
ASF GitHub Bot (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Wed, 20 Dec, 19:45 |