From dev-return-40908-apmail-nutch-dev-archive=nutch.apache.org@nutch.apache.org Tue Dec 8 10:59:04 2020 Return-Path: X-Original-To: apmail-nutch-dev-archive@www.apache.org Delivered-To: apmail-nutch-dev-archive@www.apache.org Received: from mxout1-he-de.apache.org (mxout1-he-de.apache.org [95.216.194.37]) by minotaur.apache.org (Postfix) with ESMTP id 13E8E19288 for ; Tue, 8 Dec 2020 10:59:04 +0000 (UTC) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-he-de.apache.org (ASF Mail Server at mxout1-he-de.apache.org) with SMTP id 1561966CD0 for ; Tue, 8 Dec 2020 10:59:02 +0000 (UTC) Received: (qmail 80561 invoked by uid 500); 8 Dec 2020 10:59:01 -0000 Delivered-To: apmail-nutch-dev-archive@nutch.apache.org Received: (qmail 80515 invoked by uid 500); 8 Dec 2020 10:59:01 -0000 Mailing-List: contact dev-help@nutch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@nutch.apache.org Delivered-To: mailing list dev@nutch.apache.org Received: (qmail 80493 invoked by uid 99); 8 Dec 2020 10:59:00 -0000 Received: from ec2-52-204-25-47.compute-1.amazonaws.com (HELO mailrelay1-ec2-va.apache.org) (52.204.25.47) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Dec 2020 10:59:00 +0000 Received: from jira2-he-de.apache.org (jira2-he-de.apache.org [168.119.33.54]) by mailrelay1-ec2-va.apache.org (ASF Mail Server at mailrelay1-ec2-va.apache.org) with ESMTPS id B454C3E9D6 for ; Tue, 8 Dec 2020 10:59:00 +0000 (UTC) Received: from jira2-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira2-he-de.apache.org (ASF Mail Server at jira2-he-de.apache.org) with ESMTP id 0FCD1C804A8 for ; Tue, 8 Dec 2020 10:59:00 +0000 (UTC) Date: Tue, 8 Dec 2020 10:59:00 +0000 (UTC) From: "ASF GitHub Bot (Jira)" To: dev@nutch.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (NUTCH-2834) Deduplication mode via command line in crawl script MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/NUTCH-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17245815#comment-17245815 ] ASF GitHub Bot commented on NUTCH-2834: --------------------------------------- derhecht opened a new pull request #557: URL: https://github.com/apache/nutch/pull/557 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org > Deduplication mode via command line in crawl script > --------------------------------------------------- > > Key: NUTCH-2834 > URL: https://issues.apache.org/jira/browse/NUTCH-2834 > Project: Nutch > Issue Type: Wish > Affects Versions: 1.17 > Reporter: Jakob Berlin > Priority: Minor > > Add possibility to have a command line parameter in crawl script which controls deduplication mode. -- This message was sent by Atlassian Jira (v8.3.4#803005)