nutch-commits mailing list archives

From mar...@apache.org
Subject [nutch] 02/03: NUTCH-2692 Subcollection to support case-insensitive white and black lists
Date Fri, 22 Feb 2019 15:49:07 GMT
This is an automated email from the ASF dual-hosted git repository.

markus pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git

commit 3fa2f4a7efac598258eb01a4387b5fde43c1a813
Author: Markus Jelsma <markus@apache.org>
AuthorDate: Fri Feb 22 16:46:42 2019 +0100

    NUTCH-2692 Subcollection to support case-insensitive white and black lists
---
 conf/host-protocol-mapping.txt | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/conf/host-protocol-mapping.txt b/conf/host-protocol-mapping.txt
new file mode 100644
index 0000000..d0a1b70
--- /dev/null
+++ b/conf/host-protocol-mapping.txt
@@ -0,0 +1,11 @@
+# This file defines a hostname to protocol plugin mapping. Each line takes a
+# host name followed by a tab, followed by the ID of the protocol plugin. You
+# can find the ID in the protocol plugin's plugin.xml file.
+# 
+# <hostname>\t<plugin_id>\n
+# nutch.apache.org	org.apache.nutch.protocol.httpclient.Http
+# tika.apache.org	org.apache.nutch.protocol.http.Http
+#
+nutch.apache.org	org.apache.nutch.protocol.httpclient.Http
+tika.apache.org	org.apache.nutch.protocol.http.Http
+
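
The comment header in the new file describes a simple line format: a host name, a tab, then the protocol plugin ID. As a rough illustration of how such a file could be read, here is a minimal Python sketch. It is not Nutch's own loader: the function name `parse_host_protocol_mapping`, the decision to skip malformed lines silently, and the stripping of surrounding whitespace are all assumptions made for this example.

```python
from typing import Dict, Iterable


def parse_host_protocol_mapping(lines: Iterable[str]) -> Dict[str, str]:
    """Parse '<hostname>\\t<plugin_id>' lines into a host -> plugin ID dict.

    Hypothetical helper for illustration only; not part of Nutch.
    """
    mapping: Dict[str, str] = {}
    for raw in lines:
        line = raw.rstrip("\r\n")
        # Skip blank lines and comment lines starting with '#'.
        if not line.strip() or line.lstrip().startswith("#"):
            continue
        parts = line.split("\t")
        # Assumption: lines without exactly one tab are ignored.
        if len(parts) != 2:
            continue
        host, plugin_id = parts[0].strip(), parts[1].strip()
        if host and plugin_id:
            mapping[host] = plugin_id
    return mapping


# Sample input mirroring the entries added by this commit.
sample = [
    "# comment line",
    "nutch.apache.org\torg.apache.nutch.protocol.httpclient.Http",
    "tika.apache.org\torg.apache.nutch.protocol.http.Http",
    "",
]
mapping = parse_host_protocol_mapping(sample)
print(mapping["tika.apache.org"])
```

To read a real file, the same function can be passed an open file handle, since it only iterates over lines.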

