nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Nutch Wiki] Update of "PluginCentral" by JorgeLuis
Date Wed, 19 Nov 2014 21:39:38 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "PluginCentral" page has been changed by JorgeLuis:

Modifying a plugin description for clarity

   * [[|index-extra]] - Adds user-configurable
fields to the index.
   * [[|protocol-smb]] - Allows Nutch to crawl
MS Windows Shares folder.
   * [[IndexMetatags|Index HTML Metatags]]: allows to parse HTML metatags and store them in
separate index fields
-  * [[|mimetype-filter]] - Allows Nutch to filter
crawled documents before indexing.
+  * [[|mimetype-filter]] - Allows Nutch to filter
crawled documents before indexing by the extracted MIME type.
   * [[|links-extractor]] - Allows Nutch to index
the inlinks and outlinks of any Web page.

View raw message