nutch-dev mailing list archives

From "Markus Jelsma (JIRA)" <>
Subject [jira] [Updated] (NUTCH-480) Searching multiple indexes with a single nutch instance
Date Fri, 01 Apr 2011 14:35:07 GMT


Markus Jelsma updated NUTCH-480:

Bulk close of legacy issues:

> Searching multiple indexes with a single nutch instance
> -------------------------------------------------------
>                 Key: NUTCH-480
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher, web gui
>    Affects Versions: 0.8
>         Environment: Linux and Windows
>            Reporter: Ravi Chintakunta
>         Attachments:
> Searching across multiple indexes with a single instance of Nutch is a cool feature improvement.
> I had this requirement for my production site, where we wanted to list the available categories
> (indexes) to search as check boxes and the user could select any combination of indexes to
> search. The results page also displays the number of hits in each index.
> To do this:
> - I modified web.xml to include the paths to various search indexes
> - Modified to read all the indexes and create IndexReaders
> - Modified to handle multiple IndexReaders
> In the attached file you will find the patch to the Nutch 0.8 code base and also the
> newly added files:
> - SearchServlet - a servlet that is the web interface for search. This is a simplified
> version of the JSP versions (without the i18n) and outputs the results in text, XML or JSON format.
> - SearchConstants - an interface for messages and constants
> Please note that the patch includes the functionality for spell check - aka "Did you
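
As a rough illustration of the behaviour described in the quoted issue (several named indexes, a user-selected subset of them, and a hit count reported per index), here is a minimal Java sketch. It is not taken from the attached patch and does not use Nutch or Lucene classes: `NamedIndex` and `countHits` are hypothetical names, and the case-insensitive substring match only stands in for a real query run against an IndexReader.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

public class MultiIndexSearchSketch {

    // Hypothetical stand-in for one search index: a category name plus its documents.
    // In a real setup this would wrap an index reader opened from a configured path.
    static final class NamedIndex {
        final String name;
        final List<String> documents;

        NamedIndex(String name, List<String> documents) {
            this.name = name;
            this.documents = documents;
        }
    }

    // Counts matching documents in each selected index.
    // The result keeps the order in which the indexes were registered.
    static Map<String, Integer> countHits(List<NamedIndex> indexes,
                                          Set<String> selected,
                                          String term) {
        Map<String, Integer> hitsPerIndex = new LinkedHashMap<>();
        String needle = term.toLowerCase(Locale.ROOT);
        for (NamedIndex index : indexes) {
            if (!selected.contains(index.name)) {
                continue; // the user did not tick this category
            }
            int hits = 0;
            for (String doc : index.documents) {
                // Assumption: a substring match replaces a real index query.
                if (doc.toLowerCase(Locale.ROOT).contains(needle)) {
                    hits++;
                }
            }
            hitsPerIndex.put(index.name, hits);
        }
        return hitsPerIndex;
    }

    public static void main(String[] args) {
        List<NamedIndex> indexes = List.of(
            new NamedIndex("news", List.of("Nutch release notes", "Lucene news")),
            new NamedIndex("docs", List.of("Nutch tutorial", "Nutch FAQ", "Hadoop guide")),
            new NamedIndex("blogs", List.of("Crawling with Nutch")));

        // Search only the "news" and "docs" categories.
        Map<String, Integer> hits = countHits(indexes, Set.of("news", "docs"), "nutch");
        System.out.println(hits);
    }
}
```

Keeping the counts in a map keyed by index name is one simple way to let a results page show the number of hits per category next to the merged result list.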

This message is automatically generated by JIRA.
For more information on JIRA, see:
