nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dan Rosher (Updated) (JIRA)" <>
Subject [jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation.
Date Thu, 01 Mar 2012 15:51:57 GMT


Dan Rosher updated NUTCH-1294:

    Attachment:     (was: NUTCH-1294.patch)
> IndexClean job with solr implementation.
> ----------------------------------------
>                 Key: NUTCH-1294
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: nutchgora
>            Reporter: Dan Rosher
>            Priority: Minor
>             Fix For: nutchgora
>         Attachments: NUTCH-1294.patch
> I started by copying/altering the trunk version of SolrClean, though is was inadequate
for our needs. We needed to mark particular pages as gone even though they still might be
visible on the web, this implementation abstracts the index cleaning process, has a Solr implementation,
and adds a clean index plugin extension that allows others to tailor how pages might be removed
from their store.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message