nutch-dev mailing list archives

From "Dan Rosher (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (NUTCH-1294) IndexClean job with solr implementation.
Date Thu, 01 Mar 2012 15:47:57 GMT
IndexClean job with solr implementation.
----------------------------------------

                 Key: NUTCH-1294
                 URL: https://issues.apache.org/jira/browse/NUTCH-1294
             Project: Nutch
          Issue Type: Improvement
    Affects Versions: nutchgora
            Reporter: Dan Rosher
            Priority: Minor
             Fix For: nutchgora
         Attachments: NUTCH-1294.patch

I started by copying and altering the trunk version of SolrClean, though it was inadequate for
our needs: we needed to mark particular pages as gone even though they might still be visible
on the web. This implementation abstracts the index cleaning process, provides a Solr implementation,
and adds a clean-index plugin extension that lets others tailor how pages are removed
from their store.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
