nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation.
Date Tue, 18 Sep 2012 20:25:09 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Lewis John McGibbney updated NUTCH-1294:
----------------------------------------

    Fix Version/s:     (was: 2.1)
                   2.2
    
> IndexClean job with solr implementation.
> ----------------------------------------
>
>                 Key: NUTCH-1294
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1294
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: nutchgora
>            Reporter: Dan Rosher
>            Priority: Minor
>             Fix For: 2.2
>
>         Attachments: NUTCH-1294.patch, NUTCH-1294-v2.patch
>
>
> I started by copying/altering the trunk version of SolrClean, though is was inadequate
for our needs. We needed to mark particular pages as gone even though they still might be
visible on the web, this implementation abstracts the index cleaning process, has a Solr implementation,
and adds a clean index plugin extension that allows others to tailor how pages might be removed
from their store.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message