nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrzej Bialecki (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-760) Allow field mapping from nutch to solr index
Date Wed, 25 Nov 2009 21:00:39 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782617#action_12782617
] 

Andrzej Bialecki  commented on NUTCH-760:
-----------------------------------------

I reworked the patch to get rid of any left-overs of static Configuration, and changed the
concept of "schema" (which was misleading) to "mapping" throughout the patch and class names.

This is now committed in rev. 884269 - thanks!

> Allow field mapping from nutch to solr index
> --------------------------------------------
>
>                 Key: NUTCH-760
>                 URL: https://issues.apache.org/jira/browse/NUTCH-760
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: David Stuart
>            Assignee: Andrzej Bialecki 
>             Fix For: 1.1
>
>         Attachments: solrindex_schema.patch, solrindex_schema.patch, solrindex_schema.patch,
solrindex_schema.patch
>
>
> I am using nutch to crawl sites and have combined it
> with solr pushing the nutch index using the solrindex command. I have
> set it up as specified on the wiki using the copyField url to id in the
> schema. Whilst this works fine it is stuff's up my inputs from other
> sources in solr (e.g. using the solr data import handler) as they have
> both id's and url's. I have patch that implements a nutch xml schema
> defining what basic nutch fields map to in your solr push.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message