nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sami Siren <ssi...@gmail.com>
Subject Re: [jira] Resolved: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Date Wed, 04 Mar 2009 16:02:11 GMT

Alternatively you could create another issue to track the proper fix and 
let this close during the release process.

--
 Sami Siren

Andrzej Bialecki (JIRA) wrote:
>      [ https://issues.apache.org/jira/browse/NUTCH-711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
>
> Andrzej Bialecki  resolved NUTCH-711.
> -------------------------------------
>
>     Resolution: Fixed
>
> Applied the patch in rev. 750037. I'm not closing this issue, because this needs to be
solved in a better way after 1.0.
>
>   
>> Indexer failing after upgrade to Hadoop 0.19.1
>> ----------------------------------------------
>>
>>                 Key: NUTCH-711
>>                 URL: https://issues.apache.org/jira/browse/NUTCH-711
>>             Project: Nutch
>>          Issue Type: Bug
>>    Affects Versions: 1.0.0
>>            Reporter: Andrzej Bialecki 
>>            Assignee: Andrzej Bialecki 
>>            Priority: Blocker
>>             Fix For: 1.0.0
>>
>>         Attachments: patch.txt
>>
>>
>> After upgrade to Hadoop 0.19.1 Reducer is initialized in a different order than before
(see http://svn.apache.org/viewvc?view=rev&revision=736239). IndexingFilters populate
current JobConf with field options that are required for IndexerOutputFormat to function properly.
However, the filters are instantiated in Reducer.configure(), which is now called after the
OutputFormat is initialized, and not before as previously.
>> The workaround for now is to instantiate IndexinigFilters once again inside IndexerOutputFormat.
 This issue should be revisited before 1.1 in order to find a better solution.
>> See this thread for more information: http://www.lucidimagination.com/search/document/7c62c625c7ea17fe/problem_with_crawling_using_the_latest_1_0_trunk
>>     
>
>   


Mime
View raw message