nutch-dev mailing list archives

From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-638) Launching Distributed Searchers with URI indicating filesystem to use rather than relying on hadoop config files.
Date Fri, 01 Apr 2011 14:35:09 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-638:
--------------------------------


Bulk close of legacy issues:
http://www.lucidimagination.com/search/document/2738eeb014805854/clean_up_open_legacy_issues_in_jira

> Launching Distributed Searchers with URI indicating filesystem to use rather than relying on hadoop config files.
> -----------------------------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-638
>                 URL: https://issues.apache.org/jira/browse/NUTCH-638
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.0.0
>            Reporter: Aaron Nall
>            Priority: Minor
>         Attachments: distributed-search-uri.patch
>
>   Original Estimate: 0.25h
>  Remaining Estimate: 0.25h
>
> I wanted to conduct all index creation operations in hdfs but search from the local file system using the same cluster of machines.  I believe that this is a common use case.
> This required either a parallel nutch install or edits (scripted or manual) to hadoop-site.xml to change the file system from hdfs to local when starting a distributed searcher service.  This minor patch makes IndexSearcher and NutchBean honor URIs as supported by hadoop 0.17 without altering existing functionality if simple paths are entered.
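
For readers unfamiliar with the idea, the behavior described above can be sketched roughly as follows. This is not the attached patch; it is a minimal, standard-library-only illustration of choosing a filesystem from a location string. The class name FsUriSketch, the method schemeFor, and the host name "namenode" are hypothetical. In Hadoop itself, this kind of resolution is typically done through calls such as Path.getFileSystem(Configuration) or FileSystem.get(URI, Configuration), which the sketch deliberately avoids so that it has no dependencies.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical sketch: pick a filesystem scheme from a location string.
// A full URI (e.g. file:///...) names its own filesystem; a simple path
// falls back to the configured default, so existing usage is unchanged.
public class FsUriSketch {

    // Returns the scheme named by 'location', or 'defaultScheme' when the
    // location is a plain path or cannot be parsed as a URI.
    static String schemeFor(String location, String defaultScheme) {
        try {
            String scheme = new URI(location).getScheme();
            return scheme != null ? scheme : defaultScheme;
        } catch (URISyntaxException e) {
            // Not a valid URI: treat it as a simple path.
            return defaultScheme;
        }
    }

    public static void main(String[] args) {
        // Assume the cluster default (from the Hadoop config) is hdfs.
        String clusterDefault = "hdfs";
        System.out.println(schemeFor("file:///data/crawl", clusterDefault));
        System.out.println(schemeFor("hdfs://namenode:9000/crawl", clusterDefault));
        System.out.println(schemeFor("/data/crawl", clusterDefault));
    }
}
```

With a helper of this shape, a searcher could be started against file:///... on a cluster whose configuration defaults to hdfs, while a bare path such as /data/crawl would still resolve against the configured default.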

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
