nutch-dev mailing list archives

From "Sean Dean (JIRA)" <>
Subject [jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.
Date Sat, 16 Dec 2006 20:20:22 GMT
Sean Dean commented on NUTCH-417:

Speculative execution is now off by default in Hadoop 0.9.2, as per issue HADOOP-827. Since
there were only two other fixes in that release, neither of which should affect Nutch
adversely, can Hadoop be updated to 0.9.2 in trunk?

> After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.
> -----------------------------------------------------------------
>                 Key: NUTCH-417
>                 URL:
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 0.9.0
>            Reporter: Dogacan Güney
>         Attachments: index.patch
> If you parse while fetching then it is fine, but if you run parse as a different job,
> it creates an essentially empty parse_data directory (which has index files, but doesn't have
> data files). I am not sure why this is happening.
> Also, indexing fails at Indexer.OutputFormat.getRecordWriter. The parameter fs seems
> to be an instance of PhasedFileSystem, which throws exceptions on delete and {start,complete}LocalOutput.

This message is automatically generated by JIRA.

