nutch-dev mailing list archives

From "Dennis Kubes (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter
Date Tue, 09 Oct 2007 20:45:50 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12533498 ]

Dennis Kubes commented on NUTCH-565:
------------------------------------

Both jars are LGPL-licensed. The archive-commons jar is from archive.org and is currently used in NutchWax. The fastutil jar is a subset of the fastutil classes used by archive.org.

> Arc File to Nutch Segments Converter
> ------------------------------------
>
>                 Key: NUTCH-565
>                 URL: https://issues.apache.org/jira/browse/NUTCH-565
>             Project: Nutch
>          Issue Type: Improvement
>         Environment: all
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.0.0
>
>         Attachments: archive-commons-1.11.0-200612262257.jar, fastutil-5.0.3-heritrix-subset-1.0.jar, nutch-565-1-20071009.patch
>
>
> Functionality that allows ARC files, such as those produced by the Internet Archive project or by the Grub distributed crawler, to be parsed into Nutch segments.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

