nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki ...@getopt.org>
Subject Re: [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework
Date Tue, 09 Oct 2007 20:57:21 GMT
Chris A. Mattmann (JIRA) wrote:
>      [ https://issues.apache.org/jira/browse/NUTCH-562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
> 
> Chris A. Mattmann closed NUTCH-562.
> -----------------------------------
> 
> 
> - Patch applied to trunk in r583016

I think this issue didn't get enough attention before it was committed. 
I agree with the direction of this patch - functionality-wise the mime 
type detector in Tika is clearly superior to the one that we have now in 
Nutch - but I feel that the use of an external framework, which is not 
yet released, should be discussed first, and the proper working of the 
patch should be confirmed by other users. There was too little time to 
do this before the commit.

I vote for reverting this patch, unless there is an overall consensus 
among Nutch developers that it's ok to keep it as it is - on one hand 
considering the added functionality and simplification of Nutch code, 
and on the other hand considering the (lack of) maturity of Tika.

-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Mime
View raw message