nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (NUTCH-577) Use explicit tika-config.xml file to enable mime magic detection to be turned on and off
Date Mon, 27 Sep 2010 02:42:34 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris A. Mattmann resolved NUTCH-577.
-------------------------------------

    Fix Version/s: 2.0
       Resolution: Fixed

- it's been 3 years since I've reported this issue and Tika's MIME system has evolved significantly
since then (as has Nutch's). I think this issue is moot now, to be honest. I'm resolving it
(as its reporter) as such. If someone else sees a need for this, please by all means, open
up a new issue and link back.

> Use explicit tika-config.xml file to enable mime magic detection to be turned on and
off
> ----------------------------------------------------------------------------------------
>
>                 Key: NUTCH-577
>                 URL: https://issues.apache.org/jira/browse/NUTCH-577
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>    Affects Versions: 1.0.0
>         Environment: Mac Book Pro Intel Core Duo 2.0 Ghz, 2. 0 GB RAM, Mac OS X 10.4,
although improvement is indep. of env.
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>             Fix For: 2.0
>
>
> Currently, there is a configuration file for Tika (which the trunk in Nutch uses for
its mime type detection) called "tika-config.xml" left unexposed (a default one lives in the
tika-0.1-dev.jar file). Tika's mime system has two config files it relies on: tika-mimetypes.xml
(which Nutch has its own version of, that overrides the version that comes with the tika jar
file), and tika-config.xml (to turn on or off magic char detection). We should probably have
a nutch version of tika-config.xml, so that Nutch users can employ magic char mime detection.
I'll get going on this in the next day or so.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message