tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Updated: (TIKA-6) Port Nutch (or better) MimeType detection system into Tika
Date Thu, 20 Sep 2007 03:16:13 GMT

     [ https://issues.apache.org/jira/browse/TIKA-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris A. Mattmann updated TIKA-6:
---------------------------------

    Attachment: TIKA-6.Mattmann.091907.patch.txt

Hi Folks:

This has been a long time coming. I've ported Jerome's new Mime Type repository (based on
freedesktop.org's mime system) to Tika, with most of the effort coming from Jerome and the
legwork to clean it up for Tika coming from me :)

Comments appreciated. If there are no objections, I'd like to commit this in the next 48 hrs.
There are some issues to talk through on this, which I'll send a separate email about soon.

> Port Nutch (or better) MimeType detection system into Tika
> ----------------------------------------------------------
>
>                 Key: TIKA-6
>                 URL: https://issues.apache.org/jira/browse/TIKA-6
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>         Environment: Improvement is indep. of environment
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 0.1-incubator
>
>         Attachments: TIKA-6.Mattmann.091907.patch.txt
>
>
> This patch will contribute a MimeType detection system for Tika, including MImeType data
structure, and associated content-detection facilities. This will be based on Nutch's MimeType
system as a baseline, however, I'm open to suggestions. Jerome Charron mentioned that he had
an implementation of a MimeType system based on FreeDesktop.org's system. We should look into
this as well.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message