tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Updated: (TIKA-6) Port Nutch (or better) MimeType detection system into Tika
Date Fri, 21 Sep 2007 04:04:50 GMT

     [ https://issues.apache.org/jira/browse/TIKA-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris A. Mattmann updated TIKA-6:
---------------------------------

    Attachment: TIKA-6.Mattmann.092007.patch.txt

Hey Guys:

Updated patch that tries to address comments by Jukka and Bertrand. Thanks for the comments
folks.

If there are no objections to this, I'd like to get it committed within the next 48 hrs.

Additionally, once committed, it would be great to get some responses to my early email regarding
configuring TIKA. It's the info that I need to know to link up the mime type detection here
from TIKA-6 to the parser stuff contributed by Rida.

Thanks,
  Chris


> Port Nutch (or better) MimeType detection system into Tika
> ----------------------------------------------------------
>
>                 Key: TIKA-6
>                 URL: https://issues.apache.org/jira/browse/TIKA-6
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>         Environment: Improvement is indep. of environment
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 0.1-incubator
>
>         Attachments: TIKA-6.Mattmann.091907.patch.txt, TIKA-6.Mattmann.092007.patch.txt
>
>
> This patch will contribute a MimeType detection system for Tika, including MImeType data
structure, and associated content-detection facilities. This will be based on Nutch's MimeType
system as a baseline, however, I'm open to suggestions. Jerome Charron mentioned that he had
an implementation of a MimeType system based on FreeDesktop.org's system. We should look into
this as well.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message