tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Keith R. Bennett (JIRA)" <j...@apache.org>
Subject [jira] Updated: (TIKA-14) MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
Date Fri, 14 Sep 2007 15:59:32 GMT

     [ https://issues.apache.org/jira/browse/TIKA-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Keith R. Bennett updated TIKA-14:
---------------------------------

    Attachment: tika-14.patch

Adds the Open Office file extension to the list of support file types, with:

+        } else if (name.endsWith(".odt")) {
+            return "application/vnd.oasis.opendocument.text";

Also adds more thorough testing in the test class.


> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
> ---------------------------------------------------------------------------------------
>
>                 Key: TIKA-14
>                 URL: https://issues.apache.org/jira/browse/TIKA-14
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Keith R. Bennett
>             Fix For: 0.1-incubator
>
>         Attachments: tika-14.patch
>
>
> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
> Because of this, it is not possible to parse OpenOffice files at this time.  I did some
brief research, and could not find a mime type for Open Office files.  There was a comment
that the mime type associated with these files is application/zip, since Open Office document
files are zipped files.  That, of course, will not help us, since it would not be reasonable
for us to assume that all zip files have Open Office content.
> It is possible that there is now a mime type for Open Office documents, and I just could
not find it.  (I hope so.)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message