tika-dev mailing list archives

From "Thilo Goetz (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-14) MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
Date Thu, 13 Sep 2007 06:04:32 GMT

    [ https://issues.apache.org/jira/browse/TIKA-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12526993 ]

Thilo Goetz commented on TIKA-14:
---------------------------------

I'd say application/vnd.oasis.opendocument.text for .odt files, but I'm not an expert.  Here
are some links that Google turned up:

http://framework.openoffice.org/documentation/mimetypes/mimetypes.html
http://books.evc-cit.info/ch01.php#mimetype-table

Here's the IANA page listing the opendocument mime types (and everything else under the sun):

http://www.iana.org/assignments/media-types/application/

There is also quite a bit of discussion around this on the OOo forums, but some of it is quite
old and predates the .odt days.
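
To make the suggestion concrete, here is a minimal sketch of an extension lookup for the
common OpenDocument types. It is not Tika's MimeTypeUtils code: the class OdfMimeTypes, the
method forFileName and the application/octet-stream fallback are assumptions made up for
illustration. Only the four media type strings come from the registered OpenDocument types.

```java
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of Tika: maps OpenDocument file extensions to media types.
final class OdfMimeTypes {
    // Assumed fallback for unknown extensions; the real default in Tika may differ.
    static final String DEFAULT_TYPE = "application/octet-stream";

    private static final Map<String, String> BY_EXTENSION = Map.of(
        "odt", "application/vnd.oasis.opendocument.text",
        "ods", "application/vnd.oasis.opendocument.spreadsheet",
        "odp", "application/vnd.oasis.opendocument.presentation",
        "odg", "application/vnd.oasis.opendocument.graphics");

    // Looks up the media type by the (case-insensitive) extension of the file name.
    static String forFileName(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return DEFAULT_TYPE; // no extension at all
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return BY_EXTENSION.getOrDefault(ext, DEFAULT_TYPE);
    }
}
```

With a table like this, a name such as report.odt would resolve to the text document type
instead of falling through to the default.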

--Thilo

> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
> ---------------------------------------------------------------------------------------
>
>                 Key: TIKA-14
>                 URL: https://issues.apache.org/jira/browse/TIKA-14
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Keith R. Bennett
>             Fix For: 0.1-incubator
>
>
> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open Office) files.
> Because of this, it is not possible to parse OpenOffice files at this time.  I did some
> brief research, and could not find a mime type for Open Office files.  There was a comment
> that the mime type associated with these files is application/zip, since Open Office document
> files are zipped files.  That, of course, will not help us, since it would not be reasonable
> for us to assume that all zip files have Open Office content.
> It is possible that there is now a mime type for Open Office documents, and I just could
> not find it.  (I hope so.)
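
On the application/zip concern above: an OpenDocument package normally carries its own media
type in a ZIP entry named "mimetype", placed first in the archive, so a detector does not
have to treat every zip file as Open Office content. The sketch below shows one way to read
that entry with java.util.zip. The class OdfSniffer and the method declaredType are
hypothetical names, not Tika API, and the code assumes the "mimetype" entry really is the
first one in the archive.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper, not part of Tika: reads the media type an ODF package declares.
final class OdfSniffer {
    // Returns the content of a leading "mimetype" entry, or null when the stream
    // is not a ZIP archive or its first entry has a different name.
    static String declaredType(InputStream in) {
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry first = zip.getNextEntry();
            if (first == null || !"mimetype".equals(first.getName())) {
                return null;
            }
            // Reads to the end of the current entry only, not the whole archive.
            byte[] content = zip.readAllBytes();
            return new String(content, StandardCharsets.US_ASCII).trim();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A plain zip file without that leading entry yields null here, so the caller can fall back
to application/zip for it.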

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

