tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Antoni Mylka (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-562) In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent
Date Wed, 01 Dec 2010 07:30:11 GMT

    [ https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965590#action_12965590
] 

Antoni Mylka commented on TIKA-562:
-----------------------------------

Your unit tests test identification by name and by data. This problem is exhibited when you
try to identify a file using both name and data (quite a common case). The patch modifies
five mime type definitions. Four of them already have their examples in test-documents. These
are: testEXCEL.xlsb, testPPT.pptm, testPPT.potm, testPPT.pptm. Only ppam is missing. Will
see what I can do.

> In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent
> ----------------------------------------------------------------------------
>
>                 Key: TIKA-562
>                 URL: https://issues.apache.org/jira/browse/TIKA-562
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Antoni Mylka
>         Attachments: ooxml-children.patch
>
>
> A couple of file types have application/x-tika-msoffice as their parent, when they should
have application/x-tika-ooxml. This error is exhibited when you try to identify those files
with both name and data. The data is found to be x-tika-ooxml, while the type found with the
name is correct, but since it's not a subtype of x-tika-ooxml - it is not returned.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message