tika-dev mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1882) Scientific MIME updates to .cab files, .xar and .mobi and .mov files based on TREC-DD-Polar analysis
Date Tue, 19 Apr 2016 05:24:25 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15247213#comment-15247213 ]

ASF GitHub Bot commented on TIKA-1882:
--------------------------------------

Github user asfgit closed the pull request at:

    https://github.com/apache/tika/pull/82


> Scientific MIME updates to .cab files, .xar and .mobi and .mov files based on TREC-DD-Polar analysis
> ----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1882
>                 URL: https://issues.apache.org/jira/browse/TIKA-1882
>             Project: Tika
>          Issue Type: Sub-task
>          Components: mime
>    Affects Versions: 1.11
>            Reporter: Manisha Kampasi
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>              Labels: memex, nsfpolar, patch
>             Fix For: 1.13
>
>
> The following mime magic can be added to better detect the below mime-types:
> 1. application/vnd.ms-cab-compressed (.cab files) - pattern "MSCF" in the first 4 bytes
> 2. application/vnd.xara (.xar files) - pattern "xar!" in the first 4 bytes
> 3. application/x-mobipocket-ebook (.mobi files) - pattern "BOOKMOBI" starting at byte position 60
> 4. video/quicktime (.mov files) - patterns "free" and "wide" seen starting at byte position 4
> The changes can be seen here:
> https://github.com/mkampasi/tika/commit/f7433daf434a44937ba3ae8b15813a768f95e334
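
For readers unfamiliar with offset-based magic detection, the rules listed above can be sketched roughly as follows. This is only an illustration in Python, not Tika's implementation and not the contents of the linked commit. The names MAGIC_RULES and detect_mime are hypothetical, and the fallback type application/octet-stream is an assumption. The cabinet signature is written as "MSCF", the byte sequence that Microsoft Cabinet files begin with.

```python
# Illustrative sketch only: NOT Tika's code. MAGIC_RULES and detect_mime are
# hypothetical names; offsets and patterns follow the issue description.
MAGIC_RULES = [
    # (byte offset, magic bytes, MIME type)
    (0, b"MSCF", "application/vnd.ms-cab-compressed"),
    (0, b"xar!", "application/vnd.xara"),
    (60, b"BOOKMOBI", "application/x-mobipocket-ebook"),
    (4, b"free", "video/quicktime"),
    (4, b"wide", "video/quicktime"),
]


def detect_mime(data: bytes, default: str = "application/octet-stream") -> str:
    """Return the MIME type of the first rule whose magic matches at its offset."""
    for offset, magic, mime in MAGIC_RULES:
        # Slicing past the end of the buffer yields a shorter (non-matching) value.
        if data[offset:offset + len(magic)] == magic:
            return mime
    # Assumed fallback when no rule matches.
    return default


if __name__ == "__main__":
    sample = b"\x00" * 60 + b"BOOKMOBI" + b"\x00" * 16
    print(detect_mime(sample))
```

A real detector would also weigh rule priorities and file-name globs; the sketch checks the rules in list order and stops at the first match.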



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
