tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-289) Add magic byte patterns from file(1)
Date Sun, 01 Mar 2015 14:08:04 GMT

     [ https://issues.apache.org/jira/browse/TIKA-289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Nick Burch updated TIKA-289:
    Attachment: file-has-magic-tika-missing.txt

{{file-has-magic-tika-missing.txt}} is the list of mime types where file(1) has magic but
Tika does not, where both know about the same mime type. Note that there may be some false
positives on this list, eg where Tika has the magic on a parent type

> Add magic byte patterns from file(1)
> ------------------------------------
>                 Key: TIKA-289
>                 URL: https://issues.apache.org/jira/browse/TIKA-289
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>            Reporter: Jukka Zitting
>            Priority: Minor
>         Attachments: file-has-magic-tika-missing.txt, file-mimes-missing.txt
> As discussed in TIKA-285, the file(1) command comes with a pretty comprehensive set of
magic byte patterns. It would be nice to get those patterns included also in Tika.

This message was sent by Atlassian JIRA

View raw message