tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-1180) Matroska (mkv, mka, webm) Detector
Date Sat, 05 Oct 2013 16:22:41 GMT
Nick Burch created TIKA-1180:

             Summary: Matroska (mkv, mka, webm) Detector
                 Key: TIKA-1180
                 URL: https://issues.apache.org/jira/browse/TIKA-1180
             Project: Tika
          Issue Type: New Feature
          Components: detector
    Affects Versions: 1.5
            Reporter: Nick Burch

Following the work on TIKA-1177, we now have mimetype entries for the various formats which
are based on the Matroska container (mkv, mka, webm etc). However, we are unable to properly
identify the specific type just from some mime magic

Instead, for fully accurate detection, we'll need a new Detector for the Matroska family,
which does some very simple container/stream processing to work out what the container contains

This message was sent by Atlassian JIRA

View raw message