tika-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1502) Mime magic for database file formats
Date Tue, 23 Dec 2014 07:01:13 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256675#comment-14256675 ]

Hudson commented on TIKA-1502:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.6 #369 (See [https://builds.apache.org/job/tika-trunk-jdk1.6/369/])
Fix test for TIKA-1502 - re-order the MediaTypeRegistry logic for getting the super type,
so that if an explicit inheritance has been defined between one parametered type and another,
that inheritance is used in preference to "drop all parameters" (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647489)
* /tika/trunk/tika-core/src/main/java/org/apache/tika/mime/MediaTypeRegistry.java
* /tika/trunk/tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java
* /tika/trunk/tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
Split the Berkeley DB mimetypes into three levels, and add a detection test (passes) and a
hierarchy test (disabled as it fails) TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647486)
* /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
* /tika/trunk/tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java
* /tika/trunk/tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
Start on magic for subtypes of Berkeley DB TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647485)
* /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
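
For readers following the first commit message above, the re-ordered lookup can be sketched roughly as follows. This is a simplified illustration and not the actual MediaTypeRegistry code: the class SupertypeSketch, its methods, and the plain-string representation of media types are all hypothetical, and the further fallbacks a real registry would have are left out. The only point shown is the order of the two checks: an explicitly declared parent is consulted first, and "drop all parameters" is the fallback.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a media type registry; NOT Tika's implementation.
class SupertypeSketch {
    // Explicit child -> parent links, keyed by the full type string
    // (including any parameters).
    private final Map<String, String> inheritance = new HashMap<>();

    void addSuperType(String type, String superType) {
        inheritance.put(type, superType);
    }

    /** Returns the supertype of the given type, or null if none is known. */
    String getSupertype(String type) {
        // 1. An explicitly declared parent wins, even when the child
        //    type carries parameters.
        String explicit = inheritance.get(type);
        if (explicit != null) {
            return explicit;
        }
        // 2. Otherwise fall back to dropping all parameters.
        int semicolon = type.indexOf(';');
        if (semicolon >= 0) {
            return type.substring(0, semicolon).trim();
        }
        // 3. No parameters and no explicit parent: nothing known here.
        return null;
    }

    public static void main(String[] args) {
        SupertypeSketch registry = new SupertypeSketch();
        // Illustrative three-level chain; the type strings are examples only.
        registry.addSuperType(
            "application/x-berkeley-db; format=btree; version=2",
            "application/x-berkeley-db; format=btree");
        System.out.println(registry.getSupertype(
            "application/x-berkeley-db; format=btree; version=2"));
        System.out.println(registry.getSupertype(
            "application/x-berkeley-db; format=btree"));
    }
}
```

With the checks in the opposite order, the most specific type in a three-level chain would jump straight to the parameterless base type and skip the middle level, which is the kind of hierarchy failure the commit message describes.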


> Mime magic for database file formats
> ------------------------------------
>
>                 Key: TIKA-1502
>                 URL: https://issues.apache.org/jira/browse/TIKA-1502
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.6
>            Reporter: Nick Burch
>             Fix For: 1.7
>
>
> I noticed today that Tika can't detect a lot of common database formats, such as SQLite,
> Berkeley DB, or MyISAM.
> The Unix file utility detected most of those, which suggests that most have a sensible-ish
> header we can write some mime magic for.
> It would therefore be good to add mime entries, with magic where possible, for many of these
> common database file formats.
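
As a rough illustration of the kind of header check the description has in mind, the sketch below tests for the SQLite 3 file header, which begins with the 16 bytes "SQLite format 3" followed by a NUL. The class and method names are hypothetical, and this is not how Tika performs detection; in Tika the equivalent rule would be declared as a magic entry in tika-mimetypes.xml rather than written in Java.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper showing a magic-byte check; not part of Tika.
class SqliteMagicSketch {
    // The SQLite 3 header string: 15 ASCII characters plus a trailing NUL.
    private static final byte[] SQLITE_MAGIC =
        "SQLite format 3\0".getBytes(StandardCharsets.US_ASCII);

    /** Returns true if the buffer starts with the SQLite 3 header string. */
    static boolean looksLikeSqlite3(byte[] header) {
        if (header == null || header.length < SQLITE_MAGIC.length) {
            return false;
        }
        // Compare only the leading bytes; the rest of the file is ignored.
        byte[] prefix = Arrays.copyOf(header, SQLITE_MAGIC.length);
        return Arrays.equals(prefix, SQLITE_MAGIC);
    }

    public static void main(String[] args) {
        byte[] sample = "SQLite format 3\0...".getBytes(StandardCharsets.US_ASCII);
        System.out.println(looksLikeSqlite3(sample));
    }
}
```

Formats whose magic number sits at a non-zero offset, or that store it in either byte order, would need an offset and possibly several candidate values instead of a single prefix comparison.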



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
