tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1502) Mime magic for database file formats
Date Tue, 23 Dec 2014 03:42:13 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256561#comment-14256561
] 

Hudson commented on TIKA-1502:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #383 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/383/])
TIKA-1502 MySQL and SQLite3 mime types, with magic where possible (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647478)
* /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
Some test database files for TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647473)
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_2.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_3.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_4.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_5.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYD
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYI
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.frm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testSQLITE3.db


> Mime magic for database file formats
> ------------------------------------
>
>                 Key: TIKA-1502
>                 URL: https://issues.apache.org/jira/browse/TIKA-1502
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.6
>            Reporter: Nick Burch
>
> I noticed today that Tika can't detect a lot of common database formats, such as sqlite
or Berkeley DB or MISAM
> The unix file utility got most of those, which makes me think that there's a sensible-ish
header on most we can write some mime magic for
> It'd therefore be good to add mime entries, with magic where possible, for many of these
common database file formats



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message