tika-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1563) Use .gz as the default extension for application/gzip
Date Sun, 01 Mar 2015 18:44:05 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342379#comment-14342379 ]

Hudson commented on TIKA-1563:

SUCCESS: Integrated in tika-trunk-jdk1.7 #521 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/521/])
TIKA-1563 Put the more common gzip file extension (.gz) first in the glob list (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1663138)
* /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml

> Use .gz as the default extension for application/gzip
> -----------------------------------------------------
>                 Key: TIKA-1563
>                 URL: https://issues.apache.org/jira/browse/TIKA-1563
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>            Reporter: Adam Lamar
>            Priority: Minor
>             Fix For: 1.8
> This ticket is in reference to the following code:
> MimeType mimeType = config.getMimeRepository().forName("application/gzip"); 
> String extension = mimeType.getExtension();
> The string extension will be ".tgz". While it is possible to get .gz or other common
> extensions with mimeType.getExtensions(), it would be nice if the default was .gz. Every .tgz
> is an application/gzip, but not every application/gzip is a .tgz.
> Note that although the code above uses the static mime type application/gzip, it is more
> likely that real-world code is performing mime type detection on specific files and looking
> up the returned type in the mime registry. Returning .gz for a gzipped tarball isn't wrong,
> but is less specific than its real type. However, returning .tgz for gzipped text is incorrect.
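
The commit message above says the fix was to put .gz first in the glob list, which suggests the default extension follows glob order. The following is a minimal, self-contained sketch of that idea only. It does not use Tika: the class GlobOrderSketch, both helper methods, and the two-entry glob lists are hypothetical and simplified for illustration, and the real entry in tika-mimetypes.xml may list other patterns.

```java
import java.util.ArrayList;
import java.util.List;

public class GlobOrderSketch {

    // Hypothetical helper (not a Tika API): converts globs of the
    // form "*.gz" into extensions of the form ".gz", keeping order.
    static List<String> extensions(List<String> globs) {
        List<String> result = new ArrayList<>();
        for (String glob : globs) {
            if (glob.startsWith("*.")) {
                result.add(glob.substring(1)); // drop the leading '*'
            }
        }
        return result;
    }

    // Assumption for this sketch: the default extension is simply the
    // first one in declaration order, or "" when there is none.
    static String defaultExtension(List<String> globs) {
        List<String> all = extensions(globs);
        return all.isEmpty() ? "" : all.get(0);
    }

    public static void main(String[] args) {
        // Assumed, simplified glob lists before and after the change.
        List<String> before = List.of("*.tgz", "*.gz");
        List<String> after = List.of("*.gz", "*.tgz");

        System.out.println(defaultExtension(before)); // prints .tgz
        System.out.println(defaultExtension(after));  // prints .gz
    }
}
```

Under this assumption, reordering the globs changes only the default. Both extensions remain available to callers that ask for the full list.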

This message was sent by Atlassian JIRA
