tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (TIKA-2576) Add application/zstd detection and parser
Date Wed, 07 Mar 2018 00:56:00 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tim Allison resolved TIKA-2576.
-------------------------------
       Resolution: Fixed
    Fix Version/s: 2.0.0
                   1.18

Thank you, [~AndreasMeier]!

> Add application/zstd detection and parser
> -----------------------------------------
>
>                 Key: TIKA-2576
>                 URL: https://issues.apache.org/jira/browse/TIKA-2576
>             Project: Tika
>          Issue Type: Improvement
>          Components: detector, parser
>            Reporter: Andreas Meier
>            Priority: Minor
>             Fix For: 1.18, 2.0.0
>
>         Attachments: huffman-compressed-larger, huffmann-compressed-larger-result.txt
>
>
> The IETF is currently checking the specification of Zstandard compression and the application/zstd
Media Type: [https://tools.ietf.org/id/draft-kucherawy-dispatch-zstd-01.html|https://tools.ietf.org/id/draft-kucherawy-dispatch-zstd-01.html]
> As soon as the MediaType application/zstd is set as standard the Media Type shall be
implemented.
> Possible mime-detection for tika-mimetypes.xml (second comment has to be changed when
the standard is final):
> {code:xml}
>   <mime-type type="application/zstd">
>     <_comment>https://en.wikipedia.org/wiki/Zstandard</_comment>
>     <_comment>https://tools.ietf.org/id/draft-kucherawy-dispatch-zstd-01.html</_comment>
>     <magic priority="50">
>       <match value="0xFD2FB528" type="little32" offset="0"/>
>     </magic>
>     <glob pattern="*.zstd"/>
>   </mime-type>
> {code}
> commons-compress version 1.16 and later provide a compressor and decompressor for the
algorithm, based on com.github.luben zstd-jni [https://github.com/luben/zstd-jni|https://github.com/luben/zstd-jni]
> Attached sampe zstd file (huffman-compressed-larger) and the result after decompressing
it.
> Decompression was done with commons-compress 1.16.1 and zstd-jni 1.3.3-3
> {code:xml}
> <dependency>
>   <groupId>org.apache.commons</groupId>
>   <artifactId>commons-compress</artifactId>
>   <version>1.16.1</version>
> </dependency>
> <dependency>
>   <groupId>com.github.luben</groupId>
>   <artifactId>zstd-jni</artifactId>
>   <version>1.3.3-3</version>
> </dependency>
> {code}
> Regards
> Andreas



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message