tika-dev mailing list archives

From "Johan (Jira)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-3007) Heic images are detected as "application/mp4" when using tika as server
Date Tue, 17 Dec 2019 11:47:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998119#comment-16998119 ]

Johan commented on TIKA-3007:


OK, I understand it a bit better now, but don't you still think that -j (metadata as JSON) should
return the JSON version of -m when you run it on the HEIC/HEIF image file? To me that seems
broken, compared with calling -j and -m on another file type, where both return the same
metadata.

Is there also a way to know whether a Parser is available for a certain file type?
And is there any rationale for not adding the detected mime type to the metadata output as well?
Right now we need to make at least 2 calls: first to detect the type and then to get the metadata.
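
For illustration, the two calls could look roughly like the sketch below when talking to tika-server over HTTP. It assumes the server runs at its default address (http://localhost:9998) and uses its /detect/stream and /meta endpoints; the helper names build_request and detect_and_meta are made up for this example and are not part of Tika.

```python
import json
import urllib.request

# Assumption: tika-server is running locally on its default port.
TIKA_URL = "http://localhost:9998"


def build_request(path, data, accept):
    """Build (but do not send) a PUT request for a tika-server endpoint."""
    return urllib.request.Request(
        TIKA_URL + path,
        data=data,
        method="PUT",
        headers={"Accept": accept},
    )


def detect_and_meta(data, send=urllib.request.urlopen):
    """Make the two calls: detect the type first, then fetch the metadata.

    `send` is injectable so the flow can be exercised without a live server.
    """
    # Call 1: ask the server which mime type it detects for these bytes.
    with send(build_request("/detect/stream", data, "text/plain")) as resp:
        mime = resp.read().decode("utf-8").strip()
    # Call 2: ask for the metadata of the same bytes, as JSON.
    with send(build_request("/meta", data, "application/json")) as resp:
        meta = json.loads(resp.read().decode("utf-8"))
    return mime, meta
```

With a running server and a local file (photo.heic is a placeholder name), this would be called as `mime, meta = detect_and_meta(open("photo.heic", "rb").read())`. The same bytes are uploaded twice, which is the overhead the question above is about.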

> Heic images are detected as "application/mp4" when using tika as server
> -----------------------------------------------------------------------
>                 Key: TIKA-3007
>                 URL: https://issues.apache.org/jira/browse/TIKA-3007
>             Project: Tika
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 1.23
>            Reporter: Johan
>            Priority: Blocker
> Related to https://issues.apache.org/jira/browse/TIKA-2942
> It seems the detection of HEIC images is working for the standalone jar (tika-app-1.23)
> but not for the server component (tika-server-1.23).
> tika-app-1.23.jar from [https://archive.apache.org/dist/tika/] detects the image as
> image/heic, but the server component tika-server-1.23.jar still returns
> "application/mp4". Any clue what might be going wrong? Has the code been added only to
> the tika jar client and not to the server?

This message was sent by Atlassian Jira