tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mihai Glont (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2194) matlab files detected as 'text/plain'
Date Mon, 12 Dec 2016 10:10:59 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15741535#comment-15741535
] 

Mihai Glont commented on TIKA-2194:
-----------------------------------

thanks for the feedback, Nick! I thought this line

{code:java}
    metadata.set(Metadata.RESOURCE_NAME_KEY, f.name)
{code}

was telling Tika the file's name. I recall seeing this in Tika's own unit tests for assertType.

Should I use something else instead?

> matlab files detected as 'text/plain'
> -------------------------------------
>
>                 Key: TIKA-2194
>                 URL: https://issues.apache.org/jira/browse/TIKA-2194
>             Project: Tika
>          Issue Type: Bug
>          Components: detector, mime
>    Affects Versions: 1.9, 1.14
>            Reporter: Mihai Glont
>
> matlab files from https://issues.apache.org/jira/browse/TIKA-1634 are reported to have
mime type 'text/plain' with either DefaultDetector or MimeTypes. I am able to reproduce the
problem by running the following Groovy script https://gist.github.com/mglont/16630c8a66fdddaaa7aa44820d6f021f



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message