tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2469) False positives with x-ms-owner detection
Date Wed, 11 Oct 2017 14:04:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16200327#comment-16200327
] 

Tim Allison commented on TIKA-2469:
-----------------------------------

I think we can require that there be no control characters (aside from \x00) in the first
x bytes?

> False positives with x-ms-owner detection
> -----------------------------------------
>
>                 Key: TIKA-2469
>                 URL: https://issues.apache.org/jira/browse/TIKA-2469
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 1.15, 1.16
>            Reporter: Luis Filipe Nassif
>            Assignee: Tim Allison
>            Priority: Minor
>         Attachments: C_20106.NLS, C_20297.NLS, x86_microsoft-windows-i..tional-codepage-870_31bf38.nls_c0c54318
>
>
> Attached windows system files are incorrectly detected as application/x-ms-owner. [~tallison@apache.org]
did you add the magic for x-ms-owner? Is it possible to make the magic regex more strict?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message