tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-2352) Incorrect EOF exception in WordPerfect parser
Date Tue, 02 May 2017 13:29:04 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Tim Allison updated TIKA-2352:
    Attachment: 462321.wp

Triggering file.  I think something is going wrong with  {{skipUntilChar}}.  I think we're
accidentally skipping too far and then a multibyte function is incorrectly perceived, leading
to the false expectation that the parser should skip 60430 bytes.

> Incorrect EOF exception in WordPerfect parser
> ---------------------------------------------
>                 Key: TIKA-2352
>                 URL: https://issues.apache.org/jira/browse/TIKA-2352
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Trivial
>         Attachments: 462321.wp
> We have a few EOF exceptions in WordPerfect files that are likely not truncated.  The
example I'll attach shortly is able to be opened without complaint by LibreOffice.

This message was sent by Atlassian JIRA

View raw message