tika-dev mailing list archives

From "Pascal Essiembre (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2352) Incorrect EOF exception in WordPerfect parser
Date Thu, 04 May 2017 19:02:04 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15997238#comment-15997238 ]

Pascal Essiembre commented on TIKA-2352:
----------------------------------------

FYI, "commoncrawl2_likely_broken\W4\W4YNRCMM3TPKQSU24LS6T2PEVWD2FU7Y" and "commoncrawl2\4L\4LCO3UGXCLRSHCKSNB2DDW3MNLE7KP3N"
definitely look broken.  Neither shows any readable words when opened in a text editor
(you should normally see at least some).  One cannot be opened by LibreOffice, and the other
appears empty when opened there.

> Incorrect EOF exception in WordPerfect parser
> ---------------------------------------------
>
>                 Key: TIKA-2352
>                 URL: https://issues.apache.org/jira/browse/TIKA-2352
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Trivial
>             Fix For: 2.0, 1.15
>
>         Attachments: 462321.wp, reports.zip
>
>
> We have a few EOF exceptions in WordPerfect files that are likely not truncated.  The
> example I'll attach shortly is able to be opened without complaint by LibreOffice.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
