tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adam Wilmer (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser
Date Tue, 14 Sep 2010 14:37:33 GMT

    [ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909268#action_12909268
] 

Adam Wilmer commented on TIKA-408:
----------------------------------

I see POI 3.7-beta2 with this change is released and the tika dependency updated. Is there
any update on enabling support for the older word format in tika? currently the following
exception is being thrown

Caused by: org.apache.poi.hwpf.OldWordFileFormatException: The document is too old - Word
95 or older. Try HWPFOldDocument instead?

Happy to offer any assistance if i can be of help.

> Word 6.0/7.0 documents support in office parser
> -----------------------------------------------
>
>                 Key: TIKA-408
>                 URL: https://issues.apache.org/jira/browse/TIKA-408
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Dmitry Kuzmenko
>            Assignee: Nick Burch
>            Priority: Minor
>         Attachments: testWORD6.doc, word6.patch.gz
>
>
> Current office parser doesn't support old Word 6.0/7.0 documents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message