[ https://issues.apache.org/jira/browse/TIKA-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165065#comment-14165065
]
Tim Allison commented on TIKA-1439:
-----------------------------------
Hi [~sunxingzhe359],
Thanks to your post with test file on PDFBOX-2393 and [~tilman]'s pinging me and supplying
example code, I think I fixed this issue in trunk with TIKA-1433. If trunk doesn't work for
your test file, let me know; otherwise, I'll close this out as a duplicate in a few days.
Thank you, again!
Best,
Tim
> PDF embeded with document can not parse.
> ----------------------------------------
>
> Key: TIKA-1439
> URL: https://issues.apache.org/jira/browse/TIKA-1439
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.6
> Environment: windows7
> Reporter: sunxingzhe
> Labels: pdfbox
> Attachments: PDF2XHTML.java_diff.html
>
>
> I insert a Excel file into the pdf file.
> But can not extracte embedded excel resources.
> The attachment file PDF2XHTML.java_diff.html is the diff file.
> Please confirm it.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
|