tika-dev mailing list archives

From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1427) PDF Images don't appear in structured view
Date Fri, 10 Oct 2014 12:26:33 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14166714#comment-14166714 ]

Tim Allison commented on TIKA-1427:
-----------------------------------

[~tilman], well, sure, but you actually know what you're talking about.  :)

I misused the term "inline". I meant images that are rendered in the document, as opposed
to images that are attached as attachments, but you're right, there's an important distinction
in PDF docs.  Thank you for your diagnosis!

@James Baker, sorry, can't fix this.

> PDF Images don't appear in structured view
> ------------------------------------------
>
>                 Key: TIKA-1427
>                 URL: https://issues.apache.org/jira/browse/TIKA-1427
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: James Baker
>            Assignee: Tim Allison
>              Labels: pdf
>         Attachments: images_test.pdf
>
>
> When viewing, say, a Word Document, any images appear in the 'structured view' of the
> document as <img> tags. The same is not true of PDF documents, and we lose both the
> fact that there is an image present, and where it is in the document.
> Some discussion of this issue in the comments of TIKA-1396.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
