tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "James Baker (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1427) PDF Images don't appear in structured view
Date Fri, 03 Oct 2014 11:58:33 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157916#comment-14157916
] 

James Baker commented on TIKA-1427:
-----------------------------------

Image extraction is working and <img> tags are being inserted into the structured view,
but it is inserting them at the bottom of each page. Is it not possible to have them inserted
at the correct location within the document?

> PDF Images don't appear in structured view
> ------------------------------------------
>
>                 Key: TIKA-1427
>                 URL: https://issues.apache.org/jira/browse/TIKA-1427
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: James Baker
>            Assignee: Tim Allison
>              Labels: pdf
>
> When viewing, say, a Word Document, any images appear in the 'structured view' of the
document as <img> tags. The same is not true of PDF documents, and we lose both the
fact that there is an image present, and where it is in the document.
> Some discussion of this issue in the comments of TIKA-1396.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message