tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amit Kumar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2249) Tika not able to parse tables from pdf
Date Mon, 23 Jan 2017 14:33:26 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15834661#comment-15834661
] 

Amit Kumar commented on TIKA-2249:
----------------------------------

So when Tika claims to parse pdf to HTML and in the resultant HTML if tables are not preserved
i.e. no table tags then isn't it wrong?

Thanks for showing where to attach file. Done :)

> Tika not able to parse tables from pdf 
> ---------------------------------------
>
>                 Key: TIKA-2249
>                 URL: https://issues.apache.org/jira/browse/TIKA-2249
>             Project: Tika
>          Issue Type: Bug
>          Components: handler
>            Reporter: Amit Kumar
>         Attachments: Japanese.pdf
>
>
> Tika not able to parse tables from pdf. I want to attach sample pdf which I tried but
attachment/browse link is not visible to me.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message