tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amit Kumar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2249) Tika not able to parse tables from pdf
Date Wed, 01 Feb 2017 11:21:51 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15848246#comment-15848246

Amit Kumar commented on TIKA-2249:

[~tallison@mitre.org] : I tried Aspose.Pdf for Java(free trial version), it works quite well
in converting pdf into html. To use it seemlessly I would have to buy the license, ahhh...
Just wished we could have such solutions in open source as well. Definitely Aspose team has
done something which Tika is lacking and can build upon.

> Tika not able to parse tables from pdf 
> ---------------------------------------
>                 Key: TIKA-2249
>                 URL: https://issues.apache.org/jira/browse/TIKA-2249
>             Project: Tika
>          Issue Type: Bug
>          Components: handler
>            Reporter: Amit Kumar
>         Attachments: Japanese.pdf
> Tika not able to parse tables from pdf. I want to attach sample pdf which I tried but
attachment/browse link is not visible to me.

This message was sent by Atlassian JIRA

View raw message