tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-923) iWork keynote Tables are not being parsed
Date Fri, 18 May 2012 09:40:09 GMT

     [ https://issues.apache.org/jira/browse/TIKA-923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Michael McCandless updated TIKA-923:

    Attachment: testTables.key

I created this simple test case (attached) but I see the text inside the table cells being
correctly extracted on Tika's current trunk (rev 1340046).  I'll commit this as a test case...

Erik can you attach an example Keynote table that doesn't extract correctly?  Thanks.
> iWork keynote Tables are not being parsed 
> ------------------------------------------
>                 Key: TIKA-923
>                 URL: https://issues.apache.org/jira/browse/TIKA-923
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Windows 7, 64 bit
>            Reporter: Erik Peterson
>            Priority: Critical
>              Labels: iwork
>         Attachments: testTables.key
> iWork Keynote slides can contain tables, however these are being dropped entirely by
the Tika parser.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message