tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Fabian Lange (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1358) Add support for newer iWork file formats
Date Fri, 18 Jul 2014 12:31:05 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14066303#comment-14066303
] 

Fabian Lange commented on TIKA-1358:
------------------------------------

Hi Nick,

I can convert the existing keynote/numbers/pages files. The "problem" with the new files is
that they are Apple "Packages". To most tools this looks like directories.
Shall I provide a zipped version of the directory (this is how many work around the issue)?

> Add support for newer iWork file formats
> ----------------------------------------
>
>                 Key: TIKA-1358
>                 URL: https://issues.apache.org/jira/browse/TIKA-1358
>             Project: Tika
>          Issue Type: Wish
>          Components: parser
>    Affects Versions: 1.5
>            Reporter: Jelle Kastelein
>              Labels: newbie
>
> IWork 2013 uses a revised file format which replaces the xml files that hold the content
by .iwa files (a binary format). This file format is becoming increasingly relevant as more
and more people are using apple products. However, it does not appear to work with the current
IWorkPackageParser (tested with several of the example .pages files one can get from the iCloud).




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message