tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gabriel Valencia (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents
Date Wed, 02 May 2012 18:34:49 GMT

    [ https://issues.apache.org/jira/browse/TIKA-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13266797#comment-13266797

Gabriel Valencia commented on TIKA-906:

This document also had automatic page numbering in the footer, but that doesn't get parsed.
It's contained in the sf:p in the sf:footer as an sf:page-number. However, it only has one
of them even though there are 2 pages. I guess the rest are automatically added by Pages.
> Headers, footers, and footnotes not extracted from Pages documents
> ------------------------------------------------------------------
>                 Key: TIKA-906
>                 URL: https://issues.apache.org/jira/browse/TIKA-906
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Windows 7
>            Reporter: Gabriel Valencia
>              Labels: iWork
>             Fix For: 1.2
>         Attachments: testPagesHeadersFootersFootnotesJIRA.pages
> Tika does not extract anything from the header or footer area and also does not extract

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message