tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2264) Better handling of footnotes/endnotes for ODF files
Date Mon, 13 Feb 2017 13:37:41 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15863677#comment-15863677
] 

Tim Allison commented on TIKA-2264:
-----------------------------------

These two are likely related.

> Better handling of footnotes/endnotes for ODF files
> ---------------------------------------------------
>
>                 Key: TIKA-2264
>                 URL: https://issues.apache.org/jira/browse/TIKA-2264
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.14
>         Environment: N/A
>            Reporter: Mike Rodent
>            Priority: Minor
>              Labels: newbie
>         Attachments: ImprovedODFContentParser.java
>
>
> Springs from my question here (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs)
... I have improved the class OpenDocumentContentParser so that it puts footnotes/endnotes
at the end of the line to which they belong and doesn't break up the line in question.  As
with .docx parsing the notes can be linked to the reference easily.  The respondee in Stack
Overflow suggested I open an issue here... 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message