tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andreas Beeker (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2
Date Wed, 21 Oct 2015 22:56:27 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14968118#comment-14968118

Andreas Beeker commented on TIKA-1707:

I would replace it with the empty string and use the regex escape for the line break
    fragment.replaceFirst("\\r$", "")

Apart of that, I've added a patch for bullet lists.
Currently HSLF always returns false for super/subscript ... I need to change this in POI.

Please comment, if it makes sense to add further markup information.

> Upgrade to Apache POI 3.13 Beta 2
> ---------------------------------
>                 Key: TIKA-1707
>                 URL: https://issues.apache.org/jira/browse/TIKA-1707
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.9
>            Reporter: Andreas Beeker
>            Assignee: Tim Allison
>         Attachments: common_sl.diff
> In the not so far future, POI 3.13 Beta 2 will be available.
> This contains a quite big change to the Powerpoint modules XSLF/HSLF, but thankfully
TIKA isn't much affected.
> Please try the patch on our trunk and post side-effects.
> As the work on the common_sl api hasn't been finished yet, there might be another patch
for the next POI beta version.

This message was sent by Atlassian JIRA

View raw message