tika-dev mailing list archives

From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (TIKA-1229) Hyperlink in .doc page header broken
Date Tue, 04 Feb 2014 22:34:11 GMT

     [ https://issues.apache.org/jira/browse/TIKA-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-1229.
------------------------------

       Resolution: Fixed
    Fix Version/s: 1.5

Should be fixed as of r1564540. It ended up being a bit more work than first anticipated,
as we were processing headers and footers in a very simplistic way. That has now been
replaced with handling them as proper ranges.

> Hyperlink in .doc page header broken
> ------------------------------------
>
>                 Key: TIKA-1229
>                 URL: https://issues.apache.org/jira/browse/TIKA-1229
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Lutz Theurer
>            Priority: Minor
>             Fix For: 1.5
>
>         Attachments: mail.doc
>
>
> If you have a hyperlink to a webpage or a mailto address in the page header (German:
> Kopfzeile) of your .doc document, the imported text is defaced like this:
>  �HYPERLINK "http://tw-systemhaus.de" �http://tw-systemhaus.de�
> However, it is not an issue in the body text.
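
For illustration only (this is not the change made in r1564540): the unprintable
characters in the defaced output above look like the field markers that Word's binary
format places around a field, namely 0x13 (field begin), 0x14 (separator between the
field instruction and its displayed result) and 0x15 (field end). Assuming that is what
they are, a minimal sketch of dropping the instruction and keeping only the visible
text could look like this. The function name strip_field_codes is hypothetical and is
not part of Tika or POI.

```python
# Word's binary format brackets each field with three control characters.
FIELD_BEGIN = "\x13"      # starts a field; the instruction follows
FIELD_SEPARATOR = "\x14"  # ends the instruction; the displayed result follows
FIELD_END = "\x15"        # closes the field


def strip_field_codes(raw: str) -> str:
    """Return only the visible text, dropping field instructions and markers."""
    visible = []
    # One entry per open field: True while we are still inside its instruction.
    in_instruction = []
    for ch in raw:
        if ch == FIELD_BEGIN:
            in_instruction.append(True)
        elif ch == FIELD_SEPARATOR:
            if in_instruction:
                in_instruction[-1] = False
        elif ch == FIELD_END:
            if in_instruction:
                in_instruction.pop()
        elif not any(in_instruction):
            # Keep text only when no enclosing field is still in its instruction.
            visible.append(ch)
    return "".join(visible)


# Header text shaped like the report, with the assumed marker characters.
raw = '\x13HYPERLINK "http://tw-systemhaus.de" \x14http://tw-systemhaus.de\x15'
print(strip_field_codes(raw))
```

A stack is used rather than a single flag so that a field nested inside another
field's instruction stays hidden as well.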



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
