tika-dev mailing list archives

From "Ken Krugler (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (TIKA-869) IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name
Date Thu, 09 Aug 2012 21:55:19 GMT

     [ https://issues.apache.org/jira/browse/TIKA-869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ken Krugler resolved TIKA-869.

       Resolution: Fixed
    Fix Version/s: 1.3

> IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name
> -----------------------------------------------------------------------------
>                 Key: TIKA-869
>                 URL: https://issues.apache.org/jira/browse/TIKA-869
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>            Reporter: Ken Krugler
>            Assignee: Ken Krugler
>             Fix For: 1.3
>         Attachments: TIKA-869.patch
> Currently IdentityHtmlMapper.mapSafeElement(String name) just returns name as-is. This makes the XHTMLContentHandler think that it hasn't received a <body> tag, since it assumes input is lower-cased. So you get output that looks like:
> <body><BODY/></body></html>
> The solution is a trivial change to lower-case the incoming name, the same as what the mapSafeAttribute() method is already doing.
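
For illustration only, the change described above can be sketched roughly as follows. This is not the contents of TIKA-869.patch: the class LowerCasingMapperSketch and the method mapSafeElementAsIs are hypothetical names, the class does not implement Tika's mapper interface, and the use of Locale.ROOT is an assumption made to keep the result independent of the JVM's default locale.

```java
import java.util.Locale;

// Hypothetical stand-in for the mapper discussed in the issue; not Tika's source.
class LowerCasingMapperSketch {

    // Behaviour reported in the issue: the element name is returned unchanged,
    // so an upper-case "BODY" never matches a handler that expects "body".
    String mapSafeElementAsIs(String name) {
        return name;
    }

    // Behaviour proposed in the issue: lower-case the incoming element name.
    // Locale.ROOT is an assumption here, chosen so that the result does not
    // vary with the default locale (for example, Turkish dotless i).
    String mapSafeElement(String name) {
        return name.toLowerCase(Locale.ROOT);
    }
}
```

With this sketch, mapSafeElement("BODY") yields "body", which is the form the issue says XHTMLContentHandler assumes for its input.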
