httpd-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Antonín Zrůstek <Antonin.Zrus...@corpus.cz>
Subject [users@httpd] Bad charset conversion with mod_proxy
Date Thu, 18 Jul 2013 12:15:55 GMT
Hello everybody,

while reverse proxying a page via mod_proxy_html (Apache 2.4.4) I get 
wrong output charset for some parts of page. When I comment Include 
conf/extra/proxy-html.conf the page is OK, but of course rewriting 
internal links doesn't work. When I uncomment it, rewriting works just 
OK, but charset in one button and in a posted form data is bad.

Config:
Apache 2.4.4 self compiled
Solaris 10

LoadFile   /opt/csw/lib/libxml2.so
LoadModule proxy_html_module modules/mod_proxy_html.so
LoadModule xml2enc_module modules/mod_xml2enc.so

...

  ProxyHTMLDoctype XHTML
  ProxyRequests off
  xml2EncDefault UTF-8 #or commented

       ProxyPass /bohdalec/ http://10.16.133.38:20001/obj/   max=320

         <Location /bohdalec>

         ProxyPassReverse /
           ProxyHTMLEnable On
           ProxyHTMLURLMap /            /bohdalec/
           ProxyPassReverseCookiePath    /  /bohdalec/
         </Location>

I have found some clues in error log that I suppose it can be the reason.

In default setup:

[Thu Jul 18 14:14:39.753939 2013] [xml2enc:debug] [pid 13753:tid 26] 
mod_xml2enc.c(178): [client 10.16.8.40:62379] AH01430: Content-Type is 
text/html, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.753979 2013] [xml2enc:debug] [pid 13753:tid 26] 
mod_xml2enc.c(254): [client 10.16.8.40:62379] AH01434: Charset 
ISO-8859-1 not supported by libxml2; trying apr_xlate, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.754001 2013] [xml2enc:error] [pid 13753:tid 26] 
[client 10.16.8.40:62379] AH01435: Charset ISO-8859-1 not supported.  
Consider aliasing it?, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.754021 2013] [xml2enc:warn] [pid 13753:tid 26] 
[client 10.16.8.40:62379] AH01436: No usable charset information; using 
configuration default, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php

When using  xml2EncDefault UTF-8 option:

[Thu Jul 18 14:12:17.563736 2013] [xml2enc:debug] [pid 13739:tid 25] 
mod_xml2enc.c(178): [client 10.16.8.40:62324] AH01430: Content-Type is 
text/html, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563774 2013] [xml2enc:debug] [pid 13739:tid 25] 
mod_xml2enc.c(254): [client 10.16.8.40:62324] AH01434: Charset UTF-8 not 
supported by libxml2; trying apr_xlate, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563796 2013] [xml2enc:error] [pid 13739:tid 25] 
[client 10.16.8.40:62324] AH01435: Charset UTF-8 not supported. Consider 
aliasing it?, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563815 2013] [xml2enc:warn] [pid 13739:tid 25] 
[client 10.16.8.40:62324] AH01436: No usable charset information; using 
configuration default, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php


If I read it well, the problem is that libxml2 doesn't support neither 
ISO-8859-1 or UTF-8. But according to documentation, they are always 
supported and UTF-8 is the default encoding. I have tried 3 versions of 
libxml2 - default Solaris version, CSW version and self compiled one. I 
doubt all three versions could have such a basic problem as not 
supporting default charset. Do I miss something important?

Thanks for any help

Toni

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
For additional commands, e-mail: users-help@httpd.apache.org


Mime
View raw message