rave-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jasha Joachimsthal <ja...@apache.org>
Subject Re: [DISCUSS] Changing Rave charset to UTF-8
Date Wed, 25 Jul 2012 20:23:30 GMT
On 25 July 2012 21:45, Carlucci, Tony <acarlucci@mitre.org> wrote:

> >-----Original Message-----
> >From: jashaj@gmail.com [mailto:jashaj@gmail.com] On Behalf Of Jasha
> >Joachimsthal
> >Sent: Wednesday, July 25, 2012 3:09 PM
> >To: dev@rave.apache.org
> >Subject: Re: [DISCUSS] Changing Rave charset to UTF-8
> >
> >On 25 July 2012 20:54, Carlucci, Tony <acarlucci@mitre.org> wrote:
> >
> >> Hi Rave Devs,
> >>
> >> Is there any reason why we *shouldn't* change the Rave code base to
> serve
> >> out UTF-8 character sets instead of ISO-8859-1?
> >>
> >> Thanks, Tony
> >>
> >
> >Because you may run into issues when doing form submits and url's with
> >query strings. If you change the encoding, please do check that all the
> >form submits (both POST and GET) and AJAX calls are treated correctly with
> >diacritics. English doesn't use them a lot, but other (European) languages
> >do.
> >The default encoding for servlet requests and responses seems to be
> >ISO-8859-1: http://wiki.apache.org/tomcat/FAQ/CharacterEncoding
> >
> >Is something going wrong with encoding at the moment?
> >
> >Jasha
> No, nothing is wrong, just something I noticed while trying to debug our
> internal Apache/Tomcat issues.  Our Tomcat connector is configured for
> UTF-8 so I thought there might have been a conflict with the default
> ISO-8859-1 encoding we are using.  Changing it didn't solve our problem,
> but never the less I didn't know if we should be serving UTF-8 or not
> out-of-the-box to support non-Latin based charsets?
> Perhaps instead of forcibly changing it to UTF-8, we could make it a
> configurable property?

Finding out what goes wrong with encoding is always a PITA. It can be
configured in many levels of the request handling: the httpd server, Tomcat
connector, in a servlet or Spring's CharacterEncodingFilter, in the JSP (I
may have forgotten a layer or 2). Then the database may or may not handle
the data correctly.
UTF-8 may be a better choice for non-European character sets, but it should
be tested carefully.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message