lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley" <ysee...@gmail.com>
Subject Re: Cyrillic characters
Date Wed, 19 Jul 2006 15:58:58 GMT
On 7/19/06, Tricia Williams <pgwillia@student.cs.uwaterloo.ca> wrote:
>     You mentioned in another earlier post that q=h%c3%e9 would find
> matching hits.  My experience shows that while the UTF-8 encoded query
> doesn't generate any exceptions, no results are matched.  However
> q=h%e9llo would find matching results.

Confirmed in Tomcat 5.5.17, LOL!

So Firefox->Tomcat works for latin1 at least
and IE->Jetty also works for latin1

By my reading of the standards, UTF8 (or percent encoded UTF8 bytes)
is the only correct format for a URI to be in.

Can anyone else shed some light on this?

-Yonik

Mime
View raw message