lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From WHIRLYCOTT <p...@whirlycott.com>
Subject Re: Cyrillic characters
Date Wed, 19 Jul 2006 16:15:36 GMT
I submitted two patches that fix one problem with URL encoding and  
another with the screens on the webapp.

	http://issues.apache.org/jira/browse/SOLR-35

phil.

On Jul 19, 2006, at 11:58 AM, Yonik Seeley wrote:

> On 7/19/06, Tricia Williams <pgwillia@student.cs.uwaterloo.ca> wrote:
>>     You mentioned in another earlier post that q=h%c3%e9 would find
>> matching hits.  My experience shows that while the UTF-8 encoded  
>> query
>> doesn't generate any exceptions, no results are matched.  However
>> q=h%e9llo would find matching results.
>
> Confirmed in Tomcat 5.5.17, LOL!
>
> So Firefox->Tomcat works for latin1 at least
> and IE->Jetty also works for latin1
>
> By my reading of the standards, UTF8 (or percent encoded UTF8 bytes)
> is the only correct format for a URI to be in.
>
> Can anyone else shed some light on this?
>
> -Yonik


--
                                    Whirlycott
                                    Philip Jacob
                                    phil@whirlycott.com
                                    http://www.whirlycott.com/phil/



Mime
View raw message