lucene-solr-user mailing list archives

From "Teruhiko Kurosaka" <K...@basistech.com>
Subject RE: Facet only support english?
Date Thu, 10 May 2007 21:49:32 GMT
If my memory is correct, UTF-8 has been the default encoding per the
XML specification from a very early stage. If the XML parser does not
default to UTF-8 in the absence of the encoding attribute, the parser
has a bug, and the code should be corrected.
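
One way to check a given parser is to feed it UTF-8 bytes with no
encoding declared and see whether non-ASCII text survives. The sketch
below uses Python's standard xml.etree.ElementTree only as an
illustration; the <facet> element and the element_text helper are
made up for this example and are not part of Solr.

```python
import xml.etree.ElementTree as ET


def element_text(xml_bytes: bytes) -> str:
    """Parse raw XML bytes and return the text of the root element."""
    return ET.fromstring(xml_bytes).text


# Three Japanese characters, written as escapes so this file stays ASCII.
word = "\u65e5\u672c\u8a9e"

# Same content, once without and once with an explicit encoding.
# <facet> is a hypothetical element name, not a Solr schema element.
no_decl = f"<facet>{word}</facet>".encode("utf-8")
with_decl = (
    f'<?xml version="1.0" encoding="UTF-8"?><facet>{word}</facet>'
).encode("utf-8")

# A parser that defaults to UTF-8 should return the same text for both.
print(element_text(no_decl) == word)
print(element_text(with_decl) == word)
```

If the first comparison fails while the second succeeds, the parser is
not defaulting to UTF-8.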

(I have no objection to adding the encoding attribute for clarity,
however.)
-kuro

> -----Original Message-----
> From: Walter Underwood [mailto:wunderwood@netflix.com] 
> Sent: Wednesday, May 09, 2007 4:33 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Facet only support english?
> 
> I didn't remember that requirement, so I looked it up. It was added
> in XML 1.0 2nd edition. Originally, unspecified encodings were open
> for auto-detection.
> 
> Content type trumps encoding declarations, of course, per RFC 3023
> and allowed by the XML spec.
> 
> wunder
> 
> On 5/9/07 4:19 PM, "Mike Klaas" <mike.klaas@gmail.com> wrote:
> 
> > I thought that conformant parsers use UTF-8 as the default anyway:
> > 
> > http://www.w3.org/TR/REC-xml/#charencoding
> > 
> > -Mike
> 
> 
