lucene-java-user mailing list archives

From "Andrew C. Oliver" <>
Subject Re: lucene web-app & russian language
Date Sat, 02 Mar 2002 16:28:17 GMT

Sorry, Lucene supports other languages, but the webapp was written for
English.  Swap out the analyzer.  If you can adapt the webapp to make the
analyzer configurable, I'd be happy to update the "getting started guide"
and commit the changes.
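
A minimal sketch of what "configurable" could look like, in plain Java with
no Lucene dependency. Everything here is hypothetical and only for
illustration: SimpleAnalyzer, ASCII_ONLY, UNICODE and forName are made-up
names, not Lucene classes, and the two toy analyzers are not how
StandardAnalyzer works. The point is only that the analyzer is chosen by
name in one place, so indexing and query parsing can share the same choice.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class AnalyzerConfigSketch {

    /** Hypothetical stand-in for an analyzer: turns text into index terms. */
    interface SimpleAnalyzer {
        List<String> terms(String text);
    }

    /** Toy analyzer that only treats ASCII letters and digits as term characters. */
    static final SimpleAnalyzer ASCII_ONLY = text -> split(text, true);

    /** Toy analyzer that accepts any Unicode letter or digit, including Cyrillic. */
    static final SimpleAnalyzer UNICODE = text -> split(text, false);

    /** Splits text into lowercased terms, breaking on every character that is not kept. */
    static List<String> split(String text, boolean asciiOnly) {
        List<String> out = new ArrayList<>();
        StringBuilder cur = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            boolean keep = Character.isLetterOrDigit(c) && (!asciiOnly || c < 128);
            if (keep) {
                cur.append(Character.toLowerCase(c));
            } else if (cur.length() > 0) {
                out.add(cur.toString());
                cur.setLength(0);
            }
        }
        if (cur.length() > 0) {
            out.add(cur.toString());
        }
        return out;
    }

    /** Single configuration point; the name would come from a config file or init parameter. */
    static SimpleAnalyzer forName(String name) {
        switch (name.toLowerCase(Locale.ROOT)) {
            case "ascii":
                return ASCII_ONLY;
            case "unicode":
                return UNICODE;
            default:
                throw new IllegalArgumentException("Unknown analyzer: " + name);
        }
    }

    public static void main(String[] args) {
        // "\u043f\u043e\u0438\u0441\u043a" is a Russian word written with Unicode escapes
        // so the source file compiles regardless of its own encoding.
        String doc = "Lucene \u043f\u043e\u0438\u0441\u043a";
        System.out.println(forName("ascii").terms(doc));
        System.out.println(forName("unicode").terms(doc));
    }
}
```

With the "ascii" choice the Russian word never becomes a term, so a search
for it cannot match; with "unicode" it does. In a real change the two
branches would return actual analyzer instances instead of these toys.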



On Fri, 2002-03-01 at 15:49, Ype Kingma wrote:
> Philipp,
> >Hi! I was trying the Lucene web-app (lucene-1.2-rc5-dev.jar). I've created
> >and indexed a simple HTML document with both English and Russian words. It
> >was ANSI encoded. If I check _3.fdt from the created index, I can see my
> >document indexed and both Russian and English terms indexed (it opens in UTF
> >encoding, I suppose). But the problem starts when searching. If I search
> >with a Russian word, it returns nothing; if I search with English, it returns
> >a result, but all Russian words are returned as ? signs. I've changed the .jsp
> >content types to return UTF-8 encoding, but the result is still the same.
> >
> >So, finally, does Lucene do multilingual search or not? What am I doing
> >wrong? I have been trying to make it work with Russian docs since version
> >1.0, but still no idea and no results :((((((
> Did you read the FAQ on the use of the StandardAnalyzer during indexing
> and query parsing? You might need to replace it with a RussianAnalyzer,
> which you'll have to write yourself if no one has done this before
> you. Have a look at the GermanAnalyzer for some inspiration.
> Good luck,
> Ype
> -- 
> --
> To unsubscribe, e-mail:   <>
> For additional commands, e-mail: <>
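
On the "?" signs described in the quoted message: one common way they appear
in Java is when text is encoded into a charset that has no Cyrillic
characters, such as ISO-8859-1. The sketch below only demonstrates that
mechanism with the JDK's own classes; it is an assumption, not a diagnosis,
that the webapp hits this exact path. EncodingSketch and its method names
are made up for the example.

```java
import java.nio.charset.StandardCharsets;

public class EncodingSketch {

    /** Encodes to ISO-8859-1 and back; characters the charset cannot represent become '?'. */
    static String roundTripLatin1(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.ISO_8859_1);
        return new String(bytes, StandardCharsets.ISO_8859_1);
    }

    /** Encodes to UTF-8 and back; Cyrillic survives unchanged. */
    static String roundTripUtf8(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        return new String(bytes, StandardCharsets.UTF_8);
    }

    /** UTF-8 bytes read as ISO-8859-1: the kind of mismatch that makes a query term unmatchable. */
    static String misdecodeUtf8AsLatin1(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        return new String(bytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // A five-letter Russian word, written with Unicode escapes.
        String word = "\u043f\u043e\u0438\u0441\u043a";
        System.out.println(roundTripLatin1(word));            // prints "?????"
        System.out.println(roundTripUtf8(word).equals(word)); // prints "true"
        System.out.println(misdecodeUtf8AsLatin1(word).length()); // prints "10"
    }
}
```

The last call shows the search side of the problem: if the submitted query
is decoded with a different charset than the one the browser used, the term
handed to the query parser is no longer the word that was indexed.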
-- 
- port of Excel/Word/OLE 2 Compound Document format to java
- fix java generics!
The avalanche has already started. It is too late for the pebbles to
vote.
-Ambassador Kosh

