lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lukas Zapletal <>
Subject Converting ISO88592 files to UTF8 and indexing`em
Date Fri, 03 Jan 2003 10:18:45 GMT

I have a problem. I need to index Czech content that is in HTML files in 
ISO-8859-2. Is there any way to convert them to UTF and index them?
What stream or reader have I use? Is it possible?

How can I construct queries after that... Some systems have ISO-8859-2 and 
some systems Win-1250.
Is there any way to convert query string from default (system) encoding to 

People programming ENGLISH systems are so happy... ;-)

Lukas Zapletal

To unsubscribe, e-mail:   <>
For additional commands, e-mail: <>

View raw message