lucene-dev mailing list archives

From Samphan Raruenrom
Subject ThaiAnalyzer for Lucene
Date Wed, 22 Feb 2006 03:51:33 GMT

I've written an alpha version of ThaiAnalyzer to enable
Thai in Lucene full-text search.
Thai is written without spaces between words (as are Lao and Khmer),
so you need a dictionary-based word breaker to find the word boundaries.
I use ICU4J's DictionaryBasedBreakIterator for this job.
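
To illustrate the word-breaking step, here is a minimal sketch. It is not
the ThaiAnalyzer code itself. It assumes the JDK's java.text.BreakIterator
as a stand-in for ICU4J's DictionaryBasedBreakIterator, and the class name
WordBreakSketch and the method words() are made up for this example.
Whether Thai text is actually split on dictionary boundaries depends on the
locale data available to the Java runtime.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration only; not part of Lucene or ICU4J.
public class WordBreakSketch {

    /**
     * Splits text at the word boundaries reported by a BreakIterator and
     * keeps only segments that contain at least one letter or digit
     * (so whitespace and punctuation segments are dropped).
     */
    public static List<String> words(String text, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String segment = text.substring(start, end);
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                out.add(segment);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Space-delimited text: boundaries fall at the spaces.
        System.out.println(words("hello world", Locale.ENGLISH)); // prints [hello, world]

        // The Thai phrase for "Thai language", written with Unicode escapes.
        // It contains no spaces, so any split comes from the iterator's
        // Thai support; the result depends on the runtime's locale data.
        String thai = "\u0e20\u0e32\u0e29\u0e32\u0e44\u0e17\u0e22";
        System.out.println(words(thai, Locale.forLanguageTag("th")));
    }
}
```

In an analyzer, each segment returned this way would become one token.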

I'd like to contribute the code under the Apache License,
so that it can be useful to other people.
How can I do this?
I see analyzers for various languages in the Sandbox.
How can I put the code there?


_/|\_ Samphan Raruenrom. Open Source Development Co., Ltd.
Tel: +66 38 311816, Fax: +66 38 773128,
