lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Che Dong" <>
Subject WebLucene 0.3 release:support CJK, use sax based indexing, docID based result sorting and xml format output with highlighting support.
Date Sun, 30 Nov 2003 13:57:18 GMT

Lucene search engine XML interface, provided sax based indexing, indexing sequence based result
sorting and xml output with highlight support.The CJKTokenizer support Chinese Japanese and
Korean with Westen language simultaneously.

The key features:
1 The bi-gram based CJK support: org/apache/lucene/analysis/cjk/CJKTokenizer

2 docID based result sorting: org/apache/lucene/search/IndexOrderSearcher

3 xml output: com/chedong/weblucene/search/DOMSearcher

4 sax based indexing: com/chedong/weblucene/index/SAXIndexer

5 token based highlighter: 
    reverse StopTokenzier:
    with abstract:

6 A simplified query parser:
    google like syntax with term limit
    modified from early version of Lucene :)


Che, Dong
View raw message