lucene-java-user mailing list archives

From Olivier Jaquemet
Subject Re: Multiple Language Indexing and Searching
Date Tue, 06 Sep 2005 12:21:03 GMT
Gusenbauer Stefan wrote:

>I think Nutch uses ngramj for language classification, but I don't know
>how they store the language information. In our application, for
>example, I save the language in an extra field in the document. Because
>Lucene supports multiple fields with the same name, we would be able to
>handle different languages, but for now we don't need it.
But if you do so, you do not benefit from any specialized Analyzer
you could use for each language, do you?
Then again, maybe using a specialized analyzer for each language is
not that worthwhile?
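
To make the trade-off concrete, here is a minimal sketch in plain Java of what language-specific analysis would buy over just storing the language in a field. It does not use the Lucene API: the class PerLanguageAnalysis, the method analyze, and the tiny stop-word lists are all hypothetical and only illustrate picking the analysis by language code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

public class PerLanguageAnalysis {

    // Toy stop-word lists (assumption, for illustration only).
    // A real analyzer would typically also stem and normalise terms.
    private static final Map<String, Set<String>> STOP_WORDS = new HashMap<>();
    static {
        STOP_WORDS.put("en", new HashSet<>(Arrays.asList("the", "of", "and")));
        STOP_WORDS.put("fr", new HashSet<>(Arrays.asList("le", "la", "les", "de", "et")));
    }

    // Lowercase the text, split on non-letters, and drop the stop words
    // of the given language. Unknown languages get no stop-word filtering.
    public static List<String> analyze(String language, String text) {
        Set<String> stop = STOP_WORDS.getOrDefault(language, Collections.<String>emptySet());
        List<String> terms = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!token.isEmpty() && !stop.contains(token)) {
                terms.add(token);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        // The language code would come from the extra field stored per document.
        System.out.println(analyze("en", "The index of the book"));
        System.out.println(analyze("fr", "Le chat et la souris"));
    }
}
```

With a single generic analysis for every document, "le" and "et" would be indexed as ordinary terms in French text. Whether that matters enough to justify one analyzer per language is exactly the open question.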
