lucene-solr-user mailing list archives

From "Eli K" <system....@gmail.com>
Subject multi-language searching with Solr
Date Mon, 05 May 2008 14:27:14 GMT
Hello folks,

Let me start by saying that I am new to Lucene and Solr.

I am in the process of designing a search back-end for a system that
receives 20k documents a day and needs to keep them available for 30
days.  The documents should be searchable on a free text field and on
about 8 other fields.

One of my requirements is to index and search documents in multiple
languages.  I would like to be able to stem the text and provide the
advanced search features that are based on stemming.  This will only
affect the free text field because the rest of the fields are in
English.

I can find out the language of the document before indexing and I
might be able to provide the language to search on.  I also need to
have the ability to search across all indexed languages (there will
be 20 in total).

Given these requirements, do you think this is doable with Solr?  A
major limiting factor is that I need to stick to the 1.2 GA version
and I cannot utilize the multi-core features in the 1.3 trunk.

I considered writing my own analyzer that would call the appropriate
Lucene analyzer for the given language, but I did not see any way for
it to access the field that specifies the language of the document.

Thanks,

Eli

P.S. I am looking for an experienced Lucene/Solr consultant to help
with the design of this system.
