lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinay B," <vybe3...@gmail.com>
Subject Language Identification and Stemming
Date Fri, 01 Mar 2013 20:49:03 GMT
As I understand, SOLR allows us to plug in language detection
processors: http://wiki.apache.org/solr/LanguageDetection

GIven that our use case involves a collection of mixed language documents,
Q1: Assume that we plug in language detection, will this affect the
stemming and other language specific operations eg. will the stemmers
use the correct language identified by the language detection code:
http://www.early-dance.de/news/9188-optimizing-apachesolr-non-english-languages
Q2. Currently, we don't explicitly use a processor chain  for our
updates, .. just a custom update handler that also returns custom
opcodes etc in the response. If we plug  language detection via an
update chain connected to this request handler, (how) can we pass the
chosen language back via the response?

    <requestHandler name="/update/myupdatet"
                  class="com.xyz.MyDocUpdateHandler" />

Thanks

Mime
View raw message