lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Avlesh Singh <avl...@gmail.com>
Subject Re: Re : Questions about synonyms and highlighting
Date Wed, 07 Oct 2009 10:18:09 GMT
>
> 4 - the same question for highlighting with lemmatisation?
> Settings for "manage" (all highlighted) ==> the two words<em>manage</em>
> and
> "<em>management</em>" are highlighted
> Settings for "manage" ==> the first word "<em>manage</em>" is highlighted
> but
> not the second  : "management"
>

There is no Lemmatisation support in Solr as of now. The only support you
get is stemming.
Let me understand this correctly - you basically want the searches to happen
with stemmed base but want to selectively highlight the original and/or
stemmed words. Right? If yes, then AFAIK, this is not possible. Search
passes through your fields analyzers (tokenizers and filters). Highlighters,
typically, use the same set of analyzers and the behavior will be the same
as in search; this essentially means that the keywords "manage", "managing",
"management" and "manager" are REDUCED to "manage" for searchers and
highlighters.
If this can be done, then the only place to enable your "feature" could be
Lucene highlighter api's. Someone more knowledegable can tell you, if that
is possible.

I have no idea about your #3, though my idea of handling accentuation is to
apply a  ISOLatin1AccentFilterFactory and get rid of them altogether :)
I am curious to know the answer though.

Cheers
Avlesh

On Wed, Oct 7, 2009 at 3:17 PM, Nourredine K. <nourredine_k@yahoo.com>wrote:

> > I'm not an expert on hit highlighting but please find some answers
> inline:
>
> Thanks Shalin for your answers. It helps a lot.
>
> I post again questions #3 and #4 for the others :)
>
>
> 3 - Is it possible and if so How can I configure solR to set or not
> highlighting
> for tokens with diacritics ?
>
>
> Settings for "vélo" (all highlighted) ==> the two words "<em>vélo</em>"
and
> "<em>velo</em>" are highlighted
> Settings for "vélo" ==> the first word "<em>vélo</em>" is highlighted
but
> not
> the second  : "velo"
>
>
> 4 - the same question for highlighting with lemmatisation?
>
>
> Settings for "manage" (all highlighted) ==> the two words<em>manage</em>
> and
> "<em>management</em>" are highlighted
> Settings for "manage" ==> the first word "<em>manage</em>" is highlighted
> but
> not the second  : "management"
> Regard,
>
> Nourredine.
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message