lucene-solr-user mailing list archives

From Jan Høydahl
Subject Re: wildcards and German umlauts
Date Mon, 30 May 2011 08:27:46 GMT

Agreed, this is annoying for foreign languages. I understand the idea behind the original behaviour,
but there could be more elegant ways of handling it. It would make sense to always run the
CharFilters. One idea would be a mechanism for tagging TokenFilters for exclusion from wildcard
terms. That way we could skip the stemming, synonym and phonetic filters for wildcard terms,
but still apply lowercasing and character normalization.
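
A minimal sketch of that tagging idea, written in plain Python rather than against Lucene's actual API. The `Filter` class, the `wildcard_safe` flag, the umlaut folding table and the toy stemmer are all hypothetical names made up for illustration; nothing here exists in Solr or Lucene under these names.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Filter:
    """One step of a (hypothetical) analysis chain."""
    name: str
    apply: Callable[[str], str]
    wildcard_safe: bool  # hypothetical tag: may this step run on wildcard terms?


# Assumed character normalization: fold German umlauts to ASCII.
UMLAUTS = str.maketrans({"ä": "a", "ö": "o", "ü": "u", "ß": "ss"})


def naive_stem(term: str) -> str:
    """Toy stemmer (NOT Porter): strips a trailing 'en' from longer terms."""
    if len(term) > 4 and term.endswith("en"):
        return term[:-2]
    return term


CHAIN: List[Filter] = [
    Filter("lowercase", str.lower, wildcard_safe=True),
    Filter("umlaut_folding", lambda t: t.translate(UMLAUTS), wildcard_safe=True),
    Filter("stemmer", naive_stem, wildcard_safe=False),
]


def analyze(term: str, chain: List[Filter] = CHAIN) -> str:
    """Run the chain, skipping untagged steps when the term has a wildcard."""
    is_wildcard = "*" in term or "?" in term
    for f in chain:
        if is_wildcard and not f.wildcard_safe:
            continue  # e.g. never stem a partial word
        term = f.apply(term)
    return term


print(analyze("München"))  # full chain, including the toy stemmer
print(analyze("Münch*"))   # only lowercasing and umlaut folding
```

With this chain, the indexed term for "München" is `munch`, and the wildcard query "Münch*" is normalized to `munch*`, so the prefix matches even though the stemmer was skipped for the query.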

Jan Høydahl, search solution architect
Cominvent AS

On 29 May 2011, at 19:24, mdz-munich wrote:

> Ah, NOW I get it. It's not a bug, it's a feature.
> But that would mean that every character manipulation (e.g.
> char mapping/replacement, the Porter stemmer in some cases ...) would cause a
> wildcard query to fail. That's too bad.
> But why? What's the problem with passing the prefix through the
> analyzer/filter chain?
> Greetz,
> Sebastian
