lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shawn Heisey <apa...@elyograg.org>
Subject Re: Morphology of synonims
Date Wed, 28 Jan 2015 14:07:34 GMT
On 1/28/2015 5:11 AM, Reinforcer wrote:
> Is Solr capable of using morphology for synonims?
> 
> For example. Request: "inanely".
> Indexed text in Solr: "Searching keywords without morphology is fatuously".
> "inane" and "fatuous" are synonims.
> 
> So, "inanely" ---morphology----> "inane" -----synonims---> "fatuous"
> ---morphology----> "fatuously". Is this possible ("double morphology")?

Synonyms are handled via exact match.  The feature you are describing is
called stemming or lemmatization.

https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#Stemming

It is possible to combine stemming and synonyms in the same analysis
chain, but you must figure out what the root word is to put into your
synonym list.  It may not be what you expect.  For example, the english
stemmer will probably change "acheive" to "acheiv" ... which sounds
wrong, until you remember that stemming must be applied both at index
and query time, and the user will never see that form of the word.

Synonyms are usually only applied at either index or query time.  Which
one to choose depends on your requirements, but I believe it is
typically on the query side.

The analysis tab in the admin UI is invaluable for seeing the results of
changes in the analysis chain.

Thanks,
Shawn


Mime
View raw message