lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From François Schiettecatte <fschietteca...@gmail.com>
Subject Re: Does the Solr enable Lemmatization [not the Stemming]
Date Thu, 05 May 2011 10:40:27 GMT
Rajani

You might also want to look at Balie ( http://balie.sourceforge.net/ ), from the web site:

Features:

	• language identification
	• tokenization
	• sentence boundary detection
	• named-entity recognition


Can't vouch for it though.




On May 5, 2011, at 4:58 AM, Jan Høydahl wrote:

> Hi,
> 
> Solr does not have lemmatization out of the box.
> 
> You'll have to find 3rd party analyzers, and the most known such is from BasisTech. Please
contact them to learn more.
> 
> I'm not aware of any open source lemmatizers for Solr.
> 
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
> 
> On 5. mai 2011, at 10.34, rajini maski wrote:
> 
>> Does the solr enable lemmatization concept?
>> 
>> 
>> 
>>  I found a documentation that gives an information as solr enables
>> lemmatization concept. Here is the link :
>> http://www.basistech.com/knowledge-center/search/2010-09-language-identification-language-support-and-entity-extraction.pdf
>> 
>> Can anyone help me finding the jar specified in that document so that i can
>> add it as plugin.
>> jar :rlp.solr.RLPTokenizerFactory
>> 
>> 
>> Thanks and Regards,
>> Rajani Maski
> 


Mime
View raw message