lucene-solr-user mailing list archives

From Ahmet Arslan <iori...@yahoo.com>
Subject Re: Stripping Punctuation in a fieldType
Date Fri, 15 Jan 2010 17:42:12 GMT
> I'm trying to find the best way to set up a fieldType that
> strips punctuation. 

Use solr.StandardTokenizerFactory, which strips punctuation when it tokenizes.

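Or, if you do not need to match alphanumeric or numeric terms, use
solr.LowerCaseTokenizerFactory. It is built on LetterTokenizer, so it splits
on every non-letter character and lowercases the result; digits are dropped
along with the punctuation.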
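
A minimal schema.xml sketch of the first option, in case it helps. The fieldType name "text_nopunct" is made up for illustration, and the LowerCaseFilterFactory is an optional extra, not something punctuation stripping requires:

```xml
<!-- Hypothetical fieldType name; pick whatever fits your schema. -->
<fieldType name="text_nopunct" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- The tokenizer discards punctuation while splitting the text. -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- Optional: lowercase tokens so matching is case-insensitive. -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

For the second option, replacing the tokenizer line with <tokenizer class="solr.LowerCaseTokenizerFactory"/> should be enough, and the lowercase filter can then be dropped.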

> I think the right way to do this is using a
> CharacterFilter
> of some type, but I can't seem to find any examples of how
> to set this
> up in a schema.xml file. 

If you want to use solr.MappingCharFilterFactory, you need to list every
punctuation character in a mapping text file by hand, e.g. "," => ""
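
A rough sketch of how that could look in schema.xml. The fieldType name "text_mapped" and the file name "mapping-punctuation.txt" are assumptions for illustration, and the whitespace tokenizer is just one possible choice. The mapping file would live in the core's conf directory:

```xml
<!--
  Assumed contents of conf/mapping-punctuation.txt (hypothetical file name),
  one rule per line, each mapping a punctuation character to an empty string:

    "," => ""
    "." => ""
    "!" => ""
    "?" => ""
-->
<fieldType name="text_mapped" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filters run on the raw text before the tokenizer sees it. -->
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-punctuation.txt"/>
    <!-- Assumed tokenizer: split on whitespace once punctuation is gone. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  </analyzer>
</fieldType>
```

Note that mapping a character to "" joins the text on either side of it, so "foo,bar" becomes "foobar". Map to " " instead if the two sides should stay separate tokens.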


      
