lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: chaining charFilter
Date Wed, 02 Dec 2020 16:43:02 GMT
Did you reload the core for it to notice the new schema? Or try creating a
new core from the same schema?

If it is a SolrCloud, you also have to upload the schema to the Zookeeper.

Regards,
   Alex.

On Wed, 2 Dec 2020 at 09:19, Arturas Mazeika <mazeika@gmail.com> wrote:

> Hi Solr-Team,
>
> The manual of charfilters says that one can chain them: (from
> https://lucene.apache.org/solr/guide/6_6/charfilterfactories.html#CharFilterFactories-solr.MappingCharFilterFactory
> ):
>
> CharFilters can be chained like Token Filters and placed in front of a
> Tokenizer. CharFilters can add, change, or remove characters while
> preserving the original character offsets to support features like
> highlighting.
>
> I am trying to filter out some of the chars from some fields, so I can do
> an efficient and effective faceting later. I tried to chaing charfilters
> for that purpose:
>
> <fieldType name="fcomp_type" class="solr.TextField"
> positionIncrementGap="100">
> <analyzer>
> <!-- taking the filename: from the path-->
> <charFilter class="solr.PatternReplaceCharFilterFactory"
> pattern="(.*[/\\])([^/\\]+)$"   replacement="$2"/>
> <charFilter class="solr.PatternReplaceCharFilterFactory"
> pattern="([0-9\-]+)T([0-9\-]+)" replacement="$1 $2"/>
> <charFilter class="solr.PatternReplaceCharFilterFactory"
> pattern="[^a-zA-Z]+"            replacement=" "/>
>
> <tokenizer  class="solr.StandardTokenizerFactory"/>
> <filter class="solr.LowerCaseFilterFactory"/>
> </analyzer>
> </fieldType>
> <field name="fcomp"                   type="fcomp_type"   indexed="true"
> stored="true"/>
>
> but in schema definition I see only the last charfilter
> [image: image.png]
>
> Any clues why?
>
> Cheers,
> Arturas
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message