lucene-dev mailing list archives

From "Jaap de Jong (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-11859) CloneFieldUpdateProcessorFactory should not add {set=} to content when cloned to multivalued field
Date Tue, 16 Jan 2018 12:47:00 GMT

     [ https://issues.apache.org/jira/browse/SOLR-11859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jaap de Jong updated SOLR-11859:
--------------------------------
    Description: 
I'm using the CloneFieldUpdateProcessorFactory to copy content from all string fields _except
some predefined fields_ to a multivalued "text_final" field. This seems to work, however
each value is prepended with "\{set=" and appended with "}".

Expected result
 Just clone all the original values into the multivalued field +without "\{set=}".+  
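
For illustration, the symptom can be pictured as follows. The report does not say how the documents are sent to Solr. One input shape that could plausibly yield this output, assumed here and not confirmed by the report, is an atomic update, where a field carries a "set" operation instead of a plain value. The field name s_title, the id, and the text values are hypothetical.

```xml
<!-- Hypothetical update message; s_title, the id, and the values are made up for illustration. -->
<!-- Assumption: the field arrives as an atomic "set" operation, which the report does not confirm. -->
<add>
  <doc>
    <field name="id">doc-1</field>
    <field name="s_title" update="set">Rode fiets</field>
  </doc>
</add>

<!-- Observed in text_final, per the report:  {set=Rode fiets} -->
<!-- Expected in text_final:                  Rode fiets       -->
```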

In my schema this field is defined as:

{{<field name="text_final" type="text_nl" indexed="true" stored="true" multiValued="true"/>}}

The fieldType is defined as:

{{<fieldType name="text_nl" class="solr.TextField" positionIncrementGap="100">}}
 {{<analyzer>}}
 {{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern='>' replacement='
' />}}
{{<tokenizer class="solr.StandardTokenizerFactory"/>}}
 {{<filter class="solr.LowerCaseFilterFactory"/>}}
 {{<filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_nl.txt"}}
 {{format="snowball"/>}}
 {{<filter class="solr.HunspellStemFilterFactory"}}
 {{dictionary="lang/nederlands/nl_NL.dic"}}
 {{affix="lang/nederlands/nl_NL.aff"}}
 {{ignoreCase="true"/>}}
 {{</analyzer>}}
 {{</fieldType>}}

In my updateRequestProcessorChain the processor is defined as:

{{<processor class="solr.CloneFieldUpdateProcessorFactory">}}
 {{<lst name="source">}}
 {{<str name="fieldRegex">s_.*</str>}}
 {{<lst name="exclude">}}
 {{<str name="fieldName">s_description</str>}}
 {{<str name="fieldName">s_image_link</str>}}
 {{<str name="fieldName">s_link</str>}}
 {{</lst>}}
 {{</lst>}}
 {{<str name="dest">text_final</str>}}
 {{</processor>}}
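
For context, a processor like this sits inside a chain in solrconfig.xml. The report does not show the surrounding chain, so the following is only a minimal sketch under assumptions: the chain name "clone-text-final" is hypothetical, and the two trailing processors are the usual logging and indexing steps, not something taken from the report.

```xml
<!-- Sketch only: the chain name is hypothetical and not part of the report. -->
<updateRequestProcessorChain name="clone-text-final">
  <!-- The clone processor as defined in the report. -->
  <processor class="solr.CloneFieldUpdateProcessorFactory">
    <lst name="source">
      <str name="fieldRegex">s_.*</str>
      <lst name="exclude">
        <str name="fieldName">s_description</str>
        <str name="fieldName">s_image_link</str>
        <str name="fieldName">s_link</str>
      </lst>
    </lst>
    <str name="dest">text_final</str>
  </processor>
  <!-- Assumed trailing processors: log the update, then hand it to the index. -->
  <processor class="solr.LogUpdateProcessorFactory"/>
  <processor class="solr.RunUpdateProcessorFactory"/>
</updateRequestProcessorChain>
```

A chain defined this way can be selected per request with the update.chain parameter, for example update.chain=clone-text-final.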

  was:
I'm using the CloneFieldUpdateProcessorFactory to copy content from all string fields _except
some predefined fields_ to a multivalued "text_final" field. This seems to work, however
each value is prepended with "\{set=" and appended with "}".

Expected result
Just clone all the original values into the multivalued field +without "\{set=}".+  

In my schema this field is defined as:

{{<field name="text_final" type="text_nl" indexed="true" stored="true" multiValued="true"/>}}

The fieldType is defined as:

{{<fieldType name="text_nl" class="solr.TextField" positionIncrementGap="100">}}
{{<analyzer>}}
{{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern='>' replacement='
' />}}
{{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="\\{set=(.*)\}" replacement="$1"
/>}}

{{<tokenizer class="solr.StandardTokenizerFactory"/>}}
{{<filter class="solr.LowerCaseFilterFactory"/>}}
{{<filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_nl.txt"}}
{{format="snowball"/>}}
{{<filter class="solr.HunspellStemFilterFactory"}}
{{dictionary="lang/nederlands/nl_NL.dic"}}
{{affix="lang/nederlands/nl_NL.aff"}}
{{ignoreCase="true"/>}}
{{</analyzer>}}
{{</fieldType>}}

In my updateRequestProcessorChain the processor is defined as:

{{<processor class="solr.CloneFieldUpdateProcessorFactory">}}
{{<lst name="source">}}
{{<str name="fieldRegex">s_.*</str>}}
{{<lst name="exclude">}}
{{<str name="fieldName">s_description</str>}}
{{<str name="fieldName">s_image_link</str>}}
{{<str name="fieldName">s_link</str>}}
{{</lst>}}
{{</lst>}}
{{<str name="dest">text_final</str>}}
{{</processor>}}


> CloneFieldUpdateProcessorFactory should not add {set=} to content when cloned to multivalued field
> --------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-11859
>                 URL: https://issues.apache.org/jira/browse/SOLR-11859
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: UpdateRequestProcessors
>    Affects Versions: 7.2
>            Reporter: Jaap de Jong
>            Priority: Minor
>



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

