lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jan Høydahl (JIRA) <>
Subject [jira] [Updated] (SOLR-13255) LanguageIdentifierUpdateProcessor broken for documents sent with SolrJ/javabin
Date Fri, 15 Feb 2019 10:08:00 GMT


Jan Høydahl updated SOLR-13255:
    Attachment: SOLR-13255.patch

> LanguageIdentifierUpdateProcessor broken for documents sent with SolrJ/javabin
> ------------------------------------------------------------------------------
>                 Key: SOLR-13255
>                 URL:
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: contrib - LangId
>    Affects Versions: 7.7
>            Reporter: Andreas Hubold
>            Priority: Major
>         Attachments: SOLR-13255.patch
> 7.7 changed the object type of string field values that are passed to UpdateRequestProcessor
implementations from java.lang.String to ByteArrayUtf8CharSequence. SOLR-12992 was mentioned
on solr-user as cause.
> The LangDetectLanguageIdentifierUpdateProcessor still expects String values, does not
work for CharSequences, and logs warnings instead. For example:
> {noformat}
> 2019-02-14 13:14:47.537 WARN  (qtp802600647-19) [   x:studio] o.a.s.u.p.LangDetectLanguageIdentifierUpdateProcessor
Field name_tokenized not a String value, not including in detection
> {noformat}
> I'm not sure, but there could be further places where the changed type for string values
needs to be handled. (Our custom UpdateRequestProcessor are broken as well since 7.7 and it
would be great to have a proper upgrade note as part of the release notes)

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message