lucene-java-user mailing list archives

From Luís Filipe Nassif
Subject Re: Get original DocValues from ICUCollationDocValuesField
Date Sun, 30 Apr 2017 20:29:23 GMT
A related question: is it possible to do faceting on a SortedDocValuesField
using collation rules? Or is faceting always case-sensitive?

Thanks in advance,

2017-04-30 12:35 GMT-03:00 Luís Filipe Nassif:

> Hi Lucene community!
> I can successfully get the original doc values from fields indexed with
> SortedDocValues using code like:
> BytesRef bref = atomicReader.getSortedDocValues(field).get(doc);
> String value = bref.utf8ToString();
> However, since I need locale-aware sorting, I index several fields with
> ICUCollationDocValuesField. For those fields the code above does not
> work: the returned value is a string of unreadable characters. I know this
> is because the ICU Collator converts Strings to CollationKeys.
> Is there a way to convert the returned BytesRef back to the original doc
> value? Or, in other words, how can I get the original String from an ICU
> RawCollationKey?
> Any help will be much appreciated!
> Thanks to the Lucene contributors for such a great project!
> Luis Nassif
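
To illustrate why the bytes read back from a collated field are not the original text, here is a minimal sketch. It uses the JDK's java.text.Collator as a stand-in for the ICU Collator (an assumption made so the example runs without extra dependencies; the class name CollationKeyDemo and its helper methods are hypothetical). It shows that, at PRIMARY strength, two strings differing only in case produce keys that compare as equal, so the key bytes alone cannot say which original string was indexed. The usual way around this is to keep the original value in a separate field next to the collated one.

```java
import java.text.CollationKey;
import java.text.Collator;
import java.util.Locale;

public class CollationKeyDemo {

    // Collator that ignores tertiary differences such as upper vs. lower case.
    static Collator primaryCollator() {
        Collator collator = Collator.getInstance(Locale.ENGLISH);
        collator.setStrength(Collator.PRIMARY);
        return collator;
    }

    // True when both strings map to collation keys that compare as equal.
    static boolean sameKey(String a, String b) {
        Collator collator = primaryCollator();
        CollationKey keyA = collator.getCollationKey(a);
        CollationKey keyB = collator.getCollationKey(b);
        return keyA.compareTo(keyB) == 0;
    }

    // Length of the binary sort key; these bytes are not UTF-8 text.
    static int keyLength(String s) {
        return primaryCollator().getCollationKey(s).toByteArray().length;
    }

    public static void main(String[] args) {
        // Different originals, equal keys: the mapping is one-way.
        System.out.println("lucene vs LUCENE same key: " + sameKey("lucene", "LUCENE"));
        System.out.println("lucene vs solr same key:   " + sameKey("lucene", "solr"));
        System.out.println("key length for 'lucene':   " + keyLength("lucene"));
    }
}
```

The same property suggests that anything grouping on the raw key bytes would treat "lucene" and "LUCENE" as one value, but would have no readable label for it unless the original string is stored separately.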
