lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mike Sokolov (JIRA)" <>
Subject [jira] [Commented] (LUCENE-5620) LowerCaseFilter.preserveOriginal
Date Sat, 19 Apr 2014 18:32:14 GMT


Mike Sokolov commented on LUCENE-5620:

bq  whether you have field1:UPPER->0 and field1:upper->0, or field1:UPPER->0 and
field2:upper->0 makes no difference.

Yes, I see that.  But if you have field1:lower->0 *and* field2:lower->0 then you have
doubled the number of postings required, and most terms in English are going to be lower-case

> LowerCaseFilter.preserveOriginal
> --------------------------------
>                 Key: LUCENE-5620
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Mike Sokolov
>         Attachments: LUCENE-5620.patch
> Following closely the model of LUCENE-5437 (which worked on ASCIIFoldingFilter), this
patch adds the ability to preserve the original token to LowerCaseFilter.  This is useful
if you want an all-lowercase search term to match without regard to case, while search terms
with uppercase letters match in a case-sensitive manner. 

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message