lucene-dev mailing list archives

From "Mike Sokolov (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (LUCENE-5620) LowerCaseFilter.preserveOriginal
Date Sun, 20 Apr 2014 22:12:14 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13975300#comment-13975300 ]

Mike Sokolov edited comment on LUCENE-5620 at 4/20/14 10:11 PM:
----------------------------------------------------------------

Well, nesting is probably not a real use case, but it could certainly happen.  In my test I
tried:

preserve, lower-case, preserve, ascii-fold, restore, restore

which gives "test", "tést", "Tést" for input of "Tést".

Why you would want to do this, I have no idea.

A possible use case is preserve, lower-case, preserve, kstem, restore, restore, since kstem
can only handle lower-case terms, although it would work fine the other way.
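
For illustration, the nested chain from the test can be modelled in a few lines of plain
Python. This is only a sketch of the preserve/restore idea, not the code in the patch: the
step names, the `run_chain` helper, and the accent-stripping stand-in for ASCII folding are
all assumptions made for the example, and no Lucene classes are used.

```python
import unicodedata


def ascii_fold(term):
    """Rough stand-in for ASCII folding: drop combining marks after NFD."""
    decomposed = unicodedata.normalize("NFD", term)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Hypothetical step names mapped to per-term transformations.
FILTERS = {"lower-case": str.lower, "ascii-fold": ascii_fold}


def run_chain(term, steps):
    """Apply the steps to one input term and return the tokens emitted."""
    tokens = [term]   # tokens currently at this position
    saved = []        # stack of snapshots taken by "preserve"
    for step in steps:
        if step == "preserve":
            saved.append(list(tokens))
        elif step == "restore":
            # Re-emit any saved form that the filters in between changed.
            for original in saved.pop():
                if original not in tokens:
                    tokens.append(original)
        else:
            tokens = [FILTERS[step](t) for t in tokens]
    return tokens


chain = ["preserve", "lower-case", "preserve", "ascii-fold", "restore", "restore"]
print(run_chain("Tést", chain))  # ['test', 'tést', 'Tést']
```

Each restore pops the most recent preserve, so the inner pair brings back "tést" and the
outer pair brings back "Tést". A term that the filters leave unchanged produces no extra
tokens in this model.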



> LowerCaseFilter.preserveOriginal
> --------------------------------
>
>                 Key: LUCENE-5620
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5620
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Mike Sokolov
>         Attachments: LUCENE-5620.patch, LUCENE-5620.patch
>
>
> Following closely the model of LUCENE-5437 (which worked on ASCIIFoldingFilter), this
> patch adds the ability to preserve the original token to LowerCaseFilter.  This is useful
> if you want an all-lowercase search term to match without regard to case, while search terms
> with uppercase letters match in a case-sensitive manner.
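
The matching behaviour described in the issue can be sketched with a toy index in plain
Python. The helper names `index_terms` and `matches` are hypothetical, and the sketch
assumes that query terms are looked up as typed (not lower-cased at query time); it does
not use the Lucene API.

```python
def index_terms(text, preserve_original=True):
    """Return the set of terms indexed for a whitespace-tokenized text."""
    terms = set()
    for word in text.split():
        terms.add(word.lower())      # lower-cased form is always indexed
        if preserve_original:
            terms.add(word)          # original form is kept alongside it
    return terms


def matches(query_term, doc_text):
    """Assumption: the query term is looked up exactly as typed."""
    return query_term in index_terms(doc_text)


print(matches("apple", "Apple pie"))   # True: lowercase query ignores case
print(matches("Apple", "Apple pie"))   # True: exact-case match
print(matches("Apple", "apple pie"))   # False: uppercase query is case-sensitive
```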



--
This message was sent by Atlassian JIRA
(v6.2#6252)
