lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jan Høydahl (JIRA) <j...@apache.org>
Subject [jira] [Commented] (SOLR-7534) Handle internationalized quotes in queries
Date Fri, 05 Jun 2015 22:24:01 GMT

    [ https://issues.apache.org/jira/browse/SOLR-7534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575344#comment-14575344
] 

Jan Høydahl commented on SOLR-7534:
-----------------------------------

Sure. Also have a customer facing problems with variants of '. You have ` and ´ and probably
more as well, which may cause differences in how tokens are split up etc.

> Handle internationalized quotes in queries
> ------------------------------------------
>
>                 Key: SOLR-7534
>                 URL: https://issues.apache.org/jira/browse/SOLR-7534
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Dawid Weiss
>            Priority: Minor
>
> This is real feedback from a customer:
> bq. Don't talk to me about “ and " as this is the number one problem we have with people
composing SOLR phrase queries.
> It's kind of funny at first... until you realize how many different quote characters
are out there and that many applications (for example Microsoft Word) automatically "convert"
standard ASCII quotes into locale-sensitive unicode variants (examples on blogs, documentation,
etc.).
> Perhaps there's a way to parse those various quote characters with some leniency?
> http://en.wikipedia.org/wiki/Quotation_mark#Summary_table_for_all_languages



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message