lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-5503) Trivial fixes to WeightedSpanTermExtractor
Date Fri, 07 Mar 2014 20:32:42 GMT
Tim Allison created LUCENE-5503:
-----------------------------------

             Summary: Trivial fixes to WeightedSpanTermExtractor
                 Key: LUCENE-5503
                 URL: https://issues.apache.org/jira/browse/LUCENE-5503
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/highlighter
    Affects Versions: 5.0
            Reporter: Tim Allison
            Priority: Minor
         Attachments: LUCENE-5503.patch

The conversion of PhraseQuery to SpanNearQuery miscalculates the slop if there are stop words
in some cases.  The issue only really appears if there is more than one intervening run of
stop words: ab the cd the the ef.

I also noticed that the inOrder determination is based on the newly calculated slop, and it
should probably be based on the original phraseQuery.getSlop()

patch and unit tests on way



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message