lucene-dev mailing list archives

From "Upayavira (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-6711) Instead of docCount(), maxDoc() is used for numberOfDocuments in SimilarityBase
Date Sat, 01 Aug 2015 18:51:04 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-6711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14650500#comment-14650500 ]

Upayavira commented on LUCENE-6711:
-----------------------------------

I've often wondered the same sort of thing.

Now, given that this will likely change the score for every single query anyone does on any
Lucene-based search, would it be possible to make this configurable, so that people can choose
which one they want? More particularly, to choose the point at which their scoring will change?

> Instead of docCount(), maxDoc() is used for numberOfDocuments in SimilarityBase
> -------------------------------------------------------------------------------
>
>                 Key: LUCENE-6711
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6711
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/search
>    Affects Versions: 5.2.1
>            Reporter: Ahmet Arslan
>            Priority: Minor
>             Fix For: 5.3
>
>         Attachments: LUCENE-6711.patch
>
>
> {{SimilarityBase.java}} has the following line :
> {code}
>  long numberOfDocuments = collectionStats.maxDoc();
> {code}
> It seems like {{collectionStats.docCount()}}, which returns the total number of documents
> that have at least one term for this field, is a more appropriate statistic here.
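
To make the difference concrete, here is a rough sketch in plain Java with no Lucene dependency. Every name in it (DocCountSketch, numberOfDocuments, weight, the useDocCount flag) is hypothetical, the numbers are invented, and the log-ratio weight is a generic stand-in rather than the formula of any particular Lucene similarity. The flag only illustrates the kind of switch asked about in the comment above; it is not an existing Lucene option. The sketch also assumes that a docCount of -1 means the statistic is unavailable, in which case it falls back to maxDoc.

```java
// Illustrative only: plain Java, no Lucene dependency. All names are hypothetical.
final class DocCountSketch {

    /** Chooses the collection size N that a similarity would use for scoring. */
    static long numberOfDocuments(long maxDoc, long docCount, boolean useDocCount) {
        // Assumption: docCount == -1 means the statistic is unavailable,
        // so fall back to maxDoc in that case.
        if (useDocCount && docCount != -1) {
            return docCount;
        }
        return maxDoc;
    }

    /** A generic log-ratio weight, not the formula of any specific Lucene similarity. */
    static double weight(long numberOfDocuments, long docFreq) {
        return Math.log((double) numberOfDocuments / docFreq);
    }

    public static void main(String[] args) {
        long maxDoc = 1_000_000; // every document in the index (made-up value)
        long docCount = 10_000;  // documents with at least one term in this field (made-up value)
        long docFreq = 100;      // documents containing the query term (made-up value)

        long nMaxDoc = numberOfDocuments(maxDoc, docCount, false);
        long nDocCount = numberOfDocuments(maxDoc, docCount, true);

        System.out.printf("maxDoc:   N=%d weight=%.3f%n", nMaxDoc, weight(nMaxDoc, docFreq));
        System.out.printf("docCount: N=%d weight=%.3f%n", nDocCount, weight(nDocCount, docFreq));
    }
}
```

When a field is present in only a small share of the documents, the two choices of N diverge, and so does any weight derived from N. That is why switching the statistic would shift scores for existing queries.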


