lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <>
Subject [jira] [Commented] (LUCENE-3220) Implement various ranking models as Similarities
Date Mon, 20 Jun 2011 15:52:47 GMT


Robert Muir commented on LUCENE-3220:

a few comments (it generally looks close to me):
* maybe we should use 'numberOfDocuments' instead of 'docNo' and same with 'numberOfFieldTokens'?
this might make the naming more clear
* i'm worried about 'uniqueTermCount', do you know of which implementations require this?
this number is not accurate if the index has more than one segment.

> Implement various ranking models as Similarities
> ------------------------------------------------
>                 Key: LUCENE-3220
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Sub-task
>          Components: core/search
>    Affects Versions: flexscoring branch
>            Reporter: David Mark Nemeskey
>            Assignee: David Mark Nemeskey
>              Labels: gsoc
>         Attachments: LUCENE-3220.patch
>   Original Estimate: 336h
>  Remaining Estimate: 336h
> With [LUCENE-3174|] done, we can finally
work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.
>  * {{EasyStats}}: contains all statistics that might be relevant for a ranking algorithm
>  * {{EasySimilarity}}: the ancestor of all the other similarities. Hides the DocScorers
and as much implementation detail as possible
>  * _BM25_: the current "mock" implementation might be OK
>  * _LM_
>  * _DFR_
> Done:

This message is automatically generated by JIRA.
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message