lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tommaso Teofili (JIRA)" <>
Subject [jira] [Resolved] (LUCENE-5699) Lucene classification score calculation normalize and return lists
Date Mon, 03 Nov 2014 08:06:34 GMT


Tommaso Teofili resolved LUCENE-5699.
    Resolution: Fixed

> Lucene classification score calculation normalize and return lists
> ------------------------------------------------------------------
>                 Key: LUCENE-5699
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Sub-task
>          Components: modules/classification
>            Reporter: Gergő Törcsvári
>            Assignee: Tommaso Teofili
>              Labels: gsoc2014
>             Fix For: 5.0, Trunk
>         Attachments: 06-06-5699.patch, 0730.patch, 0803-base.patch, 0810-base.patch
> Now the classifiers can return only the "best matching" classes. If somebody want it
to use more complex tasks he need to modify these classes for get second and third results
too. If it is possible to return a list and it is not a lot resource why we dont do that?
(We iterate a list so also.)
> The Bayes classifier get too small return values, and there were a bug with the zero
floats. It was fixed with logarithmic. It would be nice to scale the class scores sum vlue
to one, and then we coud compare two documents return score and relevance. (If we dont do
this the wordcount in the test documents affected the result score.)
> With bulletpoints:
> * In the Bayes classification normalized score values, and return with result lists.
> * In the KNN classifier possibility to return a result list.
> * Make the ClassificationResult Comparable for list sorting.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message