lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-5699) Lucene classification score calculation normalize and return lists
Date Wed, 12 Nov 2014 08:38:34 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-5699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207813#comment-14207813
] 

ASF subversion and git services commented on LUCENE-5699:
---------------------------------------------------------

Commit 1638715 from [~teofili] in branch 'dev/trunk'
[ https://svn.apache.org/r1638715 ]

LUCENE-5699 - normalized score for boolean perceptron classifier

> Lucene classification score calculation normalize and return lists
> ------------------------------------------------------------------
>
>                 Key: LUCENE-5699
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5699
>             Project: Lucene - Core
>          Issue Type: Sub-task
>          Components: modules/classification
>            Reporter: Gergő Törcsvári
>            Assignee: Tommaso Teofili
>              Labels: gsoc2014
>             Fix For: 5.0, Trunk
>
>         Attachments: 06-06-5699.patch, 0730.patch, 0803-base.patch, 0810-base.patch
>
>
> Now the classifiers can return only the "best matching" classes. If somebody want it
to use more complex tasks he need to modify these classes for get second and third results
too. If it is possible to return a list and it is not a lot resource why we dont do that?
(We iterate a list so also.)
> The Bayes classifier get too small return values, and there were a bug with the zero
floats. It was fixed with logarithmic. It would be nice to scale the class scores sum vlue
to one, and then we coud compare two documents return score and relevance. (If we dont do
this the wordcount in the test documents affected the result score.)
> With bulletpoints:
> * In the Bayes classification normalized score values, and return with result lists.
> * In the KNN classifier possibility to return a result list.
> * Make the ClassificationResult Comparable for list sorting.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message