[
https://issues.apache.org/jira/browse/LUCENE-5699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14103644#comment-14103644
]
ASF subversion and git services commented on LUCENE-5699:
---------------------------------------------------------
Commit 1619053 from [~teofili] in branch 'dev/trunk'
[ https://svn.apache.org/r1619053 ]
LUCENE-5699 - patch from Gergő Törcsvári for normalized score and return lists
in classification
> Lucene classification score calculation normalize and return lists
> ------------------------------------------------------------------
>
> Key: LUCENE-5699
> URL: https://issues.apache.org/jira/browse/LUCENE-5699
> Project: Lucene - Core
> Issue Type: Sub-task
> Components: modules/classification
> Reporter: Gergő Törcsvári
> Assignee: Tommaso Teofili
> Labels: gsoc2014
> Fix For: 5.0
>
> Attachments: 06-06-5699.patch, 0730.patch, 0803-base.patch,
> 0810-base.patch
>
>
> Now the classifiers can return only the "best matching" classes. If somebody
> want it to use more complex tasks he need to modify these classes for get
> second and third results too. If it is possible to return a list and it is
> not a lot resource why we dont do that? (We iterate a list so also.)
> The Bayes classifier get too small return values, and there were a bug with
> the zero floats. It was fixed with logarithmic. It would be nice to scale the
> class scores sum vlue to one, and then we coud compare two documents return
> score and relevance. (If we dont do this the wordcount in the test documents
> affected the result score.)
> With bulletpoints:
> * In the Bayes classification normalized score values, and return with result
> lists.
> * In the KNN classifier possibility to return a result list.
> * Make the ClassificationResult Comparable for list sorting.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]