lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tommaso Teofili (JIRA)" <>
Subject [jira] [Created] (SOLR-3700) Create a Classification component
Date Wed, 01 Aug 2012 20:02:02 GMT
Tommaso Teofili created SOLR-3700:

             Summary: Create a Classification component
                 Key: SOLR-3700
             Project: Solr
          Issue Type: New Feature
            Reporter: Tommaso Teofili
            Priority: Minor

Lucene/Solr can host huge sets of documents containing lots of information in fields so that
these can be used as training examples (w/ features) in order to very quickly create classifiers
algorithms to use on new documents and / or to provide an additional service.
So the idea is to create a contrib module (called 'classification') to host a ClassificationComponent
that will use already seen data (the indexed documents / fields) to classify new documents
/ text fragments.
The first version will contain a (simplistic) Lucene based Naive Bayes classifier but more
implementations should be added in the future.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message