Hi Chris,
Yes, people have done classification with Lucene before. Have a look at
http://search-lucene.com/?q=classifier&fc_project=Lucene for some discussions
and actual code (in old JIRA issues)
Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
----- Original Message ----
> From: Chris Spencer <chrisspen@gmail.com>
> To: java-user@lucene.apache.org
> Sent: Wed, April 6, 2011 7:46:45 PM
> Subject: Indexing Non-Textual Data
>
> Hi,
>
> I'm new to Lucene, so forgive me if this is a newbie question. I have a
> dataset composed of several thousand lists of 128 integer features, each
> list associated with a class label. Would it be possible to use Lucene as a
> classifier, by indexing the label with respect to these integer features,
> and then classify a new list by finding the most similar labels with Lucene?
>
> I'm specifically interested in doing so through the PyLucene API, so I've
> been going through the PyLucene samples, but they only seem to involve
> indexing text, not continuous features (understandably). Could anyone point
> me to an example that indexes non-textual data?
>
> I think the project Lire (http://www.semanticmetadata.net/lire/) is using
> Lucene to do something similar to this, although with an emphasis on image
> features. I've dug into their code a little, but I'm not a strong Java
> programmer, so I'm not sure how they're pulling it off, nor how I might
> translate this into the PyLucene API. In your opinion, is this a practical
> use of Lucene?
>
> Regards,
> Chris
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
|