lucene-java-user mailing list archives

From Ahmet Arslan <>
Subject Re: Document Term matrix
Date Tue, 11 Nov 2014 22:16:31 GMT

Mahout and Carrot2 can both cluster documents from a Lucene index.


On Tuesday, November 11, 2014 10:37 PM, Elshaimaa Ali <> wrote:
Hi All,
I have a Lucene index, built with Lucene 4.9, that holds 584 text documents. I need to extract
a document-term matrix and a document-document similarity matrix in order to cluster the documents.
My questions:
1- How can I extract the matrix and compute the similarity between documents in Lucene?
2- Is there any Java-based code that can cluster the documents from a Lucene index?
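
To make question 1 concrete, here is a minimal sketch of the two matrices in plain Java. It is not taken from the thread and does not call Lucene. It assumes each document is already available as an array of tokens, and it uses raw term counts with cosine similarity, which is only one common choice. The class and method names (DocTermMatrix, vocabulary, termCounts, cosine, similarityMatrix) are hypothetical. With a real index, the per-document counts could likely be read from stored term vectors (for example through IndexReader.getTermVector), provided the field was indexed with term vectors enabled.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: builds a document-term matrix
// from pre-tokenized documents and compares documents by cosine similarity.
class DocTermMatrix {

    // Assign each distinct term a column index, in first-seen order.
    static Map<String, Integer> vocabulary(List<String[]> docs) {
        Map<String, Integer> vocab = new LinkedHashMap<>();
        for (String[] doc : docs) {
            for (String term : doc) {
                if (!vocab.containsKey(term)) {
                    vocab.put(term, vocab.size());
                }
            }
        }
        return vocab;
    }

    // Rows are documents, columns are terms, cells are raw term counts.
    static double[][] termCounts(List<String[]> docs, Map<String, Integer> vocab) {
        double[][] matrix = new double[docs.size()][vocab.size()];
        for (int d = 0; d < docs.size(); d++) {
            for (String term : docs.get(d)) {
                matrix[d][vocab.get(term)] += 1.0;
            }
        }
        return matrix;
    }

    // Cosine similarity of two rows; defined as 0 if either row is all zeros.
    static double cosine(double[] a, double[] b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Symmetric document-document similarity matrix.
    static double[][] similarityMatrix(double[][] matrix) {
        int n = matrix.length;
        double[][] sim = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = i; j < n; j++) {
                sim[i][j] = cosine(matrix[i], matrix[j]);
                sim[j][i] = sim[i][j];
            }
        }
        return sim;
    }

    public static void main(String[] args) {
        // Toy input standing in for tokens read from an index.
        List<String[]> docs = new ArrayList<>();
        docs.add(new String[] {"lucene", "index", "lucene"});
        docs.add(new String[] {"index", "cluster"});
        Map<String, Integer> vocab = vocabulary(docs);
        double[][] sim = similarityMatrix(termCounts(docs, vocab));
        System.out.println("similarity(doc0, doc1) = " + sim[0][1]);
    }
}
```

For 584 documents the full similarity matrix has only 584 x 584 entries, so the dense arrays used here should be manageable. A TF-IDF weighting could replace the raw counts in termCounts without changing the rest of the sketch.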
