lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ahmet Arslan <iori...@yahoo.com.INVALID>
Subject Re: Document Term matrix
Date Tue, 11 Nov 2014 22:16:31 GMT
Hi,

Mahout and Carrot2 can cluster the documents from lucene index.

ahmet



On Tuesday, November 11, 2014 10:37 PM, Elshaimaa Ali <elshaimaa.ali@hotmail.com> wrote:
Hi All,
I have a Lucene index built with Lucene 4.9 for 584 text documents, I need to extract a Document-term
matrix, and Document Document similarity matrix in-order to use it to cluster the documents.
My questions:1- How can I extract the matrix and compute the similarity between documents
in Lucene.2- Is there any java based code that can cluster the documents from Lucene index.
RegardsShaimaa 

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message