mahout-user mailing list archives

From Ankit Goel
Subject Partial Solr Index Clustering
Date Tue, 21 Jul 2015 03:17:55 GMT
I was wondering whether it's possible to cluster only part of a Solr index.
For example, my crawler updates my Solr index every hour with new documents,
and I just want to cluster those new documents, not the old ones. If I were
writing this by hand, I could query Solr for the latest documents using a
time constraint and then pass them as vectors to my clustering program. But
since Mahout accepts Solr indices directly, I thought there might be a
simpler way.
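
To make the manual approach concrete, here is a minimal sketch of the time-constrained query. It only builds the request URL. The core name `mycore`, the date field `indexed_at`, and the returned fields `id` and `text` are assumptions for illustration and would need to match the actual schema.

```python
from urllib.parse import urlencode


def build_recent_docs_url(base_url, date_field="indexed_at", window="1HOUR", rows=1000):
    """Build a Solr select URL that matches only recently indexed documents.

    date_field, window and rows are illustrative defaults, not values
    taken from any real schema.
    """
    params = {
        "q": "*:*",                                   # match every document...
        "fq": f"{date_field}:[NOW-{window} TO NOW]",  # ...then filter by Solr date math
        "fl": "id,text",                              # hypothetical fields to return
        "rows": rows,
        "wt": "json",
    }
    return f"{base_url.rstrip('/')}/select?{urlencode(params)}"


# Hypothetical local core; adjust host, port and core name as needed.
url = build_recent_docs_url("http://localhost:8983/solr/mycore")
print(url)
```

The documents returned by such a query would still have to be converted to vectors before clustering, which is the extra step I was hoping to avoid.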

Ankit Goel
