nutch-dev mailing list archives

From Fredrik Andersson
Subject Global term vector exists?
Date Sun, 04 Sep 2005 19:02:22 GMT
Hi gang!

Is there an accessible global term vector of all the terms encountered in a
Lucene/Nutch index, or d'you have to build it yourself by enumerating all
the documents and gathering their individual terms? I'm also wondering, have
there been any previous attempts to make Nutch use a latent semantic
indexing approach (matching queries by vector angles rather than keywords)?
The map-reduce framework could really come in handy in this area.
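
To make the question concrete, here is a rough sketch of the "build it yourself" route in plain Java. It makes no Lucene or Nutch calls: the class TermVectorSketch and its methods are hypothetical names made up for illustration, and documents are assumed to be already tokenized into lists of strings. It only covers the plain vector-space step (a global vocabulary plus matching by vector angle); real latent semantic indexing would additionally reduce the term-document matrix with an SVD, which is left out here.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.SortedMap;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration class; not part of Lucene or Nutch.
public class TermVectorSketch {

    // Build the global vocabulary by enumerating every document's terms.
    // Each distinct term gets a fixed position (its dimension in the vector space).
    static SortedMap<String, Integer> buildVocabulary(List<List<String>> docs) {
        SortedSet<String> terms = new TreeSet<>();
        for (List<String> doc : docs) {
            terms.addAll(doc);
        }
        SortedMap<String, Integer> vocab = new TreeMap<>();
        int position = 0;
        for (String term : terms) {
            vocab.put(term, position++);
        }
        return vocab;
    }

    // Turn a token list into a raw term-frequency vector over the vocabulary.
    // Tokens that are not in the vocabulary are ignored.
    static double[] toVector(List<String> tokens, Map<String, Integer> vocab) {
        double[] vector = new double[vocab.size()];
        for (String token : tokens) {
            Integer position = vocab.get(token);
            if (position != null) {
                vector[position] += 1.0;
            }
        }
        return vector;
    }

    // Cosine of the angle between two vectors; 0.0 if either vector is all zeros.
    static double cosine(double[] a, double[] b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        List<List<String>> docs = Arrays.asList(
                Arrays.asList("nutch", "crawler"),
                Arrays.asList("lucene", "index", "nutch"));
        SortedMap<String, Integer> vocab = buildVocabulary(docs);
        double[] query = toVector(Arrays.asList("lucene"), vocab);

        // Score every document by the angle between it and the query.
        for (List<String> doc : docs) {
            double score = cosine(query, toVector(doc, vocab));
            System.out.println(doc + " -> " + score);
        }
    }
}
```

Enumerating every document like this is a full pass over the collection, which is why the map-reduce framework looks like a natural fit for the vocabulary-building step.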

