mahout-user mailing list archives

From Sznajder ForMailingList <bs4mailingl...@gmail.com>
Subject Meaning of seqdump output on a cluster file
Date Sun, 02 Feb 2014 12:32:02 GMT
Hi,

I am using Mahout 0.5 (the version that corresponds to the Mahout in Action
book).

I ran k-means clustering and then ran seqdump on the clusters file.
Here is a sample of the output:

Input Path: log-kmeans-clusters-monogram-sim_0_1/clusters-9/part-r-00000
Key class: class org.apache.hadoop.io.Text Value Class: class org.apache.mahout.clustering.kmeans.Cluster
Key: VL-513: Value: VL-513{n=26 c=[72:0.308, 404:0.354, ....


What do the numbers 72, 404, etc. mean?

Can I map them back to the original document text?

Benjamin
