mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Miles Osborne" <>
Subject Re: kmeans inital cluster selection
Date Wed, 02 Jul 2008 23:34:33 GMT
why not just have N distinct keys and within the mapper, assign each item
one of these keys (chosen randomly)


2008/7/3 Mark Snow <>:

> I was looking through the kmeans code. As I recall, a good way to pick the
> inital cluster positions is to choose random data points. Is there an easy
> way to do 'randomly select N records' in map reduce?

The University of Edinburgh is a charitable body, registered in Scotland,
with registration number SC005336.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message