mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rajesh Nikam <rajeshni...@gmail.com>
Subject Re: Figuring out good values for t1 and t2 for canopy
Date Fri, 01 Feb 2013 11:40:07 GMT
I feel having t1/t2 closer to 0 means very similar instances and 1 mean no
similarity.
You could try with 0.1 and going on either side until you get satisfactory
clusters.

Thanks
Rajesh


On Fri, Feb 1, 2013 at 4:59 PM, Chris Harrington <chris@heystaks.com> wrote:

> Seems my lack of any clusters what so ever was my own fault, wasn't
> pointing at the correct directory.
>
> Though I would still like to find some good material on this topic of
> figuring out t1 and t2, is it just trial and error or are there specific
> features of my data set that I can look at to infer at least marginally
> good values as a starting point?
>
>
> On 31 Jan 2013, at 22:37, Stefan Kreuzer wrote:
>
> > Hi Chris,
> >
> > I am also experimenting with CC. For me chosing CosineDistanceMeasure
> and values very close to 1 (>0.96) with T2 being only a little smaller than
> T1 led to reasonable values for k. Although this puzzles me too, I just
> asked a another question because of this.
> >
> >
> > -----Urspr√ľngliche Mitteilung-----
> > Von: Chris Harrington <chris@heystaks.com>
> > An: user <user@mahout.apache.org>
> > Verschickt: Do, 31 Jan 2013 7:22 pm
> > Betreff: Figuring out good values for t1 and t2 for canopy
> >
> >
> > Hi all,
> >
> > I'm trying to run canopy clustering before means and I can't seem to get
> a value
> > for t1 and t2 that give me any results.
> > No matter what values I use it results in no clusters.
> >
> > This is probably due to a severe lack of knowledge on the subject on my
> part so
> > can anyone point me toward some good resources to read up on the topic of
> > choosing a distance measure and a t1 and t2 for that measure?
> >
> >
> >
> >
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message