mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dmitriy Lyubimov <dlie...@gmail.com>
Subject Re: need help on mahout
Date Fri, 09 Nov 2012 18:03:44 GMT
Hm. you just said you want to supervise it? That's what i suggested first
but you said you wanted supervised classficiation. Clustering is
unsupervised (at least in methods with Mahout... ).


On Fri, Nov 9, 2012 at 9:58 AM, qiaoresearcher <qiaoresearcher@gmail.com>wrote:

> yes, it is. Does mahout has examples or similar example to do this:  read
> the gzip file, construct page set, form vectors for each user, then run as
> rabbit
>
> On Fri, Nov 9, 2012 at 11:47 AM, Sean Owen <srowen@gmail.com> wrote:
>
> > That's a clustering problem, no?
> >
> >
> > On Fri, Nov 9, 2012 at 4:43 PM, qiaoresearcher <qiaoresearcher@gmail.com
> > >wrote:
> >
> > > It is a supervised classification problem.
> > >
> > > For example, a very simple case:
> > > say, overall we collect 4 pages from the data set:  { web_page 1
> >  web_page
> > > 2 web_page 3 web_page 4  }
> > > then users may have input vectors like:
> > > user1 [1 1  0  0]
> > > user2 [1 1  0  0]
> > > user3 [0 0  1  1]
> > > user4 [0 0  1  1]
> > > user5 [0 0  1  1]
> > >   ...       ....
> > >
> > > then whatever classification algorithm mahout has should return
> > > classification results as
> > > group 1 { user1, user2}
> > > group 2 { user3, user4, user5 }
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message