mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Is any more detailed documentation aout the sgd logistic regression example.
Date Sat, 07 May 2011 07:24:53 GMT
Huh?

What program are you talking about?

On Fri, May 6, 2011 at 9:36 PM, Xiaobo Gu <guxiaobo1982@gmail.com> wrote:

> >> > 2. In production mode, don't use csv, you will find most of the time
> >> spent
> >> > are on parse the csv data and hash them to features. You might encode
> the
> >> > feature to vector and serialize them to the file system by MapReduce
> to
> >> > reduce cost on data parsing.
> >>
> >> Currentlly we are not familiar with Vectors, is there a standard way
> >> (command line )to encode csv files into Vector and serialize them into
> >> file system,
> >>
> >
> > There isn't a good command line for this, largely because it is difficult
> to
> > describe how to convert each CSV field.  There is some beginnings of
> efforts
> > on this, but the results are still limit.
> >
> >
> >> And what do you mean by "file system", local file system or HDFS,
> >> because you mentioned MapReduce
>
> How can I specify a HDFS URI for the --input option

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message