mahout-user mailing list archives

From Xiaobo Gu <guxiaobo1...@gmail.com>
Subject Re: Is there any more detailed documentation about the sgd logistic regression example?
Date Sat, 07 May 2011 10:16:15 GMT
trainlogistic and runlogistic

2011/5/7, Ted Dunning <ted.dunning@gmail.com>:
> Huh?
>
> What program are you talking about?
>
> On Fri, May 6, 2011 at 9:36 PM, Xiaobo Gu <guxiaobo1982@gmail.com> wrote:
>
>> >> > 2. In production mode, don't use CSV; you will find that most of the
>> >> > time is spent parsing the CSV data and hashing it into features. You
>> >> > might encode the features as vectors and serialize them to the file
>> >> > system with MapReduce to reduce the cost of data parsing.
>> >>
>> >> Currently we are not familiar with Vectors. Is there a standard way
>> >> (command line) to encode CSV files into Vectors and serialize them to
>> >> the file system?
>> >>
>> >
>> > There isn't a good command line for this, largely because it is
>> > difficult to describe how to convert each CSV field.  There are some
>> > beginnings of efforts on this, but the results are still limited.
>> >
>> >
>> >> And what do you mean by "file system": the local file system or HDFS?
>> >> You mentioned MapReduce.
>>
>> How can I specify an HDFS URI for the --input option?
>
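
For reference, the quoted advice (hash the CSV fields into a feature vector once, then store the vector in binary form so later passes skip the parsing) can be sketched in plain Java. This is only an illustration under stated assumptions: it does not use Mahout's own Vector or encoder classes, and the class name HashedCsvEncoder, the NUM_FEATURES value, and the binary layout are all made up for this example. The naive split on commas also ignores quoted fields.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

// Hypothetical helper, not part of Mahout: hashes CSV fields into a
// fixed-size vector and writes/reads that vector in a simple binary form.
final class HashedCsvEncoder {
    // Assumed vector size; a real model would use a much larger value.
    static final int NUM_FEATURES = 16;

    // Hash each "column=value" token into one slot and count it there.
    static double[] encode(String[] header, String csvLine) {
        String[] fields = csvLine.split(",", -1); // naive: no quoted commas
        double[] vector = new double[NUM_FEATURES];
        for (int i = 0; i < header.length && i < fields.length; i++) {
            String token = header[i] + "=" + fields[i].trim();
            int slot = Math.floorMod(token.hashCode(), NUM_FEATURES);
            vector[slot] += 1.0;
        }
        return vector;
    }

    // Assumed layout: an int length followed by that many doubles.
    static void write(DataOutputStream out, double[] v) throws IOException {
        out.writeInt(v.length);
        for (double x : v) {
            out.writeDouble(x);
        }
    }

    static double[] read(DataInputStream in) throws IOException {
        int n = in.readInt();
        double[] v = new double[n];
        for (int i = 0; i < n; i++) {
            v[i] = in.readDouble();
        }
        return v;
    }

    public static void main(String[] args) throws IOException {
        String[] header = {"age", "city", "plan"};
        double[] v = encode(header, "42,Beijing,gold");

        // Serialize once; training passes would read this instead of the CSV.
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        write(new DataOutputStream(buf), v);
        double[] back = read(new DataInputStream(
                new ByteArrayInputStream(buf.toByteArray())));

        System.out.println("encoded:    " + Arrays.toString(v));
        System.out.println("round trip: " + Arrays.equals(v, back));
    }
}
```

The same write and read methods work on any stream, so the encoded records could go to a local file or to a stream opened on HDFS; which of the two the command-line tools accept is the open question above.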

-- 
Sent from my mobile device
