mahout-user mailing list archives

From "Goldstein, Alex" <>
Subject SGD in Mahout
Date Mon, 29 Oct 2012 17:18:27 GMT

Hi, I hope someone can help me out.
At the company I work for, we run SGD algorithms in STATA, and we have recently been testing
Mahout and R because we need to run the model on a lot of data.
STATA has been the analytics group's preference, and they are comfortable with its results.
An initial test in R gave similar results, but processing was really slow in comparison.
I am now trying Mahout. Using trainlogistic with the input file, the correct target, and the
predictive variables, the speed is great, but the results are way off from what we expected.
The coefficients of the function are not even close.

Can anyone point me in the right direction on how to write our own code to run the SGD
algorithm in Mahout? I haven't found much documentation on this; even in the book Mahout in
Action the documentation seems scarce.

In STATA there are very few options to set. You simply run the logistic regression with the
target variable and the predictive variables, and that's it.

I'm sure I'll need to write my own code for this, but I wanted some pointers from anyone who
has worked with the SGD algorithm extensively.
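
For context, here is a minimal sketch of what an SGD fit of a logistic regression does, in
plain Python rather than Mahout code. Everything in it is an assumption made for
illustration: the toy data, the function names (sgd_logistic, predict), and the parameter
values are hypothetical and are not taken from Mahout, R, or STATA. It only shows that SGD
coefficients depend on the learning rate, the number of passes over the data, and any
regularization penalty, so the same data can give quite different coefficients under
different settings.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def sgd_logistic(rows, labels, learning_rate=0.01, passes=5000, l2=0.0, seed=0):
    """Fit a logistic regression by plain SGD.

    Returns [intercept, w1, w2, ...]. The L2 penalty is applied to the
    slopes only, not to the intercept.
    """
    rng = random.Random(seed)
    weights = [0.0] * (len(rows[0]) + 1)
    order = list(range(len(rows)))
    for _ in range(passes):
        rng.shuffle(order)  # visit the examples in a new random order each pass
        for i in order:
            x = [1.0] + list(rows[i])  # leading 1.0 is the intercept term
            p = sigmoid(sum(w * v for w, v in zip(weights, x)))
            error = labels[i] - p
            for j, v in enumerate(x):
                penalty = l2 * weights[j] if j > 0 else 0.0
                weights[j] += learning_rate * (error * v - penalty)
    return weights


def predict(weights, row):
    # Predicted probability of the positive class for one example.
    x = [1.0] + list(row)
    return sigmoid(sum(w * v for w, v in zip(weights, x)))


# Hypothetical toy data: one predictor, classes that overlap at x = 2 and x = 3.
rows = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
labels = [0, 0, 1, 0, 1, 1]

few = sgd_logistic(rows, labels, passes=1)          # stopped far too early
many = sgd_logistic(rows, labels, passes=5000)      # close to the unpenalized fit
regularized = sgd_logistic(rows, labels, passes=5000, l2=0.5)  # shrunk slope

print("one pass:    ", few)
print("many passes: ", many)
print("with L2:     ", regularized)
```

In this sketch the slope after a single pass is much smaller than after many passes, and the
L2 penalty shrinks it as well. If a tool's defaults use few passes or a penalty, its
coefficients may differ from an unpenalized fit for that reason alone.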


