spark-user mailing list archives

From FireFly <zhaoming...@bankofamerica.com>
Subject Re: Issue with using Generalized Linear Regression for Logistic Regression modeling
Date Tue, 03 Apr 2018 19:56:58 GMT
It turns out that the weights were too large (mean around 5000, standard
deviation around 8000) and caused an overflow. After scaling the weights
down, for example to values between 0 and 1, the code converged nicely.
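
For anyone hitting the same thing, here is a rough sketch of the kind of
rescaling I mean. It is plain Python rather than our actual Spark code, and
the function name and sample values are made up for illustration. It divides
every weight by the largest one, so all values land in (0, 1] while the
ratios between weights stay the same.

```python
def rescale_weights(weights):
    """Divide each weight by the maximum so that all values fall in (0, 1].

    Hypothetical helper for illustration; not part of Spark.
    """
    if not weights:
        return []
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    max_w = max(weights)
    if max_w == 0:
        # All weights are zero; there is nothing to rescale.
        return list(weights)
    return [w / max_w for w in weights]


# Made-up sample values on roughly the scale described above.
raw = [5000.0, 13000.0, 250.0, 21000.0]
scaled = rescale_weights(raw)
```

In Spark the same division would be applied to the weight column before
fitting. For an unregularized fit, multiplying all weights by a positive
constant should not move the coefficient estimates, since it only multiplies
the weighted log-likelihood by that constant, though reported standard errors
may change.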

Spark did not report the overflow. We only found it by running the same
data set through R.




