spark-user mailing list archives

From Antony Mayi <>
Subject Re: storing MatrixFactorizationModel (pyspark)
Date Fri, 20 Feb 2015 12:22:22 GMT
Well, I understand the math (having two vectors), but the Python MatrixFactorizationModel object
seems to be just a wrapper around a Java class, so I am not sure how to extract the two RDDs.
Thanks,
Antony.

     On Thursday, 19 February 2015, 16:32, Ilya Ganelin <> wrote:

 Yep. The matrix model has two RDDs of vectors representing the decomposed matrix. You can save
these to disk and reuse them.
On Thu, Feb 19, 2015 at 2:19 AM Antony Mayi <> wrote:

When getting the model out of ALS.train, it would be useful to store it to disk so that the
model can be reused later for subsequent predictions. I am using pyspark and have had no luck
pickling it, either with the standard pickle module or with dill.
Does anyone have a solution for this (note that it is pyspark)?
Thank you,
Antony.
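
The idea suggested in the reply (save the two factor sets, reload them later, and predict from them) can be sketched in plain Python. This is only an illustration under stated assumptions, not the pyspark API: the dicts `user_factors` and `product_factors` and the helpers `save_factors`, `load_factors` and `predict` are hypothetical names standing in for the two RDDs of (id, feature vector) pairs. How to get those RDDs out of the Python wrapper depends on the Spark version and is not shown here.

```python
import os
import pickle
import tempfile

# Hypothetical stand-ins for the two factor sets of a factorization model:
# each maps an id to its latent feature vector. In Spark these would be
# RDDs of (id, features) pairs rather than in-memory dicts.
user_factors = {1: [0.5, 1.0], 2: [1.5, -0.5]}
product_factors = {10: [2.0, 0.5], 20: [0.0, 1.0]}


def save_factors(path, users, products):
    """Write both factor sets to a single file on disk."""
    with open(path, "wb") as f:
        pickle.dump({"user": users, "product": products}, f)


def load_factors(path):
    """Read both factor sets back from disk."""
    with open(path, "rb") as f:
        data = pickle.load(f)
    return data["user"], data["product"]


def predict(users, products, user, product):
    """Predicted rating is the dot product of the two feature vectors."""
    return sum(u * p for u, p in zip(users[user], products[product]))


# Save the factors, reload them, and predict from the reloaded copy.
path = os.path.join(tempfile.mkdtemp(), "als_factors.pkl")
save_factors(path, user_factors, product_factors)
loaded_users, loaded_products = load_factors(path)
rating = predict(loaded_users, loaded_products, 1, 10)
print(rating)
```

The plain feature vectors pickle without trouble because they are ordinary Python lists; it is the wrapper object around the Java class that does not.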
