spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Antony Mayi <antonym...@yahoo.com.INVALID>
Subject Re: storing MatrixFactorizationModel (pyspark)
Date Fri, 20 Feb 2015 12:22:22 GMT
well, I understand the math (having two vectors) but the python MatrixFactorizationModel object
seems to be just a wrapper around java class so not sure how to extract the two RDDs?thx,Antony.


     On Thursday, 19 February 2015, 16:32, Ilya Ganelin <ilganeli@gmail.com> wrote:
   
 

 Yep. the matrix model had two RDD vectors representing the decomposed matrix. You can save
these to disk and re use them. 
On Thu, Feb 19, 2015 at 2:19 AM Antony Mayi <antonymayi@yahoo.com.invalid> wrote:

Hi,
when getting the model out of ALS.train it would be beneficial to store it (to disk) so the
model can be reused later for any following predictions. I am using pyspark and I had no luck
pickling it either using standard pickle module or even dill.
does anyone have a solution for this (note it is pyspark)?
thank you,Antony.


 
   
Mime
View raw message