mahout-user mailing list archives

From: Pat Ferrel
Subject: Re: spark-itemsimilarity out of memory problem
Date: Mon, 22 Dec 2014 19:17:21 GMT
Also Ted has an ebook you can download:

On Dec 22, 2014, at 10:52 AM, Pat Ferrel wrote:

Hi Hani,

I recently read about a very promising project.

If you are looking at spark-itemsimilarity for ecommerce-type recommendations, you may
be interested in some slide decks and blog posts I’ve done on the subject.
Check out:

Also I put up a demo site that uses some of these techniques:

Good luck,

On Dec 21, 2014, at 11:44 PM, AlShater, Hani wrote:

Hi All,

I am trying to use spark-itemsimilarity on a dataset of 160M user interactions.
The job launches and runs successfully on a small dataset of 1M actions.
However, when I try the larger dataset, some Spark stages continuously
fail with an out-of-memory exception.

I tried to change the Spark default
configuration, but I face the same issue again. How should I configure Spark
when using spark-itemsimilarity, or how can I overcome this out-of-memory problem?

Can you please advise?


Hani Al-Shater | Data Science Manager
Mob: +962 790471101 | Phone: +962 65821236
Nouh Al Romi Street, Building number 8, Amman, Jordan


