spark-user mailing list archives

From Sea aj
Subject Training A ML Model on a Huge Dataframe
Date Wed, 23 Aug 2017 12:27:57 GMT

I am trying to feed a huge DataFrame to an ML algorithm in Spark, but it
crashes because it runs out of memory.

Is there a way to train the model on a subset of the data in multiple steps?
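
To make the question concrete, here is a rough sketch of what "training on a
subset in multiple steps" could mean. It is plain Python, not Spark API: the
names sgd_step and train_in_chunks are made up for illustration, the model is a
toy one-feature linear regression, and the data is synthetic. The only point is
that the weights are carried over from one chunk to the next, so the full
dataset never has to be in memory at once.

```python
import random


def sgd_step(weights, chunk, lr):
    """Update (w0, w1) for y ~ w0 + w1 * x using the examples in one chunk."""
    w0, w1 = weights
    for x, y in chunk:
        err = (w0 + w1 * x) - y  # prediction error on this example
        w0 -= lr * err           # gradient of 0.5 * err**2 w.r.t. w0
        w1 -= lr * err * x       # gradient of 0.5 * err**2 w.r.t. w1
    return w0, w1


def train_in_chunks(chunks, epochs=50, lr=0.05):
    """Train incrementally: only one chunk is processed at a time."""
    weights = (0.0, 0.0)
    for _ in range(epochs):
        for chunk in chunks:
            weights = sgd_step(weights, chunk, lr)
    return weights


# Synthetic, noise-free data following y = 2x + 1 (an assumption for the demo).
random.seed(0)
data = [(x, 2.0 * x + 1.0) for x in (random.uniform(-1, 1) for _ in range(300))]

# Three chunks of 100 rows each; in a real out-of-core setting each chunk
# would be loaded from storage on demand instead of sliced from a list.
chunks = [data[i:i + 100] for i in range(0, len(data), 100)]

w0, w1 = train_in_chunks(chunks)
print(w0, w1)
```

This only works for algorithms that can be updated incrementally, such as
gradient-descent-based models; whether a given Spark ML estimator supports
that kind of update is exactly what I am asking.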


