spark-user mailing list archives

From David Hesson <>
Subject Spark event logging with s3a
Date Thu, 08 Nov 2018 21:36:12 GMT
We are trying to use spark event logging with s3a as a destination for event data.

We added these settings to the spark submits:

spark.eventLog.dir s3a://ourbucket/sparkHistoryServer/eventLogs
spark.eventLog.enabled true
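
For readers who want to reproduce this setup, here is a minimal sketch of one way the two settings above could be passed to spark-submit as `--conf key=value` flags. The helper name `build_spark_submit` and the application file `our_job.py` are hypothetical and not from the original message; only the two property names and their values are taken from it. The sketch only builds and prints the command line, it does not launch Spark.

```python
import shlex

# The two settings quoted in the message above.
EVENT_LOG_CONF = {
    "spark.eventLog.enabled": "true",
    "spark.eventLog.dir": "s3a://ourbucket/sparkHistoryServer/eventLogs",
}


def build_spark_submit(app, conf):
    """Return a spark-submit argument list with one --conf flag per setting.

    `app` is the application file to submit (hypothetical in this sketch).
    """
    args = ["spark-submit"]
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    args.append(app)
    return args


if __name__ == "__main__":
    # "our_job.py" is a placeholder application name, not from the message.
    print(shlex.join(build_spark_submit("our_job.py", EVENT_LOG_CONF)))
```

Building the command from a dictionary makes it easy to run the same job with `spark.eventLog.enabled` switched between values when comparing behaviour, which is the comparison described in this message.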

Everything works fine with smaller jobs, and we can see the history data in the history server
that’s also using s3a. However, when we tried a job with a few hundred gigs of data that
goes through multiple stages, it died with an OOM exception (same job works fine with spark.eventLog.enabled

18/10/22 23:07:22 ERROR util.Utils: uncaught error in thread SparkListenerBus, stopping SparkContext

Full stack trace:

Does anyone have any insight or experience with using spark history server with s3a? Is this
problem being caused by perhaps something else in our configs? Any help would be appreciated.