spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron Davidson (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-1771) CoarseGrainedSchedulerBackend is not resilient to Akka restarts
Date Sat, 10 May 2014 22:14:36 GMT
Aaron Davidson created SPARK-1771:
-------------------------------------

             Summary: CoarseGrainedSchedulerBackend is not resilient to Akka restarts
                 Key: SPARK-1771
                 URL: https://issues.apache.org/jira/browse/SPARK-1771
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
            Reporter: Aaron Davidson


The exception reported in SPARK-1769 was propagated through the CoarseGrainedSchedulerBackend,
and caused an Actor restart of the DriverActor. Unfortunately, this actor does not seem to
have been written with Akka restartability in mind. For instance, the new DriverActor has
lost all state about the prior Executors without cleanly disconnecting them. This means that
the driver actually has executors attached to it, but doesn't think it does, which leads to
mayhem of various sorts.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message