spark-issues mailing list archives

From "Marcelo Vanzin (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (SPARK-5098) Number of running tasks become negative after tasks lost
Date Wed, 09 May 2018 20:45:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marcelo Vanzin resolved SPARK-5098.
-----------------------------------
    Resolution: Cannot Reproduce

Pretty sure this has been fixed in one way or another since 1.2.

> Number of running tasks become negative after tasks lost
> --------------------------------------------------------
>
>                 Key: SPARK-5098
>                 URL: https://issues.apache.org/jira/browse/SPARK-5098
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.2.0
>            Reporter: Davies Liu
>            Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> [Stage 0:===========================================================(44 + -14) / 40]
> 15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> [Stage 0:==========================================================(45 + -29) / 40]
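The symptom in the quoted output is in the bracketed counters of the progress lines. Going by the issue title, they are read here as (completed + running) / total, so "(44 + -14) / 40" shows a negative running count and more completed tasks than the stage has. Below is a minimal sketch for flagging such lines in captured driver output. It is not part of Spark, and the helper names `parse_progress` and `is_inconsistent` are hypothetical. The regular expression is based only on the two progress lines shown above and may not match other Spark versions.

```python
import re

# Assumed layout, taken from the lines quoted above:
#   [Stage <id>:<bar>(<completed> + <running>) / <total>]
PROGRESS = re.compile(r"\[Stage (\d+):.*?\((\d+) \+ (-?\d+)\) / (\d+)\]")


def parse_progress(line):
    """Return the counters from a progress line, or None if the line has none."""
    m = PROGRESS.search(line)
    if m is None:
        return None
    stage, completed, running, total = (int(g) for g in m.groups())
    return {"stage": stage, "completed": completed,
            "running": running, "total": total}


def is_inconsistent(progress):
    """True if the running count is negative or completed exceeds total."""
    return progress["running"] < 0 or progress["completed"] > progress["total"]


if __name__ == "__main__":
    sample = "> [Stage 0:=====(44 + -14) / 40]"
    counters = parse_progress(sample)
    print(counters, is_inconsistent(counters))
```

Lines without a progress bar, such as the ERROR and WARN entries above, return None from `parse_progress` and can be skipped.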



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


