spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexander Bessonov (Jira)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-29330) Allow users to chose the name of Spark Shuffle service
Date Wed, 02 Oct 2019 14:46:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-29330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Alexander Bessonov updated SPARK-29330:
---------------------------------------
    Description: 
As of now, Spark uses hardcoded value {{spark_shuffle}} as the name of the Shuffle Service.

HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|https://github.com/hortonworks/spark2-release/blob/HDP-3.1.0.0-78-tag/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ExecutorRunnable.scala#L117].
This is done to be able to run both Spark 1.6 and Spark 2.x on the same Hadoop cluster.

Running vanilla Spark on HDP cluster with only Spark 2.x shuffle service (HDP favor) running
becomes impossible due to the shuffle service name mismatch.

  was:
As of now, Spark uses hardcoded value {{spark_shuffle}} as the name of the Shuffle Service.

HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|#L117]]. This is done
to be able to run both Spark 1.6 and Spark 2.x on the same Hadoop cluster.

Running vanilla Spark on HDP cluster with only Spark 2.x shuffle service (HDP favor) running
becomes impossible due to the shuffle service name mismatch.


> Allow users to chose the name of Spark Shuffle service
> ------------------------------------------------------
>
>                 Key: SPARK-29330
>                 URL: https://issues.apache.org/jira/browse/SPARK-29330
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core, YARN
>    Affects Versions: 2.4.4
>            Reporter: Alexander Bessonov
>            Priority: Minor
>
> As of now, Spark uses hardcoded value {{spark_shuffle}} as the name of the Shuffle
Service.
> HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|https://github.com/hortonworks/spark2-release/blob/HDP-3.1.0.0-78-tag/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ExecutorRunnable.scala#L117].
This is done to be able to run both Spark 1.6 and Spark 2.x on the same Hadoop cluster.
> Running vanilla Spark on HDP cluster with only Spark 2.x shuffle service (HDP favor)
running becomes impossible due to the shuffle service name mismatch.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message