spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "hujiayin (JIRA)" <>
Subject [jira] [Issue Comment Deleted] (SPARK-5682) Add encrypted shuffle in spark
Date Thu, 26 May 2016 05:29:13 GMT


hujiayin updated SPARK-5682:
    Comment: was deleted

(was: Since the encrypted shuffle in spark is focus on the common module, it maybe not good
to use hadoop API. On the other side, the AES solution is a bit heavy to encode/decode the
live steaming data. )

> Add encrypted shuffle in spark
> ------------------------------
>                 Key: SPARK-5682
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: Shuffle
>            Reporter: liyunzhang_intel
>         Attachments: Design Document of Encrypted Spark Shuffle_20150209.docx, Design
Document of Encrypted Spark Shuffle_20150318.docx, Design Document of Encrypted Spark Shuffle_20150402.docx,
Design Document of Encrypted Spark Shuffle_20150506.docx
> Encrypted shuffle is enabled in hadoop 2.6 which make the process of shuffle data safer.
This feature is necessary in spark. AES  is a specification for the encryption of electronic
data. There are 5 common modes in AES. CTR is one of the modes. We use two codec JceAesCtrCryptoCodec
and OpensslAesCtrCryptoCodec to enable spark encrypted shuffle which is also used in hadoop
encrypted shuffle. JceAesCtrypoCodec uses encrypted algorithms  jdk provides while OpensslAesCtrCryptoCodec
uses encrypted algorithms  openssl provides. 
> Because ugi credential info is used in the process of encrypted shuffle, we first enable
encrypted shuffle on spark-on-yarn framework.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message