spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "nicu marasoiu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-7398) Add back-pressure to Spark Streaming (umbrella JIRA)
Date Thu, 01 Oct 2015 08:34:26 GMT

    [ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14939501#comment-14939501
] 

nicu marasoiu commented on SPARK-7398:
--------------------------------------

Hi!

Can I help in any way? Back pressure is fundamental to implementing a reactive pipeline with
spark streaming.
I see that there is a single task which is not resolved, is it a clear thing that I can take
maybe?

Thank you,
Nicu

> Add back-pressure to Spark Streaming (umbrella JIRA)
> ----------------------------------------------------
>
>                 Key: SPARK-7398
>                 URL: https://issues.apache.org/jira/browse/SPARK-7398
>             Project: Spark
>          Issue Type: Improvement
>          Components: Streaming
>    Affects Versions: 1.3.1
>            Reporter: Fran├žois Garillot
>            Assignee: Tathagata Das
>            Priority: Critical
>              Labels: streams
>
> Spark Streaming has trouble dealing with situations where 
>  batch processing time > batch interval
> Meaning a high throughput of input data w.r.t. Spark's ability to remove data from the
queue.
> If this throughput is sustained for long enough, it leads to an unstable situation where
the memory of the Receiver's Executor is overflowed.
> This aims at transmitting a back-pressure signal back to data ingestion to help with
dealing with that high throughput, in a backwards-compatible way.
> The original design doc can be found here:
> https://docs.google.com/document/d/1ZhiP_yBHcbjifz8nJEyPJpHqxB1FT6s8-Zk7sAfayQw/edit?usp=sharing
> The second design doc, focusing [on the first sub-task|https://issues.apache.org/jira/browse/SPARK-8834]
(without all the background info, and more centered on the implementation) can be found here:
> https://docs.google.com/document/d/1ls_g5fFmfbbSTIfQQpUxH56d0f3OksF567zwA00zK9E/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message