spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arun Ramakrishnan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-6599) Improve reliability and usability of Kinesis-based Spark Streaming
Date Sat, 01 Aug 2015 00:09:04 GMT

    [ https://issues.apache.org/jira/browse/SPARK-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14650054#comment-14650054
] 

Arun Ramakrishnan commented on SPARK-6599:
------------------------------------------

[~tdas] Curious about the design docs for this. 

> Improve reliability and usability of Kinesis-based Spark Streaming
> ------------------------------------------------------------------
>
>                 Key: SPARK-6599
>                 URL: https://issues.apache.org/jira/browse/SPARK-6599
>             Project: Spark
>          Issue Type: Improvement
>          Components: Streaming
>            Reporter: Tathagata Das
>            Assignee: Tathagata Das
>
> Currently, the KinesisReceiver can loose some data in the case of certain failures (receiver
and driver failures). Using the write ahead logs can mitigate some of the problem, but it
is not ideal because WALs dont work with S3 (eventually consistency, etc.) which is the most
likely file system to be used in the EC2 environment. Hence, we have to take a different approach
to improving reliability for Kinesis.
> A detailed design doc on how this can be achieved will be added later.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message