beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mark Shields (JIRA)" <>
Subject [jira] [Commented] (BEAM-53) PubSubIO: reimplement in Java
Date Fri, 18 Mar 2016 17:36:33 GMT


Mark Shields commented on BEAM-53:

Oh, and this is just for the Read direction.
Write direction presumably needs UnboundedSink.

> PubSubIO: reimplement in Java
> -----------------------------
>                 Key: BEAM-53
>                 URL:
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-core
>            Reporter: Daniel Halperin
>            Assignee: Mark Shields
> PubSubIO is currently only partially implemented in Java: the DirectPipelineRunner uses
a non-scalable API in a single-threaded manner.
> In contrast, the DataflowPipelineRunner uses an entirely different code path implemented
in the Google Cloud Dataflow service.
> We need to reimplement PubSubIO in Java in order to support other runners in a scalable
> Additionally, we can take this opportunity to add new features:
> * getting timestamp from an arbitrary lambda in arbitrary formats rather than from a
message attribute in only 2 formats.
> * exposing metadata and attributes in the elements produced by PubSubIO.Read
> * setting metadata and attributes in the messages written by PubSubIO.Write

This message was sent by Atlassian JIRA

View raw message