beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ismaël Mejía (JIRA) <j...@apache.org>
Subject [jira] [Commented] (BEAM-160) Port 'NexMark Queries' to Beam for use as integration test
Date Sat, 25 Mar 2017 20:13:41 GMT

    [ https://issues.apache.org/jira/browse/BEAM-160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15941907#comment-15941907
] 

Ismaël Mejía commented on BEAM-160:
-----------------------------------

Query 5 issue on spark runner

> Port 'NexMark Queries' to Beam for use as integration test
> ----------------------------------------------------------
>
>                 Key: BEAM-160
>                 URL: https://issues.apache.org/jira/browse/BEAM-160
>             Project: Beam
>          Issue Type: Test
>          Components: testing
>            Reporter: Mark Shields
>            Assignee: Ismaël Mejía
>
> A while back we implemented the 'queries' from
>   http://datalab.cs.pdx.edu/niagara/NEXMark/
> as Gooogle Dataflow pipelines. We found them useful
> for uncovering performance problems with the sdk, our runners,
> and our service. Many of those problems only manifested under
> high load, multi-day runs, or with high 'backlog' on the incoming
> pub/sub subscriptions.
> We thus think they would be useful for other runners.
> Disclaimer: Though the original 'queries' were proposed as a way to
> benchmark 'continuous SQL' implementations, we have so far only
> used them for internal A/B and regression testing and have not validated
> them as representative of customer workloads. We would thus discourage their use for
competitive benchmarks without more work.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message