beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chamikara Jayalath (JIRA)" <j...@apache.org>
Subject [jira] [Created] (BEAM-3217) Add a performance test for HadoopInputFormatIO
Date Fri, 17 Nov 2017 09:59:00 GMT
Chamikara Jayalath created BEAM-3217:
----------------------------------------

             Summary: Add a performance test for HadoopInputFormatIO
                 Key: BEAM-3217
                 URL: https://issues.apache.org/jira/browse/BEAM-3217
             Project: Beam
          Issue Type: Test
          Components: sdk-java-extensions
            Reporter: Chamikara Jayalath
            Assignee: Chamikara Jayalath


We should add a large scale performance test for HadoopInputFormatIO. We should use PerfKitBenchmarker
based performance testing framework [1] to manage Kubernetes based muti-node data store and
to publish benchmark results.

Example input format implementation to use: DBInputFormat to connect to a Postgres instance.
https://github.com/hanborq/hadoop/blob/master/src/mapred/org/apache/hadoop/mapreduce/lib/db/DBInputFormat.java
 
Example docker image to use: https://hub.docker.com/_/postgres/

[1] https://beam.apache.org/documentation/io/testing/



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message