hadoop-common-issues mailing list archives

From "wujinhu (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HADOOP-15027) AliyunOSS: Support multi-thread pre-read to improve read from Hadoop to Aliyun OSS performance
Date Thu, 11 Jan 2018 08:44:00 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16321869#comment-16321869
] 

wujinhu edited comment on HADOOP-15027 at 1/11/18 8:43 AM:
-----------------------------------------------------------

Hi [~Sammi], here is some performance data. I used this tool (https://github.com/hortonworks/hive-testbench)
to compare run times between this patch and the current version.
{code:java}
query        after     before
query13.sql  241.591   440.524
query28.sql  1259.307  1943.949
query51.sql  469.618   722.904
query73.sql  216.596   414.75
query96.sql  268.869   476.473
{code}
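
To read the table as speedups, the before/after ratio can be computed per query. The sketch below only does that arithmetic on the figures above; it assumes both columns are run times in the same unit (the comment does not state the unit), and the class and method names are illustrative, not part of the patch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class SpeedupTable {
    // Ratio of the old run time to the new one: values above 1.0 mean the patch is faster.
    static double speedup(double before, double after) {
        return before / after;
    }

    public static void main(String[] args) {
        // {after, before} pairs copied from the table above.
        Map<String, double[]> runs = new LinkedHashMap<>();
        runs.put("query13.sql", new double[] {241.591, 440.524});
        runs.put("query28.sql", new double[] {1259.307, 1943.949});
        runs.put("query51.sql", new double[] {469.618, 722.904});
        runs.put("query73.sql", new double[] {216.596, 414.75});
        runs.put("query96.sql", new double[] {268.869, 476.473});

        for (Map.Entry<String, double[]> e : runs.entrySet()) {
            double after = e.getValue()[0];
            double before = e.getValue()[1];
            System.out.printf("%-12s %.2fx%n", e.getKey(), speedup(before, after));
        }
    }
}
```

On these numbers the ratios fall between roughly 1.5x and 1.9x.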





> AliyunOSS: Support multi-thread pre-read to improve read from Hadoop to Aliyun OSS performance
> ----------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-15027
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15027
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/oss
>    Affects Versions: 3.0.0
>            Reporter: wujinhu
>            Assignee: wujinhu
>         Attachments: HADOOP-15027.001.patch, HADOOP-15027.002.patch, HADOOP-15027.003.patch,
>                      HADOOP-15027.004.patch, HADOOP-15027.005.patch, HADOOP-15027.006.patch,
>                      HADOOP-15027.007.patch, HADOOP-15027.008.patch, HADOOP-15027.009.patch,
>                      HADOOP-15027.010.patch, HADOOP-15027.011.patch, HADOOP-15027.012.patch
>
>
> Currently, AliyunOSSInputStream uses a single thread to read data from AliyunOSS, so we
> can refactor it to use multi-threaded pre-read to improve read performance.
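
For readers unfamiliar with the idea, here is a minimal sketch of reading an object in parts on several threads and joining the parts in order. It is not the code from the attached patches: `RangeReader` is a hypothetical stand-in for a ranged read against OSS, and `readAll`, `partSize`, and `threads` are names invented for illustration. It also submits every part up front, whereas a real input stream would presumably bound how far ahead it reads.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PreReadSketch {
    /** Hypothetical stand-in for a ranged read against the object store. */
    interface RangeReader {
        byte[] read(long offset, int length) throws Exception;
    }

    /** Reads [0, size) by fetching fixed-size parts concurrently, then joining them in order. */
    static byte[] readAll(RangeReader reader, long size, int partSize, int threads)
            throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit one task per part; the pool runs up to `threads` of them at once.
            List<Future<byte[]>> parts = new ArrayList<>();
            for (long off = 0; off < size; off += partSize) {
                final long start = off;
                final int len = (int) Math.min(partSize, size - off);
                parts.add(pool.submit(() -> reader.read(start, len)));
            }
            // Collect results in submission order so the bytes stay in sequence.
            byte[] out = new byte[(int) size];
            int pos = 0;
            for (Future<byte[]> part : parts) {
                byte[] chunk = part.get();
                System.arraycopy(chunk, 0, out, pos, chunk.length);
                pos += chunk.length;
            }
            return out;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // In-memory "object" used in place of a remote store.
        byte[] object = new byte[1000];
        for (int i = 0; i < object.length; i++) {
            object[i] = (byte) i;
        }
        RangeReader reader = (off, len) -> Arrays.copyOfRange(object, (int) off, (int) off + len);
        byte[] copy = readAll(reader, object.length, 128, 4);
        System.out.println(Arrays.equals(object, copy));
    }
}
```

The gain in such a design comes from overlapping the latency of several remote reads; the part size and thread count are tuning knobs whose best values depend on the workload.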



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


