hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Haibo Chen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-15507) Add MapReduce counters about EC bytes read
Date Mon, 04 Jun 2018 17:53:00 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16500612#comment-16500612

Haibo Chen commented on HADOOP-15507:

The MapReduce part looks good to me, but I'm not very familiar with the erase code or the
file system API to comment on the rest of the patch.

> Add MapReduce counters about EC bytes read
> ------------------------------------------
>                 Key: HADOOP-15507
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15507
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Xiao Chen
>            Assignee: Xiao Chen
>            Priority: Major
>         Attachments: HADOOP-15507.01.patch, image-2018-05-31-15-29-45-729.png
> HDFS has added Erasure Coding support in HDFS-7285. There are HDFS level [ReadStatistics|https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/ReadStatistics.java] so
from DFSClient we can know how much reads are EC/replication.
> In order for users to have a better view of how much of their workload is impacted by
EC, we can expose EC read bytes to File System Counters, and to MapReduce's job counters.
This way, end users can tell from MR jobs directly.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org

View raw message