hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Akshay Nehe (Jira)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
Date Tue, 03 Mar 2020 16:34:00 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Akshay Nehe updated HADOOP-16899:
---------------------------------
    Description: 
A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in
the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when one thirds
and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the overall essence
of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, and the other
third are evenly distributed across the remaining racks._

with

-_One third of replicas are on one node, two thirds of replicas are on one rack._-
{quote}{{Two replicas are on different nodes of one rack and the remaining replica is on a
node of one of the other racks.}}
{quote}
In addition to this, found 2 more sentences in same paragraph that will be corrected.

1.
{quote}However, it does reduce the aggregate network bandwidth used when reading data since
a block is placed in only two unique racks rather than three.
{quote}
will replace: does reduce -> does not reduce

2. 
{quote}With this policy, the replicas of a file do not evenly distribute across the racks.
{quote}
will replace: replicas of a file -> replicas of a block 

Please suggest if any additional meaning is getting lost with this replacement.

  was:
A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in
the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when one thirds
and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the overall essence
of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, and the other
third are evenly distributed across the remaining racks._

with

-_One third of replicas are on one node, two thirds of replicas are on one rack._-
{quote}{{Two replicas are on different nodes of one rack and the remaining replica is on a
node of one of the other racks.}}
{quote}
In addition to this, found 2 more sentences in same paragraph that can be corrected.

1.
{quote}However, it does reduce the aggregate network bandwidth used when reading data since
a block is placed in only two unique racks rather than three.
{quote}
will replace: does reduce -> does not reduce

2. 
{quote}With this policy, the replicas of a file do not evenly distribute across the racks.
{quote}
will replace: replicas of a file -> replicas of a block 

Please suggest if any additional meaning is getting lost with this replacement.


> Update HdfsDesign.md to reduce ambiguity
> ----------------------------------------
>
>                 Key: HADOOP-16899
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16899
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: documentation
>            Reporter: Akshay Nehe
>            Priority: Minor
>
> A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in
the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity when one thirds
and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the overall essence
of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one rack, and the
other third are evenly distributed across the remaining racks._
> with
> -_One third of replicas are on one node, two thirds of replicas are on one rack._-
> {quote}{{Two replicas are on different nodes of one rack and the remaining replica is
on a node of one of the other racks.}}
> {quote}
> In addition to this, found 2 more sentences in same paragraph that will be corrected.
> 1.
> {quote}However, it does reduce the aggregate network bandwidth used when reading data
since a block is placed in only two unique racks rather than three.
> {quote}
> will replace: does reduce -> does not reduce
> 2. 
> {quote}With this policy, the replicas of a file do not evenly distribute across the racks.
> {quote}
> will replace: replicas of a file -> replicas of a block 
> Please suggest if any additional meaning is getting lost with this replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org


Mime
View raw message