hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hazem Mahmoud (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-6900) Terasort replication factor hard-coded for partition file (partFile)
Date Thu, 15 Jun 2017 20:39:00 GMT
Hazem Mahmoud created MAPREDUCE-6900:

             Summary: Terasort replication factor hard-coded for partition file (partFile)
                 Key: MAPREDUCE-6900
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6900
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Hazem Mahmoud
            Priority: Minor

When running terasort on a cluster with less than 10 nodes, I get the following:

17/06/12 11:18:21 ERROR terasort.TeraSort: Requested replication factor of 10 exceeds maximum
of 4 for /tmp/hive/tera-out/_partition.lst from

There is no way to set this, as it is hard-coded here:

    DataOutputStream writer = outFs.create(partFile, true, 64*1024, (short) 10,


Had to modify TeraInputFormat.java and rebuild to get it to work. This should be configurable.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: mapreduce-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-dev-help@hadoop.apache.org

View raw message