hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zichen Sun (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-7158) Inefficient Flush Logic in JobHistory EventWriter
Date Mon, 12 Nov 2018 19:26:00 GMT
Zichen Sun created MAPREDUCE-7158:
-------------------------------------

             Summary: Inefficient Flush Logic in JobHistory EventWriter
                 Key: MAPREDUCE-7158
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7158
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Zichen Sun


In HDFS, if the flush is implemented to send server request to actually commit the pending
writes on the storage service side, we could observe in the benchmark runs that the MR jobs
are taking much longer. From investigation we see the current implementation for writing events
doesn't look right:
EventWriter# write()
This flush is redundant and this statement should be removed. It defeats the purpose of having
a separate flush function itself.
Encoder.flush calls flush of the underlying output stream
After patching with the fix the MR jobs could complete normally, please kindly find the patch
in attached.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-dev-help@hadoop.apache.org


Mime
View raw message