hadoop-mapreduce-dev mailing list archives

From "Jeffrey Naisbitt (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-2489) Jobsplits with random hostnames can make the queue unusable
Date Thu, 12 May 2011 19:33:47 GMT
Jobsplits with random hostnames can make the queue unusable

                 Key: MAPREDUCE-2489
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2489
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: jobtracker
            Reporter: Jeffrey Naisbitt
            Assignee: Jeffrey Naisbitt

We saw an issue where a custom InputSplit returned invalid hostnames for its splits, which
caused the JobTracker to make an excessive number of hostname resolution attempts.  This
slowed the JobTracker down significantly.  We should prevent invalid InputSplit hostnames
from affecting everyone else using the queue.

I propose we implement some verification of the hostnames to try to ensure that we only do
DNS lookups on valid hostnames (and fail otherwise).  We could also fail the job after a certain
number of hostname resolution failures.
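
As a rough illustration of the proposal above (not the actual patch), the check could look
something like the sketch below.  The class name SplitHostValidator, its methods, and the
maxFailures threshold are all hypothetical and are not part of Hadoop.  The sketch assumes
that "valid" means syntactically valid per the usual RFC 1123 label rules, so that obviously
bogus names are rejected before any DNS lookup is attempted, and that the job is failed once
too many bad names have been seen.

```java
import java.util.regex.Pattern;

/** Hypothetical helper: screens split hostnames before any DNS lookup is attempted. */
public class SplitHostValidator {
    // One DNS label: 1-63 letters, digits or hyphens, not starting or ending with a hyphen.
    private static final Pattern LABEL =
        Pattern.compile("[A-Za-z0-9]([A-Za-z0-9-]{0,61}[A-Za-z0-9])?");

    private final int maxFailures; // assumed per-job threshold, not an existing Hadoop setting
    private int failures = 0;

    public SplitHostValidator(int maxFailures) {
        this.maxFailures = maxFailures;
    }

    /** Purely syntactic check; performs no network access. */
    public static boolean isSyntacticallyValid(String host) {
        if (host == null || host.isEmpty() || host.length() > 253) {
            return false;
        }
        // The -1 limit keeps trailing empty labels, so "host." and "a..b" are rejected.
        for (String label : host.split("\\.", -1)) {
            if (!LABEL.matcher(label).matches()) {
                return false;
            }
        }
        return true;
    }

    /**
     * Returns true if the host may be passed on to DNS resolution.
     * Returns false for an invalid name, and throws once the number of
     * invalid names exceeds maxFailures so the caller can fail the job.
     */
    public boolean accept(String host) {
        if (isSyntacticallyValid(host)) {
            return true;
        }
        failures++;
        if (failures > maxFailures) {
            throw new IllegalStateException(
                "Too many invalid split hostnames (" + failures + "), last: " + host);
        }
        return false;
    }
}
```

A caller would create one validator per job, call accept() on each hostname reported by a
split, skip resolution for names that return false, and fail the job if the exception is
thrown.  Names that pass this check can still fail to resolve, so a similar counter around
the real lookup would likely be needed as well.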
