giraph-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Avery Ching (JIRA)" <j...@apache.org>
Subject [jira] [Created] (GIRAPH-356) Help debug ZooKeeper issues
Date Fri, 05 Oct 2012 00:43:47 GMT
Avery Ching created GIRAPH-356:
----------------------------------

             Summary: Help debug ZooKeeper issues
                 Key: GIRAPH-356
                 URL: https://issues.apache.org/jira/browse/GIRAPH-356
             Project: Giraph
          Issue Type: Improvement
            Reporter: Avery Ching


Currently, if the ZooKeeper process fails, we have little information on why and what happened.
 This patch addresses this by keeping the last 100 log lines and dumps when the map fails
under a RuntimeException.

Here is an example of a master task failure when there is an invalid JVM argument passed to
ZooKeeper.  The error is much for obvious now.

2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager: logZooKeeperOutput: Dumping
up to last 100 lines of the ZooKeeper process STDOUT and STDERR.
2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager$StreamCollector: Unrecognized
option: -BadOpt
2012-10-04 15:05:28,916 WARN org.apache.giraph.zk.ZooKeeperManager$StreamCollector: Could
not create the Java virtual machine.
2012-10-04 15:05:28,919 INFO org.apache.hadoop.mapred.TaskLogsTruncater: Initializing logs'
truncater with mapRetainSize=-1 and reduceRetainSize=-1
2012-10-04 15:05:28,959 WARN org.apache.hadoop.mapred.Child: Error running child
java.lang.IllegalStateException: run: Caught an unrecoverable exception onlineZooKeeperServers:
Failed to connect in 5 tries!
                                 at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:591)
                                 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
                                 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:369)
                                 at org.apache.hadoop.mapred.Child$4.run(Child.java:259)
                                 at java.security.AccessController.doPrivileged(Native Method)
                                 at javax.security.auth.Subject.doAs(Subject.java:396)
                                 at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
                                 at org.apache.hadoop.mapred.Child.main(Child.java:253)
Caused by: java.lang.IllegalStateException: onlineZooKeeperServers: Failed to connect in 5
tries!
       at org.apache.giraph.zk.ZooKeeperManager.onlineZooKeeperServers(ZooKeeperManager.java:721)
       at org.apache.giraph.graph.GraphMapper.setup(GraphMapper.java:328)
       at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:573)
       ... 7 more
2012-10-04 15:05:28,963 INFO org.apache.hadoop.mapred.Task: Runnning cleanup for the task



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message