hadoop-common-issues mailing list archives

From "Gabor Bota (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-15593) UserGroupInformation TGT renewer throws NPE
Date Fri, 20 Jul 2018 13:32:00 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-15593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16550762#comment-16550762 ]

Gabor Bota commented on HADOOP-15593:
-------------------------------------

Thanks [~eyang] for the review!

There is no difference between the v1 and v2 patches in how this is handled. Please also
note the following comment in the IOException catch block, where getNextTgtRenewalTime is
used and the RetryPolicy is defined:
{noformat}
            // Use a dummy maxRetries to create the policy. The policy will
            // only be used to get next retry time with exponential back-off.
            // The final retry time will be later limited within the
            // tgt endTime in getNextTgtRenewalTime.
{noformat}

I think a solution for this would be to create the RetryPolicy in the try block instead of
in the catch block, so that every renewal time is based on the RetryPolicy.
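
To illustrate what the quoted code comment describes, here is a minimal standalone sketch of an exponential back-off delay that is then limited by the TGT end time. It is not the Hadoop code: the class, method, and constant names are made up for illustration, the back-off is deterministic, and the Hadoop RetryPolicy is not used at all.

```java
public class TgtRenewalSketch {
    /** Hypothetical base delay for the sketch; not a Hadoop constant. */
    static final long BASE_DELAY_MS = 1_000L;

    /** Exponential back-off: base * 2^failures, with the exponent clamped to avoid overflow. */
    static long backoffDelayMs(int failures) {
        int shift = Math.min(Math.max(failures, 0), 30);
        return BASE_DELAY_MS << shift;
    }

    /**
     * Next renewal attempt: now plus the back-off delay, but never later than
     * the TGT end time minus a safety margin (both supplied by the caller).
     */
    static long nextRenewalTimeMs(long nowMs, long tgtEndTimeMs,
                                  long minMsBeforeRelogin, int failures) {
        long lastRetryTimeMs = tgtEndTimeMs - minMsBeforeRelogin;
        return Math.min(lastRetryTimeMs, nowMs + backoffDelayMs(failures));
    }
}
```

With this shape, the number of previous failures only influences the delay, while the ticket end time always wins when the delay would overshoot it.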

As this issue is a blocker and needs to be finished ASAP (and only aims to fix the NPE),
I will create another issue for changing the retry behavior.
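
For readers following along, one way to guard against the NPE could look like the sketch below. This is not the attached patch, just an illustration under the assumption that the end-time getter may either return null or throw a NullPointerException, as in the stack trace quoted below. The class and method names are hypothetical.

```java
import java.util.Date;
import java.util.function.Supplier;

public class TicketEndTimeSketch {
    /**
     * Returns the end time in milliseconds, or fallbackMs when the getter
     * yields null or throws a NullPointerException.
     */
    static long endTimeOrFallback(Supplier<Date> endTimeGetter, long fallbackMs) {
        try {
            Date end = endTimeGetter.get();
            return (end == null) ? fallbackMs : end.getTime();
        } catch (NullPointerException e) {
            // Assumption: the ticket is unusable here, so the caller decides what to do next.
            return fallbackMs;
        }
    }
}
```

A caller holding a javax.security.auth.kerberos.KerberosTicket named tgt could pass tgt::getEndTime as the getter. Checking tgt.isDestroyed() before calling the getter is an alternative that avoids catching the NPE at all.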

> UserGroupInformation TGT renewer throws NPE
> -------------------------------------------
>
>                 Key: HADOOP-15593
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15593
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: security
>    Affects Versions: 3.0.0
>            Reporter: Wei-Chiu Chuang
>            Assignee: Gabor Bota
>            Priority: Blocker
>         Attachments: HADOOP-15593.001.patch, HADOOP-15593.002.patch
>
>
> Found the following NPE thrown in the UGI TGT renewer. The NPE was thrown within an exception handler, so the original exception was hidden, though it's likely caused by an expired TGT.
> {noformat}
> 18/07/02 10:30:57 ERROR util.SparkUncaughtExceptionHandler: Uncaught exception in thread Thread[TGT Renewer for foo@EXAMPLE.COM,5,main]
> java.lang.NullPointerException
>         at javax.security.auth.kerberos.KerberosTicket.getEndTime(KerberosTicket.java:482)
>         at org.apache.hadoop.security.UserGroupInformation$1.run(UserGroupInformation.java:894)
>         at java.lang.Thread.run(Thread.java:748)
> {noformat}
> I suspect it's related to [https://bugs.openjdk.java.net/browse/JDK-8154889].
> The relevant code was added in HADOOP-13590. Filing this JIRA to handle the exception better.



