hadoop-common-issues mailing list archives

From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-15446) WASB: PageBlobInputStream.skip breaks HBASE replication
Date Fri, 04 May 2018 20:22:00 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-15446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16464352#comment-16464352 ]

Steve Loughran commented on HADOOP-15446:

bq.  I don't experience the latency you do, and didn't notice that I forgot to remove the
latency check.

no, I don't expect you do :)

All the checkstyle warnings can be ignored, as they are legit changes.

Given the test is doing real IO, can you
* stick it under org.apache.hadoop.fs.azure.integration
* make it a subclass of org.apache.hadoop.fs.azure.integration.AbstractAzureScaleTest, which ensures the test only runs with a -Dscale option. I'll enable this when I do my premerge test
* give it the prefix ITest

other than that, LGTM


> WASB: PageBlobInputStream.skip breaks HBASE replication
> -------------------------------------------------------
>                 Key: HADOOP-15446
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15446
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs/azure
>    Affects Versions: 2.9.0, 3.0.2
>            Reporter: Thomas Marquardt
>            Assignee: Thomas Marquardt
>            Priority: Major
>         Attachments: HADOOP-15446-001.patch, HADOOP-15446-002.patch
>
> Page Blobs are primarily used by HBASE.  HBASE replication, which apparently has not been used with WASB until recently, performs non-sequential reads on log files using PageBlobInputStream.  There are bugs in this stream implementation which prevent skip and seek from working properly, and eventually the stream state becomes corrupt and unusable.
>
> I believe this bug affects all releases of WASB/HADOOP.  It appears to be a day-0 bug in PageBlobInputStream.  There were similar bugs opened in the past (HADOOP-15042) but the issue was not properly fixed, and no test coverage was added.
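
The description above says skip and seek leave the stream in a corrupt state. As a rough illustration of the property a regression test would check, here is a minimal sketch using only java.io. It is not the test from the attached patches: the class and method names (SkipContractCheck, skipFully, verifySkips, makeData) are hypothetical, and a ByteArrayInputStream stands in for PageBlobInputStream so the example runs without an Azure account. The idea is to write data whose value at each offset is predictable, skip forward by varying amounts, and confirm that the next byte read matches the offset the stream should be at.

```java
import java.io.ByteArrayInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

public class SkipContractCheck {

    /** Value stored at a given offset of the (hypothetical) test data. */
    public static int expectedByteAt(long offset) {
        return (int) (offset % 251);
    }

    /** Builds test data in which every byte is derived from its own offset. */
    public static byte[] makeData(int length) {
        byte[] data = new byte[length];
        for (int i = 0; i < length; i++) {
            data[i] = (byte) expectedByteAt(i);
        }
        return data;
    }

    /**
     * Skips exactly n bytes. InputStream.skip may skip fewer bytes than
     * requested, so loop until done; fall back to read() to detect EOF.
     */
    public static void skipFully(InputStream in, long n) throws IOException {
        long remaining = n;
        while (remaining > 0) {
            long skipped = in.skip(remaining);
            if (skipped <= 0) {
                if (in.read() < 0) {
                    throw new EOFException(
                        "EOF with " + remaining + " bytes left to skip");
                }
                skipped = 1; // the read() above consumed one byte
            }
            remaining -= skipped;
        }
    }

    /**
     * After each skip, reads one byte and checks that it is the byte
     * expected at the tracked position. Returns false on the first mismatch.
     */
    public static boolean verifySkips(InputStream in, long[] skipLengths)
            throws IOException {
        long pos = 0;
        for (long len : skipLengths) {
            skipFully(in, len);
            pos += len;
            int b = in.read();
            if (b != expectedByteAt(pos)) {
                return false;
            }
            pos++; // account for the byte just read
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in stream; a real test would open a page blob instead.
        InputStream in = new ByteArrayInputStream(makeData(4096));
        System.out.println(verifySkips(in, new long[] {0, 511, 512, 1000}));
    }
}
```

A stream whose skip implementation miscounts its position would fail verifySkips as soon as the byte read no longer matches the tracked offset, which is the kind of state corruption the report describes.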

This message was sent by Atlassian JIRA

