hadoop-common-issues mailing list archives

From "ASF GitHub Bot (Jira)" <j...@apache.org>
Subject [jira] [Work logged] (HADOOP-16202) Stabilize openFile() and adopt internally
Date Fri, 16 Oct 2020 10:55:00 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-16202?focusedWorklogId=501516&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-501516 ]

ASF GitHub Bot logged work on HADOOP-16202:

                Author: ASF GitHub Bot
            Created on: 16/Oct/20 10:54
            Start Date: 16/Oct/20 10:54
    Worklog Time Spent: 10m 
      Work Description: steveloughran commented on pull request #2168:
URL: https://github.com/apache/hadoop/pull/2168#issuecomment-709975189

   I'm wondering whether distcp should leave off the file length.
   Why: if someone updates a file between the distcp job being scheduled and it being executed,
the length will be different. I know people aren't "meant" to do that, but inevitably some
already do.
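
The concern above can be illustrated with a minimal sketch using only the JDK. It is not distcp code: the class `StaleLengthDemo` and its two helper methods are hypothetical names chosen for illustration, and a local temporary file stands in for the source file. It records a length at "scheduling" time, modifies the file, and shows that the recorded length no longer matches.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical illustration only; none of these names come from distcp.
public class StaleLengthDemo {

    /** Length as a copy planner might record it when the job is scheduled. */
    public static long recordLengthAtScheduling(Path source) throws IOException {
        return Files.size(source);
    }

    /** True if the file no longer has the length recorded at scheduling time. */
    public static boolean lengthChanged(Path source, long recordedLength) throws IOException {
        return Files.size(source) != recordedLength;
    }

    public static void main(String[] args) throws IOException {
        Path source = Files.createTempFile("distcp-source", ".txt");
        try {
            Files.write(source, "v1".getBytes(StandardCharsets.UTF_8));
            long recorded = recordLengthAtScheduling(source);

            // Someone appends to the file before the copy task actually runs.
            Files.write(source, "-more".getBytes(StandardCharsets.UTF_8),
                    StandardOpenOption.APPEND);

            System.out.println("recorded=" + recorded
                    + " actual=" + Files.size(source)
                    + " changed=" + lengthChanged(source, recorded));
        } finally {
            Files.deleteIfExists(source);
        }
    }
}
```

If a stale length like this were passed to the open call as the declared file length, the reader would be working from out-of-date metadata, which is the reason for considering leaving it off.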

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:

Issue Time Tracking

    Worklog Id:     (was: 501516)
    Time Spent: 1h 50m  (was: 1h 40m)

> Stabilize openFile() and adopt internally
> -----------------------------------------
>                 Key: HADOOP-16202
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16202
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs, fs/s3, tools/distcp
>    Affects Versions: 3.3.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
> The {{openFile()}} builder API lets us add new options when reading a file.
> Add an option {{"fs.s3a.open.option.length"}} which takes a long and allows the length
> of the file to be declared. If set, *no check for the existence of the file is issued when
> opening the file*.
> Also: change withFileStatus() to take any FileStatus implementation, rather than only S3AFileStatus,
> and to not check that the path matches the path being opened. This is needed to support viewFS-style
> wrapping and mounting.
> Adopt where appropriate to stop clusters with S3A reads switched to random IO from
> killing download/localization:
> * fs shell copyToLocal
> * distcp
> * IOUtils.copy
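
The effect of declaring the length can be sketched with a toy model in plain Java. This is not Hadoop or S3A code: `CountingStore` and `OpenBuilder` are hypothetical classes that only loosely imitate the shape of a builder API, and the only name taken from the issue is the option key. The sketch assumes that opening a file normally costs one metadata probe, and shows that probe being skipped when a length is supplied.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model only; these classes are hypothetical and are not part of Hadoop.
public class DeclaredLengthSketch {

    /** Option key quoted from the issue description. */
    public static final String OPT_LENGTH = "fs.s3a.open.option.length";

    /** In-memory store that counts how many existence/length probes it serves. */
    public static class CountingStore {
        private final Map<String, byte[]> objects = new HashMap<>();
        public int probes = 0;

        public void put(String path, byte[] data) {
            objects.put(path, data);
        }

        /** Stands in for the metadata request a store client would issue. */
        public long probeLength(String path) {
            probes++;
            byte[] data = objects.get(path);
            if (data == null) {
                throw new IllegalArgumentException("No such file: " + path);
            }
            return data.length;
        }
    }

    /** Builder that collects options before "opening" a path. */
    public static class OpenBuilder {
        private final CountingStore store;
        private final String path;
        private final Map<String, String> options = new HashMap<>();

        public OpenBuilder(CountingStore store, String path) {
            this.store = store;
            this.path = path;
        }

        public OpenBuilder opt(String key, long value) {
            options.put(key, Long.toString(value));
            return this;
        }

        /** Returns the length the reader will use; probes only if none was declared. */
        public long build() {
            String declared = options.get(OPT_LENGTH);
            return declared != null ? Long.parseLong(declared) : store.probeLength(path);
        }
    }

    public static void main(String[] args) {
        CountingStore store = new CountingStore();
        store.put("/data/part-0000", new byte[128]);

        long probed = new OpenBuilder(store, "/data/part-0000").build();
        System.out.println("probed length=" + probed + " probes=" + store.probes);

        long declared = new OpenBuilder(store, "/data/part-0000").opt(OPT_LENGTH, 128L).build();
        System.out.println("declared length=" + declared + " probes=" + store.probes);
    }
}
```

In this model the trade-off is visible directly: a declared length saves a round trip, but a missing file or a wrong length is not detected at open time.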

This message was sent by Atlassian Jira

To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org
