sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hari Shreedharan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-761) HDFSTextExportExtractor loses lines around partition boundaries
Date Sat, 15 Dec 2012 02:06:15 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Hari Shreedharan updated SQOOP-761:
-----------------------------------

    Attachment: SQOOP-761.patch

Uncommented unit tests. All unit tests pass now. Fixed multiple issues:
* HdfsTextExportExtractor was using FileSystem.getPos() method - which was causing missing
data in uncompressed files - so now explicitly calculate the size.

* SequenceFile was reading data multiple times, since HdfsSequenceExportExtractor was reading
a file till the end of a file, ignoring the end parameter.

* Added forking to the unit tests, and added more memory.

*Uncommented the tests in TestHdfsExtract
                
> HDFSTextExportExtractor loses lines around partition boundaries
> ---------------------------------------------------------------
>
>                 Key: SQOOP-761
>                 URL: https://issues.apache.org/jira/browse/SQOOP-761
>             Project: Sqoop
>          Issue Type: Bug
>            Reporter: Hari Shreedharan
>            Priority: Blocker
>             Fix For: 1.99.1
>
>         Attachments: SQOOP-761-missingdata.txt, SQOOP-761.patch
>
>
> Blocker for 1.99 release

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message