hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-20225) SerDe to support Teradata Binary Format
Date Mon, 30 Jul 2018 05:30:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-20225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16561443#comment-16561443
] 

Hive QA commented on HIVE-20225:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12933537/HIVE-20225.1.patch

{color:green}SUCCESS:{color} +1 due to 5 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 4 failed/errored test(s), 14833 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druid_timestamptz] (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_joins] (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_masking] (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_test1] (batchId=193)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/12937/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/12937/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-12937/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 4 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12933537 - PreCommit-HIVE-Build

> SerDe to support Teradata Binary Format
> ---------------------------------------
>
>                 Key: HIVE-20225
>                 URL: https://issues.apache.org/jira/browse/HIVE-20225
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Lu Li
>            Assignee: Lu Li
>            Priority: Major
>         Attachments: HIVE-20225.1.patch
>
>
> When using TPT/BTEQ to export/import Data from Teradata, Teradata will generate/require binary
files based on the schema.
> A Customized SerDe is needed in order to directly read these files from Hive or write
these files in order to load back to TD.
> {code:java}
> CREATE EXTERNAL TABLE `TABLE1`(
> ...)
> PARTITIONED BY (
> ...)
> ROW FORMAT SERDE
>   'org.apache.hadoop.hive.contrib.serde2.TeradataBinarySerde'
> STORED AS INPUTFORMAT
>  'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileInputFormat'
> OUTPUTFORMAT
>  'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileOutputFormat'
> LOCATION ...;
> SELECT * FROM `TABLE1`;{code}
> Problem Statement:
> Right now the fast way to export/import data from Teradata is using TPT. However, the
Hive could not directly utilize/generate these binary format because it doesn't have a SerDe
for these files.
> Result:
> Provided with the SerDe, Hive can operate upon/generate the exported Teradata Binary
Format file transparently



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message