hive-issues mailing list archives

From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-19762) Druid Queries containing Joins gives wrong results.
Date Tue, 05 Jun 2018 01:18:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-19762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16501113#comment-16501113 ]

Hive QA commented on HIVE-19762:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12926099/HIVE-19762.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 14461 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[druid_basic2] (batchId=11)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[druidmini_joins] (batchId=87)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/11511/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/11511/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-11511/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 2 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12926099 - PreCommit-HIVE-Build

> Druid Queries containing Joins gives wrong results. 
> ----------------------------------------------------
>
>                 Key: HIVE-19762
>                 URL: https://issues.apache.org/jira/browse/HIVE-19762
>             Project: Hive
>          Issue Type: Bug
>          Components: Druid integration
>            Reporter: Nishant Bangarwa
>            Assignee: Nishant Bangarwa
>            Priority: Major
>         Attachments: HIVE-19762.patch
>
>
> Druid queries that join a table against itself give wrong results. 
> e.g. 
> {code} 
> SELECT
> username AS `username`,
> SUM(double1) AS `sum_double1`
> FROM
> druid_table_with_nulls `tbl1`
>   JOIN (
>     SELECT
>     username AS `username`,
>     SUM(double1) AS `sum_double2`
>     FROM druid_table_with_nulls
>     GROUP BY `username`
>     ORDER BY `sum_double2`
>     DESC  LIMIT 10
>   )
>   `tbl2`
>     ON (`tbl1`.`username` = `tbl2`.`username`)
> GROUP BY `tbl1`.`username`;
> {code} 
> In this case, one of the queries is a Druid scan query and the other is a groupBy query. 
> During planning, the properties of each query are set on the tableDesc and serdeInfo.
> While setting up the map work, we overwrite those properties with the ones present in
> serdeInfo. As a result, the scan query results are deserialized using the wrong column
> names, which produces null values. 
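
The overwrite described above can be illustrated with a minimal Python sketch. This is a toy model, not Hive code: `deserialize`, `effective_props`, `shared_serde_info`, and the property keys are all hypothetical names, and the sketch assumes the groupBy query's properties happen to be the last ones written to the shared store. It does not describe what the attached patch actually changes.

```python
# Toy model of the property overwrite; none of these names are Hive APIs.

def deserialize(row, column_names):
    """Project a raw result row onto the expected columns.

    Columns that are absent from the row come back as None,
    which mirrors the null values seen in the bug report.
    """
    return {name: row.get(name) for name in column_names}


# Per-query properties computed during planning (illustrative keys only).
scan_props = {"query.type": "scan", "columns": ["username", "double1"]}
group_by_props = {"query.type": "groupBy", "columns": ["username", "sum_double2"]}

# Both queries read the same table, so a property store keyed by table name
# is shared between them; the later write replaces the earlier one.
TABLE = "druid_table_with_nulls"
shared_serde_info = {}
for props in (scan_props, group_by_props):  # assumed order: groupBy written last
    shared_serde_info[TABLE] = dict(props)


def effective_props(own_props, overwrite_from_shared):
    """Return the properties a reader ends up using.

    With overwrite_from_shared=True, the shared entries win over the
    query's own properties, modelling the overwrite during map work setup.
    """
    if overwrite_from_shared:
        return {**own_props, **shared_serde_info[TABLE]}
    return dict(own_props)


# A row as the scan query would return it.
scan_row = {"username": "alice", "double1": 1.5}

overwritten = deserialize(scan_row, effective_props(scan_props, True)["columns"])
isolated = deserialize(scan_row, effective_props(scan_props, False)["columns"])

print(overwritten)
print(isolated)
```

In this model the scan row is read with the groupBy query's column list once the shared properties take precedence, so the `sum_double2` lookup finds nothing and yields a null, while keeping each query's own properties reads the row correctly.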



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
