hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-21340) CBO: Prune non-key columns feeding into a SemiJoin
Date Fri, 01 Mar 2019 09:25:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-21340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16781498#comment-16781498
] 

Hive QA commented on HIVE-21340:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12960682/HIVE-21340.2.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 15824 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[semijoin] (batchId=121)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/16306/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/16306/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-16306/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12960682 - PreCommit-HIVE-Build

> CBO: Prune non-key columns feeding into a SemiJoin
> --------------------------------------------------
>
>                 Key: HIVE-21340
>                 URL: https://issues.apache.org/jira/browse/HIVE-21340
>             Project: Hive
>          Issue Type: Bug
>          Components: CBO
>    Affects Versions: 4.0.0
>            Reporter: Gopal V
>            Assignee: Vineet Garg
>            Priority: Major
>         Attachments: HIVE-21340.1.patch, HIVE-21340.2.patch
>
>
> {code}
> explain cbo 
> with ss as 
> (select count(1), ss_item_sk, ss_ticket_number from 
>             store_sales group by ss_item_sk, ss_ticket_number 
>             having count(1) > 1) 
> select count(1) from item where i_item_sk IN (select ss_item_sk from ss);
> {code}
> Notice the {{HiveProject(ss_item_sk=[$0], ss_ticket_number=[$1], $f2=[$2])}} 
> Only ss_item_sk is relevant for the HiveSemiJoin
> {code}
> CBO PLAN:
> HiveAggregate(group=[{}], agg#0=[count()])
>   HiveSemiJoin(condition=[=($0, $1)], joinType=[inner])
>     HiveProject(i_item_sk=[$0])
>       HiveFilter(condition=[IS NOT NULL($0)])
>         HiveTableScan(table=[[tpcds_copy_orc_partitioned_10000, item]], table:alias=[item])
>     HiveProject(ss_item_sk=[$0], ss_ticket_number=[$1], $f2=[$2])
>       HiveFilter(condition=[>($2, 1)])
>         HiveAggregate(group=[{1, 8}], agg#0=[count()])
>           HiveFilter(condition=[IS NOT NULL($1)])
>             HiveTableScan(table=[[tpcds_copy_orc_partitioned_10000, store_sales]], table:alias=[store_sales])
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message