hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-12309) TableScan should use column stats when available for better data size estimate
Date Sun, 01 Nov 2015 02:12:27 GMT


Hive QA commented on HIVE-12309:

Here are the results of testing the latest attachment:

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 5 failed/errored test(s), 9742 tests executed
*Failed tests:*
TestMarkPartition - did not produce a TEST-*.xml file

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 5 tests failed

This message is automatically generated.

ATTACHMENT ID: 12769963 - PreCommit-HIVE-TRUNK-Build

> TableScan should use column stats when available for better data size estimate
> ------------------------------------------------------------------------------
>                 Key: HIVE-12309
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Statistics
>            Reporter: Ashutosh Chauhan
>            Assignee: Ashutosh Chauhan
>         Attachments: HIVE-12309.2.patch, HIVE-12309.patch
> Currently, all other operators use column stats to figure out data size, whereas TableScan
relies on rawDataSize. This inconsistency can result in an inconsistency where TS may have
lower Datasize then subsequent operators.

This message was sent by Atlassian JIRA

View raw message