hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-19326) stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect results)
Date Sat, 23 Jun 2018 04:57:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-19326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16520970#comment-16520970
] 

Hive QA commented on HIVE-19326:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12928658/HIVE-19326.08.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/12008/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/12008/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-12008/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and
output '+ date '+%Y-%m-%d %T.%3N'
2018-06-23 04:54:47.691
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-12008/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2018-06-23 04:54:47.695
+ cd apache-github-source-source
+ git fetch origin
+ git reset --hard HEAD
HEAD is now at 23d2b80 HIVE-19890: ACID: Inherit bucket-id from original ROW_ID for delete
deltas (Gopal V, reviewed by Eugene Koifman)
+ git clean -f -d
+ git checkout master
Already on 'master'
Your branch is up-to-date with 'origin/master'.
+ git reset --hard origin/master
HEAD is now at 23d2b80 HIVE-19890: ACID: Inherit bucket-id from original ROW_ID for delete
deltas (Gopal V, reviewed by Eugene Koifman)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2018-06-23 04:54:49.503
+ rm -rf ../yetus_PreCommit-HIVE-Build-12008
+ mkdir ../yetus_PreCommit-HIVE-Build-12008
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-12008
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-12008/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
Going to apply patch with: git apply -p0
/data/hiveptest/working/scratch/build.patch:360: trailing whitespace.
create table t2a  as 
/data/hiveptest/working/scratch/build.patch:495: trailing whitespace.
	numRows             	1028                
/data/hiveptest/working/scratch/build.patch:496: trailing whitespace.
	rawDataSize         	10968               
/data/hiveptest/working/scratch/build.patch:1019: trailing whitespace.
	numRows             	15                  
/data/hiveptest/working/scratch/build.patch:1020: trailing whitespace.
	rawDataSize         	3315                
warning: squelched 21 whitespace errors
warning: 26 lines add whitespace errors.
+ [[ maven == \m\a\v\e\n ]]
+ rm -rf /data/hiveptest/working/maven/org/apache/hive
+ mvn -B clean install -DskipTests -T 4 -q -Dmaven.repo.local=/data/hiveptest/working/maven
protoc-jar: executing: [/tmp/protoc7575610504107519326.exe, --version]
libprotoc 2.5.0
protoc-jar: executing: [/tmp/protoc7575610504107519326.exe, -I/data/hiveptest/working/apache-github-source-source/standalone-metastore/src/main/protobuf/org/apache/hadoop/hive/metastore,
--java_out=/data/hiveptest/working/apache-github-source-source/standalone-metastore/target/generated-sources,
/data/hiveptest/working/apache-github-source-source/standalone-metastore/src/main/protobuf/org/apache/hadoop/hive/metastore/metastore.proto]
ANTLR Parser Generator  Version 3.5.2
Output file /data/hiveptest/working/apache-github-source-source/standalone-metastore/target/generated-sources/org/apache/hadoop/hive/metastore/parser/FilterParser.java
does not exist: must build /data/hiveptest/working/apache-github-source-source/standalone-metastore/src/main/java/org/apache/hadoop/hive/metastore/parser/Filter.g
org/apache/hadoop/hive/metastore/parser/Filter.g
[ERROR] Failed to execute goal org.apache.maven.plugins:maven-remote-resources-plugin:1.5:process
(process-resource-bundles) on project hive-shims-common: Execution process-resource-bundles
of goal org.apache.maven.plugins:maven-remote-resources-plugin:1.5:process failed. ConcurrentModificationException
-> [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please read the following
articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/PluginExecutionException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn <goals> -rf :hive-shims-common
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-12008
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12928658 - PreCommit-HIVE-Build

> stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect
results)
> ---------------------------------------------------------------------------------------------
>
>                 Key: HIVE-19326
>                 URL: https://issues.apache.org/jira/browse/HIVE-19326
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>            Reporter: Sergey Shelukhin
>            Assignee: Zoltan Haindrich
>            Priority: Critical
>         Attachments: HIVE-19326.01wip01.patch, HIVE-19326.02.patch, HIVE-19326.03.patch,
HIVE-19326.04.patch, HIVE-19326.05.patch, HIVE-19326.06.patch, HIVE-19326.06wip01.patch, HIVE-19326.06wip02.patch,
HIVE-19326.06wip03.patch, HIVE-19326.06wip04.patch, HIVE-19326.06wip05.patch, HIVE-19326.07.patch,
HIVE-19326.08.patch
>
>
> Found when investigating the results change after converting tables to MM, turns out
the MM result is correct but the current one is not.
> The test ends like so:
> {noformat}
> desc formatted small_alltypesorc_a;
> ANALYZE TABLE small_alltypesorc_a COMPUTE STATISTICS;
> desc formatted small_alltypesorc_a;
> insert into table small_alltypesorc_a select * from small_alltypesorc1a;
> desc formatted small_alltypesorc_a;
> {noformat}
> The results from the descs in the golden file are:
> {noformat}
> 	COLUMN_STATS_ACCURATE	{\"BASIC_STATS\":\"true\"}
> 	numFiles            	1                   
> 	numRows             	5                               
> ...
> 	COLUMN_STATS_ACCURATE	{\"BASIC_STATS\":\"true\"}
> 	numFiles            	1                   
> 	numRows             	15                                
> ...
> 	COLUMN_STATS_ACCURATE	{\"BASIC_STATS\":\"true\"}
> 	numFiles            	2                   
> 	numRows             	20                              
> {noformat}
> Note the result change after analyze - the original nomRows is inaccurate, but  BASIC_STATS
is set to true.
> I am assuming with metadata only optimization this can produce incorrect results.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message