hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adam Szita (JIRA)" <j...@apache.org>
Subject [jira] [Work started] (HIVE-21015) HCatLoader can't provide statistics for tables no in default DB
Date Thu, 06 Dec 2018 13:35:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Work on HIVE-21015 started by Adam Szita.
-----------------------------------------
> HCatLoader can't provide statistics for tables no in default DB
> ---------------------------------------------------------------
>
>                 Key: HIVE-21015
>                 URL: https://issues.apache.org/jira/browse/HIVE-21015
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Adam Szita
>            Assignee: Adam Szita
>            Priority: Major
>
> This is due to a former change (HIVE-20330) that does not take database into consideration
when retrieving the proper InputJobInfo for the loader.
> Found during testing:
> *07:52:56* 2018-12-05 07:52:16,599 [main] WARN  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator
- Couldn't get statistics from LoadFunc: org.apache.hive.hcatalog.pig.HCatLoader@492fa72a*07:52:56*
java.io.IOException: java.io.IOException: Could not calculate input size for location (table)
tpcds_3000_decimal_parquet.date_dim*07:52:56* 	at org.apache.hive.hcatalog.pig.HCatLoader.getStatistics(HCatLoader.java:281)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getInputSizeFromLoader(InputSizeReducerEstimator.java:171)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getTotalInputFileSize(InputSizeReducerEstimator.java:118)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getTotalInputFileSize(InputSizeReducerEstimator.java:97)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.estimateNumberOfReducers(InputSizeReducerEstimator.java:80)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.estimateNumberOfReducers(JobControlCompiler.java:1148)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.calculateRuntimeReducers(JobControlCompiler.java:1115)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.adjustNumReducers(JobControlCompiler.java:1063)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:564)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.compile(JobControlCompiler.java:333)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:221)*07:52:56*
	at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:293)*07:52:56*
	at org.apache.pig.PigServer.launchPlan(PigServer.java:1475)*07:52:56* 	at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1460)*07:52:56*
	at org.apache.pig.PigServer.storeEx(PigServer.java:1119)*07:52:56* 	at org.apache.pig.PigServer.store(PigServer.java:1082)*07:52:56*
	at org.apache.pig.PigServer.openIterator(PigServer.java:995)*07:52:56* 	at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:782)*07:52:56*
	at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:383)*07:52:56*
	at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)*07:52:56*
	at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)*07:52:56*
	at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:81)*07:52:56* 	at org.apache.pig.Main.run(Main.java:630)*07:52:56*
	at org.apache.pig.Main.main(Main.java:175)*07:52:56* 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native
Method)*07:52:56* 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)*07:52:56*
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)*07:52:56*
	at java.lang.reflect.Method.invoke(Method.java:498)*07:52:56* 	at org.apache.hadoop.util.RunJar.run(RunJar.java:313)*07:52:56*
	at org.apache.hadoop.util.RunJar.main(RunJar.java:227)*07:52:56* Caused by: java.io.IOException:
Could not calculate input size for location (table) tpcds_3000_decimal_parquet.date_dim*07:52:56*
	at org.apache.hive.hcatalog.pig.HCatLoader.getStatistics(HCatLoader.java:276)*07:52:56* 
... 29 more



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message