hive-issues mailing list archives

From "Jason Dere (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-18513) Query results caching
Date Wed, 01 Aug 2018 17:32:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-18513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16565684#comment-16565684 ]

Jason Dere commented on HIVE-18513:
-----------------------------------

Created a simple patch for this at HIVE-20250, but got some pushback from [~hagleitn] over
concerns that customers would misunderstand some of the settings. Will leave more comments
at HIVE-20250.

Will try to add some docs when I get a chance. There aren't any memory management settings
(the results are kept in HDFS in the results cache directory), but there are settings related to
the size of the results kept by a single Hive instance (hive.query.results.cache.max.size, hive.query.results.cache.max.entry.size).
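
For illustration only, the two size settings named above might appear in hive-site.xml roughly as
follows. This is a sketch, not taken from the patch: the byte values are placeholders chosen for
the example rather than documented defaults, and the descriptions only paraphrase the comment above.

```xml
<!-- Sketch of a hive-site.xml fragment; values are illustrative assumptions. -->
<property>
  <name>hive.query.results.cache.max.size</name>
  <!-- Assumed example: cap on total cached results kept by one Hive instance, in bytes -->
  <value>2147483648</value>
</property>
<property>
  <name>hive.query.results.cache.max.entry.size</name>
  <!-- Assumed example: cap on the size of a single cached result, in bytes -->
  <value>10485760</value>
</property>
```

Since the cached results live in HDFS rather than in memory, these limits bound disk usage in the
results cache directory, not heap usage.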

> Query results caching
> ---------------------
>
>                 Key: HIVE-18513
>                 URL: https://issues.apache.org/jira/browse/HIVE-18513
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Planning
>            Reporter: Jason Dere
>            Assignee: Jason Dere
>            Priority: Major
>             Fix For: 3.0.0
>
>         Attachments: HIVE-18513.1.patch, HIVE-18513.2.patch, HIVE-18513.3.patch, HIVE-18513.4.patch,
HIVE-18513.5.patch, HIVE-18513.6.patch
>
>
> Add a query results cache that can save the results of an executed Hive query for reuse
on subsequent queries. This may be useful in cases where the same query is issued many
times, since Hive can return the results of a cached query rather than having to execute
the full query on the cluster.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
