hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitry Tolpeko (JIRA)" <>
Subject [jira] [Commented] (HIVE-17253) Adding SUMMARY statement to HPL/SQL
Date Sun, 24 Sep 2017 20:59:00 GMT


Dmitry Tolpeko commented on HIVE-17253:

Committed the patch. The SUMMARY statement can output summary based on a custom query, but
I agree later we can utilize metastore stats to speed up the statement execution.  

> Adding SUMMARY statement to HPL/SQL
> -----------------------------------
>                 Key: HIVE-17253
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: hpl/sql
>            Reporter: Dmitry Tolpeko
>            Assignee: Dmitry Tolpeko
>             Fix For: 3.0.0
>         Attachments: HIVE-17253.1.patch
> Adding SUMMARY statement to HPL/SQL to describe a data set (table, query result) similar
to Python and R.
> For each column output the data type, number of distinct values, non-NULL rows, mean,
std, percentiles, min, max. Output additional stats for categorical columns. This helps perform
quick and easy exploratory data analysis for SQL devs and business users.

This message was sent by Atlassian JIRA

View raw message