flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kurt Young (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (FLINK-11711) Add table and column stats
Date Fri, 01 Mar 2019 01:37:00 GMT

     [ https://issues.apache.org/jira/browse/FLINK-11711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Kurt Young updated FLINK-11711:
-------------------------------
    Component/s:     (was: API / Table SQL)
                 SQL / Planner

> Add table and column stats
> --------------------------
>
>                 Key: FLINK-11711
>                 URL: https://issues.apache.org/jira/browse/FLINK-11711
>             Project: Flink
>          Issue Type: New Feature
>          Components: SQL / Planner
>            Reporter: godfrey he
>            Assignee: godfrey he
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> We define two structure mode to hold statistics
> 1. TableStats: statistics for table level, contains 2 elements:
> rowCount: Long // the number of row count of table
> colStats: Map[String, ColumnStats] // map each column to its ColumnStats
> 2. ColumnStats: statistics for column level, contains 6 elements:
> ndv: Long // number of distinct values
> nullCount: Long // number of null values
> avgLen: Double // average length of column values
> maxLen: Integer // max length of column values
> max: Any // max value of column values
> min: Any // min value of column values



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message