spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ron Hu (JIRA)" <>
Subject [jira] [Commented] (SPARK-16026) Cost-based Optimizer framework
Date Thu, 01 Dec 2016 02:56:59 GMT


Ron Hu commented on SPARK-16026:

Hi Reynold, I previously worked on filter cardinality estimation using the old statistics
structure.  Now I need to refactor my code using the new basic statistics structure we agreed
on.  As I am traveling on a business trip now, I will resume my work on Monday after I return
to Bay Area.  Zhenhua is currently busy with some customer tasks this week.  He will return
to work on CBO soon.

> Cost-based Optimizer framework
> ------------------------------
>                 Key: SPARK-16026
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Reynold Xin
>              Labels: releasenotes
>         Attachments: Spark_CBO_Design_Spec.pdf
> This is an umbrella ticket to implement a cost-based optimizer framework beyond broadcast
join selection. This framework can be used to implement some useful optimizations such as
join reordering.
> The design should discuss how to break the work down into multiple, smaller logical units.
For example, changes to statistics class, system catalog, cost estimation/propagation in expressions,
cost estimation/propagation in operators can be done in decoupled pull requests.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message