spark-user mailing list archives

From Michael Armbrust <mich...@databricks.com>
Subject Re: SQL COUNT DISTINCT
Date Mon, 03 Nov 2014 21:41:45 GMT
On Mon, Nov 3, 2014 at 12:45 AM, Bojan Kostic <blood9raven@gmail.com> wrote:
>
> But will this improvement also help when you want to count distinct values
> on 2 or more fields:
> SELECT COUNT(f1), COUNT(DISTINCT f2), COUNT(DISTINCT f3),
>        COUNT(DISTINCT f4)
> FROM parquetFile
>

Unfortunately, I think this case may be harder for us to optimize, though
it could be possible with some work.
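
One way to picture the difficulty is the rough sketch below. It is plain
Python for illustration only, not how Spark SQL executes the query, and the
function name count_with_distincts is made up for this example. Each
DISTINCT column needs its own set of seen values, so the state that has to be
kept grows with the number of distinct columns and their cardinality.

```python
def count_with_distincts(rows):
    """Naive single-pass evaluation of
    SELECT COUNT(f1), COUNT(DISTINCT f2), COUNT(DISTINCT f3), COUNT(DISTINCT f4).

    Illustrative only: rows are dicts, and None stands in for SQL NULL.
    """
    f1_count = 0
    # One set per DISTINCT column; this is the per-column state
    # that makes the multi-column case more expensive.
    seen = {"f2": set(), "f3": set(), "f4": set()}
    for row in rows:
        # COUNT(f1) skips NULLs.
        if row.get("f1") is not None:
            f1_count += 1
        # COUNT(DISTINCT col) also skips NULLs.
        for col, values in seen.items():
            value = row.get(col)
            if value is not None:
                values.add(value)
    return (f1_count, len(seen["f2"]), len(seen["f3"]), len(seen["f4"]))


rows = [
    {"f1": 1, "f2": "a", "f3": 10, "f4": "x"},
    {"f1": 2, "f2": "a", "f3": 20, "f4": None},
    {"f1": None, "f2": "b", "f3": 10, "f4": "x"},
]
print(count_with_distincts(rows))  # (2, 2, 2, 1)
```

With a single DISTINCT column only one such set is needed, which is part of
why that case is simpler to handle.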


> Should I still create a Jira issue/improvement for this?
>

Yes please.
