Hi,
I am running a simple query which performs JOIN operation between two parquet files and it
takes around 3-4 secs and I noticed that 70% of the time is used by UNORDERED_RECEIVER.
Sample query is -
select sum(sales),week from dfs.`C:\parquet-location\F8894180-AFFB-4803-B8CF-CCF883AA5AAF-Search_Snapshot_Data.parquet`
where model_component_id in(
select model_component_id from dfs.`C:\parquet-location\poc48k.parquet`) group by week
Can we somehow reduce unordered receiver time?
Please find the below screenshot of Visualized plan
[cid:image001.png@01D2DB8D.B3B1C790]
________________________________
This message is for the designated recipient only and may contain privileged, proprietary,
or otherwise confidential information. If you have received it in error, please notify the
sender immediately and delete the original. Any other use of the e-mail by you is prohibited.
Where allowed by local law, electronic communications with Accenture and its affiliates, including
e-mail and instant messaging (including content), may be scanned by our systems for the purposes
of information security and assessment of internal compliance with Accenture policy.
______________________________________________________________________________________
www.accenture.com
|
Mime |
- Unnamed multipart/related (inline, None, 0 bytes)
- Unnamed multipart/alternative (inline, None, 0 bytes)
|