spark-issues mailing list archives

From "Parth Gandhi (JIRA)" <>
Subject [jira] [Commented] (SPARK-24935) Problem with Executing Hive UDF's from Spark 2.2 Onwards
Date Thu, 21 Feb 2019 15:26:00 GMT


Parth Gandhi commented on SPARK-24935:

Thank you [~gavin_hu] for reporting the issue in the first place. I have sent an email to the
Spark dev mailing list requesting that the fix be included in Spark 2.4.1. I will let you know
if there are any updates.

> Problem with Executing Hive UDF's from Spark 2.2 Onwards
> --------------------------------------------------------
>                 Key: SPARK-24935
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.2.0, 2.3.1
>            Reporter: Parth Gandhi
>            Priority: Major
> A user of the sketches library reported an
issue with the HLL Sketch Hive UDAF that seems to be a bug in Spark or Hive. Their code runs fine
in 2.1 but has an issue from 2.2 onwards. For more details on the issue, refer to
the discussion in the sketches-user list:
> [!msg/sketches-user/GmH4-OlHP9g/MW-J7Hg4BwAJ]
> On further debugging, we figured out that from 2.2 onwards, Spark's Hive UDAF support provides
partial aggregation and has removed the functionality that supported complete mode
aggregation. Thus, instead of the update method being called as expected, the merge method is
called, which throws the exception described in the sketches-user discussion above.
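
To illustrate the difference between complete mode and partial aggregation described above, here is a minimal sketch in plain Java. It is a toy model, not the Spark or Hive API: the class `UdafModeSketch` and its `Buffer` type are hypothetical, and the method names only mirror the lifecycle of Hive's `GenericUDAFEvaluator` (`iterate`, `terminatePartial`, `merge`, `terminate`). It assumes that the "update" step mentioned in the report corresponds to the per-row `iterate` path.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Toy model only: names mirror Hive's evaluator lifecycle but this is not the Hive or Spark API.
final class UdafModeSketch {

    /** Hypothetical aggregation buffer that tracks distinct values. */
    static final class Buffer {
        final Set<String> values = new HashSet<>();
    }

    /** Records which lifecycle methods were invoked, in order, by the last run. */
    static final List<String> calls = new ArrayList<>();

    // Per-row step: consumes one raw input row.
    static void iterate(Buffer buf, String row) {
        calls.add("iterate");
        buf.values.add(row);
    }

    // Emits an intermediate (partial) result for one partition.
    static Set<String> terminatePartial(Buffer buf) {
        calls.add("terminatePartial");
        return new HashSet<>(buf.values);
    }

    // Combines a partial result into a buffer; never sees raw rows.
    static void merge(Buffer buf, Set<String> partial) {
        calls.add("merge");
        buf.values.addAll(partial);
    }

    // Produces the final result (here: the distinct count).
    static int terminate(Buffer buf) {
        calls.add("terminate");
        return buf.values.size();
    }

    /** Complete mode: raw rows go straight to the final result, so merge is never called. */
    static int runComplete(List<String> rows) {
        calls.clear();
        Buffer buf = new Buffer();
        for (String row : rows) {
            iterate(buf, row);
        }
        return terminate(buf);
    }

    /** Partial aggregation: each partition is aggregated separately, then partials are merged. */
    static int runPartialThenFinal(List<List<String>> partitions) {
        calls.clear();
        List<Set<String>> partials = new ArrayList<>();
        for (List<String> partition : partitions) {
            Buffer buf = new Buffer();
            for (String row : partition) {
                iterate(buf, row);
            }
            partials.add(terminatePartial(buf));
        }
        Buffer finalBuf = new Buffer();
        for (Set<String> partial : partials) {
            merge(finalBuf, partial);
        }
        return terminate(finalBuf);
    }

    public static void main(String[] args) {
        int complete = runComplete(Arrays.asList("a", "b", "a"));
        System.out.println("complete mode result=" + complete + " calls=" + calls);

        int partial = runPartialThenFinal(Arrays.asList(
                Arrays.asList("a", "b"),
                Arrays.asList("b", "c")));
        System.out.println("partial mode result=" + partial + " calls=" + calls);
    }
}
```

In this model, a UDAF whose `merge` path was never exercised under complete mode would be reached for the first time once the engine switches to partial aggregation, which is consistent with the behavior change reported between 2.1 and 2.2.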

This message was sent by Atlassian JIRA
