spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hyukjin Kwon (Jira)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-28264) Revisiting Python / pandas UDF
Date Mon, 30 Dec 2019 09:42:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-28264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17005228#comment-17005228
] 

Hyukjin Kwon commented on SPARK-28264:
--------------------------------------

I came up with a new proposal. Please take a look guys if you guys fine some time.

> Revisiting Python / pandas UDF
> ------------------------------
>
>                 Key: SPARK-28264
>                 URL: https://issues.apache.org/jira/browse/SPARK-28264
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark, SQL
>    Affects Versions: 3.0.0
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>            Priority: Blocker
>
> In the past two years, the pandas UDFs are perhaps the most important changes to Spark
for Python data science. However, these functionalities have evolved organically, leading
to some inconsistencies and confusions among users. This document revisits UDF definition
and naming, as a result of discussions among Xiangrui, Li Jin, Hyukjin, and Reynold.
> -See document here: [https://docs.google.com/document/d/10Pkl-rqygGao2xQf6sddt0b-4FYK4g8qr_bXLKTL65A/edit#|https://docs.google.com/document/d/10Pkl-rqygGao2xQf6sddt0b-4FYK4g8qr_bXLKTL65A/edit]-
>  New proposal: https://docs.google.com/document/d/1-kV0FS_LF2zvaRh_GhkV32Uqksm_Sq8SvnBBmRyxm30/edit?usp=sharing



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message