spark-issues mailing list archives

From "Xiao Li (Jira)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-28264) Revisiting Python / pandas UDF
Date Sat, 14 Dec 2019 01:13:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-28264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiao Li updated SPARK-28264:
----------------------------
    Priority: Blocker  (was: Critical)

> Revisiting Python / pandas UDF
> ------------------------------
>
>                 Key: SPARK-28264
>                 URL: https://issues.apache.org/jira/browse/SPARK-28264
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark, SQL
>    Affects Versions: 3.0.0
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>            Priority: Blocker
>
> Over the past two years, pandas UDFs have been perhaps the most important change to Spark
> for Python data science. However, this functionality has evolved organically, leading
> to some inconsistencies and confusion among users. This document revisits UDF definition
> and naming, as a result of discussions among Xiangrui, Li Jin, Hyukjin, and Reynold.
>  
> See document here: https://docs.google.com/document/d/10Pkl-rqygGao2xQf6sddt0b-4FYK4g8qr_bXLKTL65A/edit
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

