spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jagadesh Kiran N (JIRA)" <>
Subject [jira] [Commented] (SPARK-26860) RangeBetween docs appear to be wrong
Date Wed, 13 Feb 2019 17:10:00 GMT


Jagadesh Kiran N commented on SPARK-26860:

I will the below statements to differentiate the same and raise the PR  

ROWS BETWEEN doesn't care about the exact values. It cares only about the order of rows, and
takes fixed number of preceding and following rows when computing frame.
RANGE BETWEEN considers values when computing frame.

> RangeBetween docs appear to be wrong 
> -------------------------------------
>                 Key: SPARK-26860
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark
>    Affects Versions: 2.4.0
>            Reporter: Shelby Vanhooser
>            Priority: Major
>              Labels: docs, easyfix, python
>   Original Estimate: 1h
>  Remaining Estimate: 1h
> The docs describing [RangeBetween|]
for PySpark appear to be duplicates of [RowsBetween|]
even though these are functionally different windows.  Rows between reference proceeding
and succeeding rows, but rangeBetween is based on the values in these rows.  

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message