Cool, learn something new every day.=C2=A0 Thanks again.
On Tue, Aug 9, 2016 at = 4:08 PM ayan guha <guha.ayan@gmai= l.com> wrote:

Thanks for reporting back. Glad it worked for you. Actually sum with parti= tioning behaviour is same in oracle too.

On 10 Aug 2016 03:01, "Jon Barksdale" = <jon.barksd= ale@gmail.com> wrote:
Hi=C2=A0Santoshakhilesh,=C2=A0

I'd seen that already, but I was trying to avoid using rdds to perform= this calculation.

@Ayan, it seems I was mistaken,= and doing a sum(b) over(order by b) totally works.=C2=A0 I guess I expecte= d the windowing with sum to work more like oracle.=C2=A0 Thanks for the sug= gestion :)

Thank you both for your help,=C2=A0

Jon

On Tue, Aug 9, 2016 at 3:01 AM Santoshakhilesh <santosh.akhilesh@huawei.com> wrote:

=C2=A0

From: Jon Bark= sdale [mailto:= jon.barksdale@gmail.com]
Sent: 09 August 2016 08:21
To: ayan guha
Cc: user
Subject: Re: Cumulative Sum function using Dataset API=

=C2=A0

I don't think that would work properly, and woul= d probably just give me the sum for each partition. I'll give it a try = when I get home just to be certain.

To maybe explain the intent better, if I have a column (pre sorted) of (1,2= ,3,4), then the cumulative sum would return (1,3,6,10).

Does that make sense? Naturally, if ordering a sum turns it into a cumulati= ve sum, I'll gladly use that :)

Jon

On Mon, Aug 8, 2016 at 4:55 PM ayan guha <guha.ayan@gmail.com&g= t; wrote:

You mean you are not able to use sum(col) over (part= ition by key order by some_col) ?

=C2=A0

--

Best Regards,
Ayan Guha

--001a1148755a2f881b0539abbb27--