spark-user mailing list archives

From Vitaliy Pisarev <vitaliy.pisa...@biocatch.com>
Subject Do values adjacent to exploded columns get duplicated?
Date Wed, 07 Mar 2018 11:24:09 GMT
This is a fairly basic question, but I could not find an answer to it
anywhere online:

Suppose I have the following data frame (a and b are column names):

a      |       b
---------------
1      |    [x1,x2,x3,x4] # this is an array


Now I explode column b and logically get:

a      |       b
---------------
1      |      x1
1      |      x2
1      |      x3
1      |      x4
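
For reference, the logical result above can be modelled in plain Python. This is only a sketch of the row-level semantics I am describing, not how Spark implements explode internally; the `explode_rows` helper and the list-of-dicts representation are hypothetical and chosen purely for illustration.

```python
from typing import Any, Dict, Iterator, List


def explode_rows(rows: List[Dict[str, Any]], column: str) -> Iterator[Dict[str, Any]]:
    """Yield one output row per element of the array stored in `column`.

    Hypothetical helper: models the logical output only, not Spark internals.
    """
    for row in rows:
        for element in row[column]:
            # Copy every other column as-is and replace the array with one element.
            yield {**row, column: element}


# The single-row data frame from the example: a = 1, b = [x1, x2, x3, x4].
rows = [{"a": 1, "b": ["x1", "x2", "x3", "x4"]}]
exploded = list(explode_rows(rows, "b"))

for row in exploded:
    print(row["a"], "|", row["b"])
```

In this model the value of a is repeated in every output row, which matches the table above at the logical level. It says nothing about what Spark does physically.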

Are the values in the adjacent columns (here, a) *actually* duplicated for
each exploded row?
