spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Artz <michaelea...@gmail.com>
Subject Re: Dataframe vs dataset
Date Sat, 28 Apr 2018 14:25:37 GMT
Ok from the language you used, you are saying kind of that Dataset is a
subset of Dataframe.  I would disagree because to me a DataFrame is just a
Dataset of org.spache.spark.sql.Row

On Sat, Apr 28, 2018, 8:34 AM Marco Mistroni <mmistroni@gmail.com> wrote:

> Imho .neither..I see datasets as typed df and therefore ds are enhanced df
> Feel free to disagree..
> Kr
>
> On Sat, Apr 28, 2018, 2:24 PM Michael Artz <michaeleartz@gmail.com> wrote:
>
>> Hi,
>>
>> I use Spark everyday and I have a good grip on the basics of Spark, so
>> this question isnt for myself.  But this came up and I wanted to see what
>> other Spark users would say, and I dont want to influence your answer.  And
>> SO is weird about polls. The question is
>>
>>  "Which one do you feel is accurate... Dataset is a subset of DataFrame,
>> or DataFrame a subset of Dataset?"
>>
>

Mime
View raw message