spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Davies Liu <dav...@databricks.com>
Subject Re: [SparkR] is toDF() necessary
Date Sun, 17 May 2015 08:26:32 GMT
toDF() is first introduced in Scala and Python (because
createDataFrame is too long), is used in lots places, I think it's
useful.

On Fri, May 8, 2015 at 11:03 AM, Shivaram Venkataraman
<shivaram@eecs.berkeley.edu> wrote:
> Agree that toDF is not very useful. In fact it was removed from the
> namespace in a recent change
> https://github.com/apache/spark/commit/4e930420c19ae7773b138dfc7db8fc03b4660251
>
> Thanks
> Shivaram
>
> On Fri, May 8, 2015 at 1:10 AM, Sun, Rui <rui.sun@intel.com> wrote:
>
>> toDF() is defined to convert an RDD to a DataFrame. But it is just a very
>> thin wrapper of createDataFrame() by help the caller avoid input of
>> SQLContext.
>>
>> Since Scala/pySpark does not have toDF(), and we'd better keep API as
>> narrow and simple as possible. Is toDF() really necessary? Could we
>> eliminate it?
>>
>>
>>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org
For additional commands, e-mail: dev-help@spark.apache.org


Mime
View raw message