spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chen Song (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-6292) Add RDD methods to DataFrame to preserve schema
Date Tue, 17 Mar 2015 15:49:38 GMT

    [ https://issues.apache.org/jira/browse/SPARK-6292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14365370#comment-14365370
] 

Chen Song commented on SPARK-6292:
----------------------------------

Can you explain this JIRA more exactly? Or you can give an example to describe the task.

> Add RDD methods to DataFrame to preserve schema
> -----------------------------------------------
>
>                 Key: SPARK-6292
>                 URL: https://issues.apache.org/jira/browse/SPARK-6292
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 1.3.0
>            Reporter: Joseph K. Bradley
>
> Users can use RDD methods on DataFrames, but they lose the schema and need to reapply
it.  For RDD methods which preserve the schema (such as randomSplit), DataFrame should provide
versions of those methods which automatically preserve the schema.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message