spark-issues mailing list archives

From "Joseph K. Bradley (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-6292) Add RDD methods to DataFrame to preserve schema
Date Wed, 11 Mar 2015 23:53:38 GMT
Joseph K. Bradley created SPARK-6292:
----------------------------------------

             Summary: Add RDD methods to DataFrame to preserve schema
                 Key: SPARK-6292
                 URL: https://issues.apache.org/jira/browse/SPARK-6292
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
    Affects Versions: 1.3.0
            Reporter: Joseph K. Bradley


Users can call RDD methods on DataFrames, but the result is no longer a DataFrame, so the
schema is lost and has to be reapplied by hand. For RDD methods that leave the schema
unchanged (such as randomSplit), DataFrame should provide its own versions that preserve
the schema automatically.
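
To illustrate the manual step described above, here is a minimal sketch of the workaround
against the 1.3 API: split the underlying RDD of rows, then rebuild a DataFrame from each
piece using the original schema. The object and method names (SchemaPreservingSplit,
randomSplitWithSchema) are hypothetical and exist only for this example; they are not
part of Spark and not necessarily what this issue would add.

```scala
import org.apache.spark.sql.{DataFrame, SQLContext}

object SchemaPreservingSplit {
  // Hypothetical helper, not part of Spark: splits via the underlying
  // RDD[Row] and reapplies the original schema to each resulting piece.
  def randomSplitWithSchema(
      sqlContext: SQLContext,
      df: DataFrame,
      weights: Array[Double],
      seed: Long): Array[DataFrame] = {
    // Capture the schema before dropping down to the RDD level.
    val schema = df.schema
    df.rdd
      .randomSplit(weights, seed)                            // Array[RDD[Row]], schema lost
      .map(part => sqlContext.createDataFrame(part, schema)) // schema reapplied
  }
}
```

A call such as SchemaPreservingSplit.randomSplitWithSchema(sqlContext, df, Array(0.8, 0.2), 42L)
would return two DataFrames with the same schema as df. A schema-preserving method on
DataFrame itself would make this boilerplate unnecessary.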



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


