spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Navya Krishnappa (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-19442) Unable to add column to the dataset using Dataset.WithColumn() api
Date Tue, 14 Feb 2017 06:08:41 GMT

    [ https://issues.apache.org/jira/browse/SPARK-19442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15865135#comment-15865135
] 

Navya Krishnappa commented on SPARK-19442:
------------------------------------------

If the source file has 3 columns

Name	Age	    Address
Abc	       10	    Bangalore
Xyz	       10	   Bangalore

After adding new column say "State". Resultant dataset should be 
 
Name	Age	    Address    State
Abc	       10	    Bangalore
Xyz	       10	   Bangalore



> Unable to add column to the dataset using Dataset.WithColumn() api
> ------------------------------------------------------------------
>
>                 Key: SPARK-19442
>                 URL: https://issues.apache.org/jira/browse/SPARK-19442
>             Project: Spark
>          Issue Type: Bug
>          Components: Java API
>    Affects Versions: 2.0.2
>            Reporter: Navya Krishnappa
>
> When I'm creating a new column using Dataset.WithColumn() api, Analysis Exception is
thrown.
> Dataset.WithColumn() api: 
> Dataset.withColumn("newColumnName', new org.apache.spark.sql.Column("newColumnName").cast("int"));
> Stacktrace: 
> cannot resolve '`NewColumn`' given input columns: [abc,xyz ]



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message