spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Emlyn Corrin (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-8480) Add setName for Dataframe
Date Thu, 23 Feb 2017 12:30:44 GMT

    [ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15880338#comment-15880338
] 

Emlyn Corrin commented on SPARK-8480:
-------------------------------------

If anyone just wants a way to identify the RDDs in the storage tab, it turns out it's already
possible to name them by creating a temporary view (see https://github.com/apache/spark/pull/16609#issuecomment-281865742).

> Add setName for Dataframe
> -------------------------
>
>                 Key: SPARK-8480
>                 URL: https://issues.apache.org/jira/browse/SPARK-8480
>             Project: Spark
>          Issue Type: Wish
>          Components: SQL
>    Affects Versions: 1.4.0
>            Reporter: Peter Rudenko
>            Priority: Minor
>
> Rdd has a method setName, so in spark UI, it's more easily to understand what's this
cache for. E.g. ("data for LogisticRegression model", etc.). Would be nice to have the same
method for Dataframe, since it displays a logical schema, in cache page, which could be quite
big.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message