Miles Crawford created SPARK-17669:
--------------------------------------
Summary: Strange UI behavior using Datasets
Key: SPARK-17669
URL: https://issues.apache.org/jira/browse/SPARK-17669
Project: Spark
Issue Type: Bug
Components: SQL, Web UI
Affects Versions: 2.0.0
Reporter: Miles Crawford
I recently migrated my application to Spark 2.0, and everything worked well except for one
function that uses "toDS()" and the ML libraries.
The stage for that function used to complete in about 15 minutes on 1.6.2 and now takes
almost two hours.
The UI shows very strange behavior: completed stages are still being worked on, and work
proceeds concurrently on many stages, including ones from downstream jobs:
https://dl.dropboxusercontent.com/u/231152/spark.png
The only source change I made was replacing "toDF()" with "toDS()" before handing my RDDs
to the ML libraries.
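
For readers who want to see the shape of the change described above, here is a minimal sketch. It is not the reporter's code: the `Doc` case class, the sample data, and the choice of `Tokenizer` as the ML stage are all assumptions made for illustration, and the sketch assumes a local Spark 2.0 session. It only shows the two conversion calls side by side and is not expected to reproduce the slowdown.

```scala
import org.apache.spark.ml.feature.Tokenizer
import org.apache.spark.sql.SparkSession

// Hypothetical record type; the report does not show the real schema.
case class Doc(id: Long, text: String)

object ToDsSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("toDS-sketch")
      .master("local[*]")
      .getOrCreate()
    // Brings toDF() and toDS() into scope for RDDs of case classes.
    import spark.implicits._

    // Hypothetical input data standing in for the application's RDD.
    val rdd = spark.sparkContext.parallelize(
      Seq(Doc(1L, "a b c"), Doc(2L, "d e")))

    // Before the migration: convert the RDD to an untyped DataFrame.
    val df = rdd.toDF()

    // After the migration: convert the RDD to a typed Dataset[Doc].
    val ds = rdd.toDS()

    // An arbitrary ML stage (an assumption); in Spark 2.0, transform
    // accepts any Dataset, so both inputs go through the same call.
    val tokenizer = new Tokenizer()
      .setInputCol("text")
      .setOutputCol("words")

    tokenizer.transform(df).show()
    tokenizer.transform(ds).show()

    spark.stop()
  }
}
```

Running both variants of the real job and comparing their stages in the UI would be one way to check whether the conversion call alone accounts for the difference.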