spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Xiangrui Meng <men...@gmail.com>
Subject Re: MLLib Decision Tress algorithm hangs, others fine
Date Tue, 11 Nov 2014 18:08:58 GMT
Could you provide more information? For example, spark version,
dataset size (number of instances/number of features), cluster size,
error messages from both the drive and the executor. -Xiangrui

On Mon, Nov 10, 2014 at 11:28 AM, tsj <tsjnsn@gmail.com> wrote:
> Hello all,
>
> I have some text data that I am running different algorithms on.
> I had no problems with LibSVM and Naive Bayes on the same data,
> but when I run Decision Tree, the execution hangs in the middle
> of DecisionTree.trainClassifier(). The only difference from the example
> given on the site is that I am using 6 categories instead of 2, and the
> input is text that is transformed to labeled points using TF-IDF. It
> halts shortly after this log output:
>
> spark.SparkContext: Job finished: collect at DecisionTree.scala:1347, took
> 1.019579676 s
>
> Any ideas as to what could be causing this?
>
>
>
> --
> View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/MLLib-Decision-Tress-algorithm-hangs-others-fine-tp18515.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
> For additional commands, e-mail: user-help@spark.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message