spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "utk.pat" <>
Subject Performance - Python streaming v/s Scala streaming
Date Mon, 24 Aug 2015 12:22:27 GMT
I am new to SPARK streaming. I was running the "kafka_wordcount" example with
a local KAFKA and SPARK instance. It was very easy to set this up and get
going :)I tried running both SCALA and Python versions of the word count
example. Python versions seems to be extremely slow. Sometimes it has delays
of more than couple of minutes. On the other hand SCALA versions seems to be
way better. I am running on a windows machine.I am trying to understand what
is the cause slowness in python streaming? Is there anything that I am
missing? For real time streaming analysis should I prefer SCALA?

View this message in context:
Sent from the Apache Spark User List mailing list archive at
View raw message