spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Davies Liu <dav...@databricks.com>
Subject Re: Python:Streaming Question
Date Wed, 31 Dec 2014 01:43:07 GMT
There is a known bug with local scheduler, will be fixed by
https://github.com/apache/spark/pull/3779

On Sun, Dec 21, 2014 at 10:57 PM, Samarth Mailinglist
<mailinglistsamarth@gmail.com> wrote:
> I’m trying to run the stateful network word count at
> https://github.com/apache/spark/blob/master/examples/src/main/python/streaming/stateful_network_wordcount.py
> using the command:
>
> ./bin/spark-submit
> examples/src/main/python/streaming/stateful_network_wordcount.py localhost
> 9999
>
> I am also running netcat at the same time (prior to running the above
> command):
>
> nc -lk 9999
>
> However, no wordcount is printed (even though pprint() is being called).
>
> How do I print the results?
> How do I otherwise access the data at real time? Suppose I want to have a
> dashboard showing the data in running_counts?
>
> Note that
> https://github.com/apache/spark/blob/master/examples/src/main/python/streaming/network_wordcount.py
> works perfectly fine.
>
> Running Spark 1.2.0, hadoop 2.4.x prebuilt
>
> Thanks,
> Samarth

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message