spark-user mailing list archives

From wasauce <>
Subject How to timeout a task?
Date Sat, 27 Jun 2015 15:33:33 GMT

We use PySpark to run a set of data extractors (think regexes). The extractors
generally run quickly and find a few matches, which are returned and stored
in a database.

My question: is it possible to give the function that runs the extractors a
timeout? That is, if the extractor runs for more than X seconds on a given
file, it terminates and returns a default value.
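
To make the behaviour I am after concrete, here is a rough sketch of one
possible shape for it. It is an illustration only, not the code from our job.
It assumes a Unix worker where the function runs in the main thread of the
Python process, because it relies on SIGALRM from the standard signal module.
The names run_with_timeout, ExtractorTimeout, slow_extractor and
fast_extractor are hypothetical.

```python
import signal
import time


class ExtractorTimeout(Exception):
    """Raised by the alarm handler when the time limit expires."""


def run_with_timeout(func, arg, seconds, default=None):
    """Call func(arg) and return its result, or `default` on timeout.

    Assumption: Unix only, and called from the main thread of the
    process, since it is built on SIGALRM.
    """
    def _on_alarm(signum, frame):
        raise ExtractorTimeout()

    previous = signal.signal(signal.SIGALRM, _on_alarm)
    signal.setitimer(signal.ITIMER_REAL, seconds)  # accepts fractions
    try:
        return func(arg)
    except ExtractorTimeout:
        return default
    finally:
        signal.setitimer(signal.ITIMER_REAL, 0)   # cancel any pending alarm
        signal.signal(signal.SIGALRM, previous)   # restore the old handler


def slow_extractor(text):
    # Hypothetical stand-in for an extractor that runs far too long.
    time.sleep(5)
    return ["never returned"]


def fast_extractor(text):
    # Hypothetical stand-in for a well-behaved extractor.
    return text.split()
```

In a job this might be applied per record with something like
rdd.map(lambda doc: run_with_timeout(fast_extractor, doc, 10, default=[])),
assuming the worker satisfies the conditions above. Whether the alarm can
interrupt a long-running native call depends on that call checking for
signals, so this would need testing against the real extractors.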

Here is a code snippet of what we are doing, with comments marking the
function I would like to time out.


Thank you

- Bill
