spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nicholas Chammas <>
Subject Re: Unit tests in < 5 minutes
Date Fri, 05 Dec 2014 00:05:26 GMT
fwiw, when we did this work in HBase, we categorized the tests. Then some
tests can share a single jvm, while some others need to be isolated in
their own jvm. Nevertheless surefire can still run them in parallel by
starting/stopping several jvm.

I think we need to do this as well. Perhaps the test naming hierarchy can
be used to group non-parallelizable tests in the same JVM.

For example, here are some Hive tests from our project:


If we group tests by the first 5 parts of their name (e.g.
org.apache.spark.sql.hive), then we’d have the first 2 tests run in the
same JVM, and the next 2 tests each run in their own JVM.

I’m new to this stuff so I’m not sure if I’m going about this in the right
way, but you can see my attempt with this approach on GitHub
as well as the related discussion on JIRA

If anyone has more feedback on this, I’d love to hear it (either on this
thread or in the JIRA issue).


On Sun Sep 07 2014 at 8:28:51 PM Nicholas Chammas <> wrote:

> On Fri, Aug 8, 2014 at 1:12 PM, Reynold Xin <> wrote:
>> Nick,
>> Would you like to file a ticket to track this?
> SPARK-3431 <>:
> Parallelize execution of tests
> > Sub-task: SPARK-3432 <>:
> Fix logging of unit test execution time
> Nick

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message