spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chen Jin <>
Subject The performance of group operation on SSD
Date Wed, 15 Jan 2014 22:04:49 GMT
Dear Spark developers:

We are benchmarking spark operations such as filter, group, join on
ssd instance i2.2xlarge on EC2. Most operations are similar or
slightly better than ephemeral disks on EC2, however, the performance
of group operation on SDD  are much worse than regular disks, at least
2x to 3x worse. Could any of you shed some lights on this behavior?

Thanks a lot,


View raw message