Hi,
 Recently I gave a talk on RDD data structure which gives in depth understanding of spark internals. You can watch it on youtube. Also slides are on slideshare and code is on github



Regards,
Madhukara Phatak
http://datamantra.io/