I ran it on a single job.SparkLens has an overhead on the job duration. I'm not ready to enable it by default on all our jobs.Attached is the output.Still trying to understand what exactly it means.On Sun, Mar 25, 2018 at 10:40 AM, Fawze Abujaber <firstname.lastname@example.org> wrote:Nice!Shmuel, Were you able to run on a cluster level or for a specific job?Did you configure it on the spark-default.conf?On Sun, 25 Mar 2018 at 10:34 Shmuel Blitz <email@example.com> wrote:Just to let you know, I have managed to run SparkLens on our cluster.I switched to the spark_1.6 branch, and also compiled against the specific image of Spark we are using (cdh5.7.6).Now I need to figure out what the output means... :PShmuelOn Fri, Mar 23, 2018 at 7:24 PM, Fawze Abujaber <firstname.lastname@example.org> wrote:Quick question:how to add the --jars /path/to/sparklens_2.11-0.1.0.
jar to the spark-default conf, should it be using:spark.driver.extraClassPath /path/to/sparklens_2.11-0.1.0. jaror i should use spark.jars option? anyone who could give an example how it should be, and if i the path for the jar should be an hdfs path as i'm using it in cluster mode.
On Fri, Mar 23, 2018 at 6:33 AM, Fawze Abujaber <email@example.com> wrote:Hi Shmuel,Did you compile the code against the right branch for Spark 1.6.I tested it and it looks working and now i'm testing the branch for a wide tests, Please use the branch for Spark 1.6On Fri, Mar 23, 2018 at 12:43 AM, Shmuel Blitz <firstname.lastname@example.org> wrote:Hi Rohit,Thanks for sharing this great tool.I tried running a spark job with the tool, but it failed with an IncompatibleClassChangeError Exception.I have opened an issue on Github.(https://github.com/qub
ole/sparklens/issues/1)ShmuelOn Thu, Mar 22, 2018 at 5:05 PM, Shmuel Blitz <email@example.com> wrote:Thanks.We will give this a try and report back.Shmuel--On Thu, Mar 22, 2018 at 4:22 PM, Rohit Karlupia <firstname.lastname@example.org> wrote:Thanks everyone!Please share how it works and how it doesn't. Both help.Fawaze, just made few changes to make this work with spark 1.6. Can you please try building from branch spark_1.6thanks,rohitkOn Thu, Mar 22, 2018 at 10:18 AM, Fawze Abujaber <email@example.com> wrote:It's super amazing .... i see it was tested on spark 2.0.0 and above, what about Spark 1.6 which is still part of Cloudera's main versions?We have a vast Spark applications with version 1.6.0On Thu, Mar 22, 2018 at 6:38 AM, Holden Karau <firstname.lastname@example.org> wrote:--Super exciting! I look forward to digging through it this weekend.On Wed, Mar 21, 2018 at 9:33 PM ☼ R Nair (रविशंकर नायर) <email@example.com> wrote:Excellent. You filled a missing link.Best,PassionOn Wed, Mar 21, 2018 at 11:36 PM, Rohit Karlupia <firstname.lastname@example.org> wrote:Hi,Happy to announce the availability of Sparklens as open source project. It helps in understanding the scalability limits of spark applications and can be a useful guide on the path towards tuning applications for lower runtime or cost.Please clone from here: https://github.com/qubole/spar klensthanks,rohitkPS: Thanks for the patience. It took couple of months to get back on this.------