spark-user mailing list archives: August 2015

Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · Next »Thread · Author · Date
Francis Lau Sporadic "Input validation failed" error when executing LogisticRegressionWithLBFGS.train Tue, 11 Aug, 21:56
Francis Lau Re: How to specify column type when saving DataFrame as parquet file? Fri, 14 Aug, 16:03
Ganelin, Ilya RE: How to read gzip data in Spark - Simple question Thu, 06 Aug, 05:27
Ganelin, Ilya RE: Issue when rebroadcasting a variable outside of the definition scope Fri, 07 Aug, 15:56
Garry Chen Spark ec2 lunch problem Fri, 21 Aug, 14:55
Garry Chen RE: Spark ec2 lunch problem Fri, 21 Aug, 15:15
Garry Chen RE: Spark ec2 lunch problem Mon, 24 Aug, 12:45
Garry Chen Spark-Ec2 lunch failed on starting httpd spark 141 Tue, 25 Aug, 14:39
Garry Chen start master failed with error Mon, 31 Aug, 16:02
Gaurav Agarwal spark kafka partitioning Fri, 21 Aug, 02:48
Gaurav Agarwal Re: spark kafka partitioning Fri, 21 Aug, 06:41
Gaurav Agarwal sparkStreaming how to work with partitions,how tp create partition Sat, 22 Aug, 10:39
Gavin Yue Any quick method to sample rdd based on one filed? Thu, 27 Aug, 21:27
Gavin Yue How to increase the Json parsing speed Fri, 28 Aug, 01:58
Gavin Yue Re: How to increase the Json parsing speed Fri, 28 Aug, 04:08
Gavin Yue Re: How to increase the Json parsing speed Fri, 28 Aug, 07:05
Gerald Loeffler miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 08:45
Gerard Maas Re: Writing streaming data to cassandra creates duplicates Tue, 04 Aug, 11:24
Giri P Re: query avro hive table in spark sql Thu, 27 Aug, 16:41
Giri P Re: query avro hive table in spark sql Thu, 27 Aug, 16:45
Giri P Re: query avro hive table in spark sql Thu, 27 Aug, 18:15
Giri P Re: query avro hive table in spark sql Fri, 28 Aug, 21:06
Gourav Sengupta Re: Is there any tool that i can prove to customer that spark is faster then hive ? Wed, 12 Aug, 13:42
Gourav Sengupta Re: blogs/articles/videos on how to analyse spark performance Wed, 19 Aug, 15:34
Gourav Sengupta Re: Exception when S3 path contains colons Tue, 25 Aug, 11:47
Guru Medasani Re: Topology.py -- Cannot run on Spark Gateway on Cloudera 5.4.4. Tue, 04 Aug, 02:27
Guru Medasani Re: Spark-Submit error Tue, 04 Aug, 03:08
Guru Medasani Re: Spark-Submit error Tue, 04 Aug, 04:24
Guru Medasani Re: Spark + Jupyter (IPython Notebook) Tue, 18 Aug, 15:38
Guru Medasani Re: Spark + Jupyter (IPython Notebook) Wed, 19 Aug, 05:23
Guy Hadash Spark SQL Partition discovery - schema evolution Tue, 18 Aug, 08:39
Hafiz Mujadid giving offset in spark sql Tue, 04 Aug, 14:04
Hafiz Mujadid Writing test case for spark streaming checkpointing Thu, 27 Aug, 15:14
Hafiz Mujadid Re: Is there any way to connect cassandra without spark-cassandra connector? Fri, 28 Aug, 06:43
Han JU Re: How to distribute non-serializable object in transform task or broadcast ? Fri, 07 Aug, 15:28
Hao Ren How to distribute non-serializable object in transform task or broadcast ? Fri, 07 Aug, 09:39
Hao Ren ClosureCleaner does not work for java code Mon, 10 Aug, 15:32
Hari Shreedharan Re: Spark Streaming failing on YARN Cluster Wed, 19 Aug, 16:50
Haripriya Ayyalasomayajula Re: Wish for 1.4: upper bound on # tasks in Mesos Tue, 11 Aug, 06:26
Haripriya Ayyalasomayajula Re: Controlling number of executors on Mesos vs YARN Tue, 11 Aug, 06:38
Haripriya Ayyalasomayajula Re: Controlling number of executors on Mesos vs YARN Tue, 11 Aug, 13:21
Harsha HN Difference btw MEMORY_ONLY and MEMORY_AND_DISK Tue, 18 Aug, 07:15
Hayri Volkan Agun Fwd: MLLIB MulticlassMetrics Unable to find class key Wed, 05 Aug, 09:55
Hayri Volkan Agun Label based MLLib MulticlassMetrics is buggy Wed, 05 Aug, 13:19
Heath Guo Pause Spark Streaming reading or sampling streaming data Wed, 05 Aug, 23:50
Heath Guo Re: Pause Spark Streaming reading or sampling streaming data Thu, 06 Aug, 00:31
Hemant Bhanawat Re: Partitioning in spark streaming Wed, 12 Aug, 04:35
Hemant Bhanawat Re: How to minimize shuffling on Spark dataframe Join? Wed, 12 Aug, 05:02
Hemant Bhanawat Re: grouping by a partitioned key Wed, 12 Aug, 06:54
Hemant Bhanawat Re: What is the Effect of Serialization within Stages? Thu, 13 Aug, 04:35
Hemant Bhanawat Re: grouping by a partitioned key Thu, 13 Aug, 05:29
Hemant Bhanawat Re: Streaming on Exponential Data Fri, 14 Aug, 06:00
Hemant Bhanawat Re: Apache Spark - Parallel Processing of messages from Kafka - Java Mon, 17 Aug, 04:33
Hemant Bhanawat Re: registering an empty RDD as a temp table in a PySpark SQL context Tue, 18 Aug, 07:27
Hemant Bhanawat Re: Regarding rdd.collect() Tue, 18 Aug, 07:37
Hemant Bhanawat Re: Regarding rdd.collect() Tue, 18 Aug, 09:11
Hemant Bhanawat Re: global variable in spark streaming with no dependency on key Tue, 18 Aug, 09:17
Hemant Bhanawat Re: persist for DStream Thu, 20 Aug, 08:41
Hemant Bhanawat Re: How to overwrite partition when writing Parquet? Thu, 20 Aug, 08:59
Hemant Bhanawat Re: spark.sql.shuffle.partitions=1 seems to be working fine but creates timeout for large skewed data Thu, 20 Aug, 09:13
Hemant Bhanawat Re: spark.sql.shuffle.partitions=1 seems to be working fine but creates timeout for large skewed data Thu, 20 Aug, 12:29
Hemant Bhanawat Re: PySpark concurrent jobs using single SparkContext Fri, 21 Aug, 08:47
Hemant Bhanawat Re: How to set environment of worker applications Sun, 23 Aug, 14:46
Hemant Bhanawat Re: How to set environment of worker applications Mon, 24 Aug, 07:30
Hemant Bhanawat Re: Joining using mulitimap or array Mon, 24 Aug, 13:54
Hemant Bhanawat Re: How to set environment of worker applications Tue, 25 Aug, 06:57
Hemant Bhanawat Re: Exception throws when running spark pi in Intellij Idea that scala.collection.Seq is not found Tue, 25 Aug, 07:15
Hemant Bhanawat Re: Performance issue with Spark join Wed, 26 Aug, 10:02
Hemminger Jeff Alternative to Large Broadcast Variables Fri, 28 Aug, 10:39
Hemminger Jeff Re: Alternative to Large Broadcast Variables Sun, 30 Aug, 02:27
Hien Luu Re: Spark job workflow engine recommendations Fri, 07 Aug, 15:49
Hien Luu Re: Newbie question: what makes Spark run faster than MapReduce Fri, 07 Aug, 16:29
Hien Luu Re: Spark job workflow engine recommendations Fri, 07 Aug, 18:23
Hien Luu Re: Spark job workflow engine recommendations Tue, 11 Aug, 17:30
Holden Karau Re: QueueStream Does Not Support Checkpointing Fri, 14 Aug, 23:03
Holden Karau Re: types allowed for saveasobjectfile? Thu, 27 Aug, 21:10
Holden Karau Re: types allowed for saveasobjectfile? Thu, 27 Aug, 21:31
Holden Karau Re: tweet transformation ideas Fri, 28 Aug, 03:44
Huang, Jie [SparkScore]Performance portal for Apache Spark - WW31 Mon, 03 Aug, 01:21
Hyukjin Kwon Inquery about contributing codes Tue, 11 Aug, 03:02
Ian Wood Checkpointing in Spark without Streaming Mon, 31 Aug, 20:39
Igor Berman Re: How to increase parallelism of a Spark cluster? Sun, 02 Aug, 17:52
Igor Berman Re: How to increase parallelism of a Spark cluster? Sun, 02 Aug, 21:41
Igor Berman Re: About memory leak in spark 1.4.1 Mon, 03 Aug, 11:56
Igor Berman Re: About memory leak in spark 1.4.1 Tue, 04 Aug, 11:13
Igor Berman Re: Combining Spark Files with saveAsTextFile Wed, 05 Aug, 06:47
Igor Berman Re: Combining Spark Files with saveAsTextFile Wed, 05 Aug, 10:06
Igor Berman Re: Enum values in custom objects mess up RDD operations Thu, 06 Aug, 08:59
Igor Berman Re: All masters are unresponsive! Giving up. Fri, 07 Aug, 18:08
Igor Berman Re: Spark Job Hangs on our production cluster Tue, 11 Aug, 20:31
Igor Berman Re: how do I execute a job on a single worker node in standalone mode Tue, 18 Aug, 11:49
Igor Berman Re: Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 11:36
Igor Berman Re: Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 11:58
Igor Berman Re: blogs/articles/videos on how to analyse spark performance Wed, 19 Aug, 16:30
Igor Berman Re: Help Explain Tasks in WebUI:4040 Mon, 31 Aug, 07:06
Igor Berman Re: spark-submit issue Mon, 31 Aug, 07:11
Igor Berman Re: spark-submit issue Mon, 31 Aug, 08:09
Igor Berman Re: Parallel execution of RDDs Mon, 31 Aug, 14:07
Ilya Karpov Joining using mulitimap or array Mon, 24 Aug, 09:21
Ilya Karpov Re: Joining using mulitimap or array Mon, 24 Aug, 13:04