spark-user mailing list archives: April 2014

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · Next »Thread · Author · Date
Jeyaraj, Arockia R (Arockia) RE: To Ten RDD Wed, 09 Apr, 15:18
Jim Blomo Re: pySpark memory usage Wed, 09 Apr, 22:52
Jim Blomo Re: pySpark memory usage Thu, 10 Apr, 02:04
Jim Blomo Re: Spark - ready for prime time? Sun, 13 Apr, 15:33
Jim Blomo Finding bad data Fri, 25 Apr, 01:15
Jim Blomo Re: pySpark memory usage Mon, 28 Apr, 20:01
Jim Carroll Continuously running non-streaming jobs Thu, 17 Apr, 18:02
Jim Carroll Re: Continuously running non-streaming jobs Thu, 17 Apr, 19:11
Jim Carroll stdout in workers Mon, 21 Apr, 17:59
Joe L what is the difference between persist() and cache()? Sun, 13 Apr, 14:26
Joe L how to use a single filter instead of multiple filters Sun, 13 Apr, 16:39
Joe L how to count maps without shuffling too much data? Mon, 14 Apr, 02:53
Joe L How to set spark worker memory size? Mon, 14 Apr, 03:02
Joe L how to count maps within a node? Mon, 14 Apr, 03:58
Joe L Proper caching method Mon, 14 Apr, 12:32
Joe L shuffle vs performance Tue, 15 Apr, 03:47
Joe L groupByKey returns a single partition in a RDD? Tue, 15 Apr, 07:22
Joe L what is the difference between element and partition? Wed, 16 Apr, 04:53
Joe L groupByKey(None) returns partitions according to the keys? Wed, 16 Apr, 04:58
Joe L Could I improve Spark performance partitioning elements in a RDD? Wed, 16 Apr, 06:30
Joe L what is a partition? how it works? Wed, 16 Apr, 07:29
Joe L choose the number of partition according to the number of nodes Wed, 16 Apr, 23:50
Joe L Re: choose the number of partition according to the number of nodes Thu, 17 Apr, 00:40
Joe L join with inputs co-partitioned? Fri, 18 Apr, 05:40
Joe L how to split one big RDD (about 100G) into several small ones? Fri, 18 Apr, 11:58
Joe L efficient joining Sun, 20 Apr, 03:55
Joe L evaluate spark Sun, 20 Apr, 21:03
Joe L Spark is slow Mon, 21 Apr, 18:23
Joe L Re: Spark is slow Tue, 22 Apr, 02:42
Joe L help me Tue, 22 Apr, 11:15
Joe L help Wed, 23 Apr, 09:04
Joe L read file from hdfs Fri, 25 Apr, 12:38
Joe L strange error Fri, 25 Apr, 14:10
Joe L help Fri, 25 Apr, 18:17
Joe L Re: help Fri, 25 Apr, 19:32
Joe L help Sun, 27 Apr, 17:17
Joe L spark running examples error Mon, 28 Apr, 02:32
Joe L getting an error Mon, 28 Apr, 09:43
Joe L RE: help Tue, 29 Apr, 01:25
John King Re: Deploying a python code on a spark EC2 cluster Thu, 24 Apr, 18:36
John King Trying to use pyspark mllib NaiveBayes Thu, 24 Apr, 18:38
John King Spark mllib throwing error Thu, 24 Apr, 18:49
John King Re: Spark mllib throwing error Thu, 24 Apr, 20:55
John King Re: Trying to use pyspark mllib NaiveBayes Thu, 24 Apr, 20:57
John King Re: Deploying a python code on a spark EC2 cluster Thu, 24 Apr, 21:00
John King Re: Trying to use pyspark mllib NaiveBayes Thu, 24 Apr, 23:04
John King Re: Spark mllib throwing error Thu, 24 Apr, 23:05
John King Re: Trying to use pyspark mllib NaiveBayes Thu, 24 Apr, 23:14
John King Re: Spark mllib throwing error Thu, 24 Apr, 23:56
John King Running out of memory Naive Bayes Sat, 26 Apr, 02:06
John King Re: Running out of memory Naive Bayes Sat, 26 Apr, 12:49
John King Re: Running out of memory Naive Bayes Sun, 27 Apr, 22:33
John Meagher Re: Spark is slow Mon, 21 Apr, 18:54
John Salvatier How are exceptions in map functions handled in Spark? Fri, 04 Apr, 17:40
John Salvatier Re: How are exceptions in map functions handled in Spark? Fri, 04 Apr, 18:49
John Salvatier Re: How are exceptions in map functions handled in Spark? Fri, 04 Apr, 18:57
Jonathan Chayat Using Spark in IntelliJ Scala Console Sat, 26 Apr, 17:47
Jonathan Chayat Re: Using Spark in IntelliJ Scala Console Sat, 26 Apr, 20:05
Jonathan Chayat Re: Using Spark in IntelliJ Scala Console Sun, 27 Apr, 05:35
Josh Mahonin Re: Spark and HBase Fri, 25 Apr, 19:17
Josh Mahonin Re: Spark and HBase Sat, 26 Apr, 14:00
Josh Marcus Re: Is Spark a good choice for geospatial/GIS applications? Is a community volunteer needed in this area? Wed, 23 Apr, 18:24
Josh Rosen Re: Build times for Spark Fri, 25 Apr, 20:27
K Koh Efficient way to aggregate event data at daily/weekly/monthly level Thu, 03 Apr, 00:22
Kalpit Shah Re: Changing number of workers for benchmarking purposes Sat, 12 Apr, 16:31
Kanwaldeep Re: Using ProtoBuf 2.5 for messages with Spark Streaming Tue, 01 Apr, 17:36
Kanwaldeep Re: Using ProtoBuf 2.5 for messages with Spark Streaming Tue, 01 Apr, 18:26
Kanwaldeep Re: Using ProtoBuf 2.5 for messages with Spark Streaming Wed, 09 Apr, 20:03
Kanwaldeep KafkaInputDStream Stops reading new messages Wed, 09 Apr, 20:57
Ken Ellinwood /bin/java not found: JAVA_HOME ignored launching shark executor Thu, 10 Apr, 19:02
Ken Ellinwood Re: /bin/java not found: JAVA_HOME ignored launching shark executor Thu, 10 Apr, 19:14
Kevin Markey Re: Is there a way to get the current progress of the job? Tue, 01 Apr, 18:18
Kevin Markey Re: Job initialization performance of Spark standalone mode vs YARN Thu, 03 Apr, 18:19
Koert Kuipers Re: Generic types and pair RDDs Tue, 01 Apr, 21:06
Koert Kuipers ui broken in latest 1.0.0 Mon, 07 Apr, 20:06
Koert Kuipers Re: ui broken in latest 1.0.0 Mon, 07 Apr, 20:21
Koert Kuipers RDDInfo visibility SPARK-1132 Tue, 08 Apr, 00:05
Koert Kuipers Re: RDDInfo visibility SPARK-1132 Tue, 08 Apr, 00:54
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 16:33
Koert Kuipers assumption that lib_managed is present Tue, 08 Apr, 16:54
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 16:55
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 16:57
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 17:07
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 18:13
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 18:25
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 18:26
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 20:20
Koert Kuipers Re: ui broken in latest 1.0.0 Tue, 08 Apr, 20:30
Koert Kuipers Re: Anyone using value classes in RDDs? Fri, 18 Apr, 23:13
Koert Kuipers Re: ui broken in latest 1.0.0 Sat, 19 Apr, 14:45
Koert Kuipers Re: Storage information about an RDD from the API Tue, 29 Apr, 16:41
Koert Kuipers Re: Spark: issues with running a sbt fat jar due to akka dependencies Tue, 29 Apr, 19:37
Konstantin Kudryavtsev Re: Spark output compression on HDFS Thu, 03 Apr, 19:28
Konstantin Kudryavtsev Re: how to save RDD partitions in different folders? Fri, 04 Apr, 15:05
Konstantin Kudryavtsev Re: Spark output compression on HDFS Fri, 04 Apr, 15:06
Konstantin Kudryavtsev Re: is it possible to initiate Spark jobs from Oozie? Thu, 10 Apr, 09:56
Konstantin Kudryavtsev Re: Pig on Spark Thu, 10 Apr, 10:07
Kostiantyn Kudriavtsev Spark output compression on HDFS Wed, 02 Apr, 19:18
Kostiantyn Kudriavtsev Re: using saveAsNewAPIHadoopFile with OrcOutputFormat Wed, 16 Apr, 17:15
Krakna H Status of MLI? Wed, 02 Apr, 02:38
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · Next »Thread · Author · Date
Box list
Jun 2019193
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137