spark-user mailing list archives: August 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · Next »Thread · Author · Date
Puneet Kapoor Re: flatMap output on disk / flatMap memory overhead Sat, 01 Aug, 09:41
Raghavendra Pandey Re: DataFrame column structure change Sat, 08 Aug, 20:27
Raghavendra Pandey Spark sql jobs n their partition Sat, 08 Aug, 20:34
Raghavendra Pandey Re: How to specify column type when saving DataFrame as parquet file? Fri, 14 Aug, 14:29
Raghavendra Pandey Re: Left outer joining big data set with small lookups Sat, 15 Aug, 02:10
Raghavendra Pandey Re: Spark Sql behaves strangely with tables with a lot of partitions Fri, 21 Aug, 18:09
Raghavendra Pandey Re: Aggregate to array (or 'slice by key') with DataFrames Fri, 21 Aug, 18:11
Raghavendra Pandey Re: How to list all dataframes and RDDs available in current session? Sat, 22 Aug, 03:45
Raghavendra Pandey Re: How to set environment of worker applications Sun, 23 Aug, 15:27
Raghavendra Pandey Re: How to set environment of worker applications Mon, 24 Aug, 15:18
Raghavendra Pandey Re: org.apache.spark.shuffle.FetchFailedException Tue, 25 Aug, 06:06
Raghavendra Pandey Re: Array Out OF Bound Exception Sat, 29 Aug, 14:35
Raghavendra Pandey Re: Alternative to Large Broadcast Variables Sat, 29 Aug, 14:40
Raghavendra Pandey Re: Spark Version upgrade isue:Exception in thread "main" java.lang.NoSuchMethodError Sun, 30 Aug, 06:27
Rajeshkumar J org.apache.spark.SparkException: Detected yarn-cluster mode, but isn't running on a cluster. Deployment to YARN is not supported directly by SparkContext. Please use spark-submit Mon, 03 Aug, 11:47
Rajeshkumar J Fwd: org.apache.spark.SparkException: Detected yarn-cluster mode, but isn't running on a cluster. Deployment to YARN is not supported directly by SparkContext. Please use spark-submit Mon, 03 Aug, 11:53
Rajeshkumar J using Convert function of sql in spark sql Tue, 25 Aug, 11:53
Ramkumar V Spark Streaming failing on YARN Cluster Thu, 13 Aug, 06:50
Ramkumar V Re: Spark Streaming failing on YARN Cluster Thu, 13 Aug, 09:03
Ramkumar V Re: Spark Streaming failing on YARN Cluster Wed, 19 Aug, 08:06
Ramkumar V Re: Spark Streaming failing on YARN Cluster Wed, 19 Aug, 09:21
Ramkumar V Re: Spark Streaming failing on YARN Cluster Wed, 19 Aug, 17:15
Ramkumar V Re: Spark Streaming failing on YARN Cluster Tue, 25 Aug, 11:35
Ranjana Rajendran Fwd: Graphx - how to add vertices to a HashSet of vertices ? Fri, 14 Aug, 18:04
Rares Vernica Set Job Descriptions for Scala application Wed, 05 Aug, 19:29
Ravi Kiran Re: Difference between Sort based and Hash based shuffle Sat, 15 Aug, 23:31
Ravi Mody Re: Failed stages and dropped executors when running implicit matrix factorization/ALS Fri, 21 Aug, 14:44
Ravisankar Mani Exception in spark Wed, 12 Aug, 03:50
Ravisankar Mani Re: Exception in spark Wed, 12 Aug, 05:34
Ravisankar Mani Re: Exception in spark Wed, 12 Aug, 05:40
Ravisankar Mani Exception in spark Fri, 14 Aug, 06:06
Ravisankar Mani Having Clause with variation and stddev Fri, 21 Aug, 12:08
Rex Xiong Is it possible to disable AM page proxy in Yarn client mode? Mon, 03 Aug, 08:52
Reynold Xin Re: Memory allocation error with Spark 1.5 Wed, 05 Aug, 17:19
Reynold Xin Re: Tungsten and sun.misc.Unsafe Fri, 21 Aug, 18:18
Reynold Xin Re: DataFrame. SparkPlan / Project serialization issue: ArrayIndexOutOfBounds. Fri, 21 Aug, 19:14
Reynold Xin Re: How to avoid shuffle errors for a large join ? Sun, 30 Aug, 02:17
Reza Zadeh Re: [MLlib] DIMSUM row similarity? Mon, 31 Aug, 20:31
Richard Marscher Re: Does RDD.cartesian involve shuffling? Tue, 04 Aug, 15:23
Richard Marscher Re: How to increase parallelism of a Spark cluster? Tue, 04 Aug, 16:06
Richard Marscher Re: Does RDD.cartesian involve shuffling? Tue, 04 Aug, 16:30
Richard Marscher Re: Repartition question Tue, 04 Aug, 17:46
Richard Marscher Re: Removing empty partitions before we write to HDFS Thu, 06 Aug, 20:21
Rick Hillegas Re: Reading xml in java using spark Mon, 31 Aug, 14:51
Rick Moritz Re: Wish for 1.4: upper bound on # tasks in Mesos Tue, 11 Aug, 08:11
Rick Moritz Strange shuffle behaviour difference between Zeppelin and Spark-shell Tue, 18 Aug, 14:38
Rick Moritz Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 06:49
Rick Moritz Re: Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 11:12
Rick Moritz Re: Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 11:42
Rick Moritz Fwd: Strange shuffle behaviour difference between Zeppelin and Spark-shell Wed, 19 Aug, 12:47
Rick Moritz Re: build spark 1.4.1 with JDK 1.6 Tue, 25 Aug, 20:59
Rick Moritz Re: build spark 1.4.1 with JDK 1.6 Tue, 25 Aug, 21:14
Rishabh Bhardwaj DataFrame column structure change Fri, 07 Aug, 08:36
Rishabh Bhardwaj Re: DataFrame column structure change Fri, 07 Aug, 09:43
Rishi Yadav Re: Can't understand the size of raw RDD and its DataFrame Sun, 16 Aug, 04:34
Rishi Yadav Re: Re: Can't understand the size of raw RDD and its DataFrame Sun, 16 Aug, 16:08
Rishi Yadav Re: Error: Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/HBaseConfiguration Sun, 16 Aug, 16:55
Rishi Yadav Re: Spark can't fetch application jar after adding it to HTTP server Sun, 16 Aug, 20:12
Rishitesh Mishra Subscribe Mon, 17 Aug, 06:23
Rishitesh Mishra Re: How to list all dataframes and RDDs available in current session? Thu, 20 Aug, 18:36
Rishitesh Mishra Re: Spark streaming multi-tasking during I/O Fri, 21 Aug, 19:21
Rishitesh Mishra Re: Spark driver locality Thu, 27 Aug, 18:53
Rishitesh Mishra Re: Spark driver locality Fri, 28 Aug, 08:55
Rishitesh Mishra Re: RDD from partitions Fri, 28 Aug, 09:35
Rishitesh Mishra Re: Spark SQL vs Spark Programming Mon, 31 Aug, 14:39
Ritesh Kumar Singh Re: Unsupported major.minor version 51.0 Tue, 11 Aug, 20:08
Ritesh Kumar Singh Re: Feasibility Project - Text Processing and Category Classification Fri, 28 Aug, 17:12
Rob Sargent Re: distributing large matrices Fri, 14 Aug, 21:36
Roberto Coluccio Fwd: [Spark + Hive + EMR + S3] Issue when reading from Hive external table backed on S3 with large amount of small files Fri, 07 Aug, 18:18
Roberto Coluccio Unable to catch SparkContext methods exceptions Mon, 24 Aug, 16:09
Roberto Coluccio Re: Unable to catch SparkContext methods exceptions Mon, 24 Aug, 17:52
Roberto Congiu Re: SPARK sql :Need JSON back isntead of roq Fri, 21 Aug, 11:01
Roberto Congiu Re: Local Spark talking to remote HDFS? Mon, 24 Aug, 19:43
Roberto Congiu Re: Local Spark talking to remote HDFS? Tue, 25 Aug, 07:22
Roberto Congiu Re: Local Spark talking to remote HDFS? Tue, 25 Aug, 18:57
Roberto Congiu Re: Where is Redgate's HDFS explorer? Sat, 29 Aug, 10:03
Roberto Congiu Re: Where is Redgate's HDFS explorer? Sat, 29 Aug, 11:43
Robin East Re: Spark return key value pair Wed, 19 Aug, 20:38
Robin East Re: Spark ec2 lunch problem Mon, 24 Aug, 12:58
Robin East Re: Build k-NN graph for large dataset Wed, 26 Aug, 11:51
Robineast Re: SparkR -Graphx Connected components Fri, 07 Aug, 12:10
Robineast Re: SparkR -Graphx Connected components Tue, 11 Aug, 08:42
Robineast Re: how to write any data (non RDD) to a file inside closure? Tue, 18 Aug, 12:41
Robineast Re: Saving and loading MLlib models as standalone (no Hadoop) Thu, 20 Aug, 21:34
Robineast Re: what determine the task size? Fri, 21 Aug, 08:11
Robineast Re: Spark GraphaX Mon, 24 Aug, 05:07
Robineast Re: Graphx CompactBuffer help Fri, 28 Aug, 10:39
Robust_spark Kmeans issues and hierarchical clustering Fri, 28 Aug, 15:03
Romi Kuntsman How to overwrite partition when writing Parquet? Wed, 19 Aug, 14:48
Romi Kuntsman Re: Issues with S3 paths that contain colons Wed, 19 Aug, 19:57
Romi Kuntsman Re: How to minimize shuffling on Spark dataframe Join? Wed, 19 Aug, 20:05
Romi Kuntsman Re: How to overwrite partition when writing Parquet? Thu, 20 Aug, 10:45
Romi Kuntsman How to remove worker node but let it finish first? Mon, 24 Aug, 06:41
Romi Kuntsman Re: Exception when S3 path contains colons Tue, 25 Aug, 12:06
Romi Kuntsman Re: How to remove worker node but let it finish first? Sun, 30 Aug, 05:40
Ruslan Dautkhanov Re: Spark Number of Partitions Recommendations Sat, 01 Aug, 21:14
Ruslan Dautkhanov Re: TCP/IP speedup Sun, 02 Aug, 01:26
Ruslan Dautkhanov Re: collect() works, take() returns "ImportError: No module named iter" Mon, 10 Aug, 22:25
Ruslan Dautkhanov Re: Spark job workflow engine recommendations Tue, 11 Aug, 18:15
Ruslan Dautkhanov Re: Spark Master HA on YARN Sun, 16 Aug, 20:59
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · Next »Thread · Author · Date
Box list
Sep 202074
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137