spark-user mailing list archives: August 2016

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · Next »Thread · Author · Date
Davies Liu Re: Spark 1.6.2 can read hive tables created with sqoop, but Spark 2.0.0 cannot Tue, 09 Aug, 22:01
Davies Liu Re: DataFrame equivalent to RDD.partionByKey Tue, 09 Aug, 22:42
Davies Liu Re: Spark SQL concurrent runs fails with java.util.concurrent.TimeoutException: Futures timed out after [300 seconds] Fri, 19 Aug, 21:08
Davies Liu Re: OOM with StringIndexer, 800m rows & 56m distinct value column Fri, 19 Aug, 21:33
Davies Liu Re: OOM with StringIndexer, 800m rows & 56m distinct value column Fri, 19 Aug, 21:34
Deepak Sharma Re: What are using Spark for Tue, 02 Aug, 18:13
Deepak Sharma Re: Spark jobs failing due to java.lang.OutOfMemoryError: PermGen space Thu, 04 Aug, 14:43
Deepak Sharma Re: Spark jobs failing due to java.lang.OutOfMemoryError: PermGen space Thu, 04 Aug, 14:54
Deepak Sharma Long running tasks in stages Sat, 06 Aug, 17:31
Deepak Sharma Re: Is Spark right for my use case? Mon, 08 Aug, 07:36
Deepak Sharma Re: What are the configurations needs to connect spark and ms-sql server? Mon, 08 Aug, 08:03
Deepak Sharma Best practises around spark-scala Mon, 08 Aug, 15:11
Deepak Sharma Re: Best practises around spark-scala Mon, 08 Aug, 15:46
Deepak Sharma Re: Spark join and large temp files Mon, 08 Aug, 18:31
Deepak Sharma Re: SPARK SQL READING FROM HIVE Mon, 08 Aug, 18:51
Deepak Sharma Use cases around image/video processing in spark Wed, 10 Aug, 15:20
Deepak Sharma Re: Spark 2.0 - Join statement compile error Tue, 23 Aug, 05:02
Deepak Sharma Re: Spark 2.0 - Join statement compile error Tue, 23 Aug, 06:31
Deepak Sharma Re: Controlling access to hive/db-tables while using SparkSQL Tue, 30 Aug, 15:26
Denis Bolshakov Re: How to acess the WrappedArray Mon, 29 Aug, 10:44
Denis Bolshakov Re: After calling persist, why the size in sparkui is not matching with the actual file size Mon, 29 Aug, 15:32
Denny Lee Re: Spark GraphFrames Tue, 02 Aug, 17:41
Devi P.V How to connect Power BI to Apache Spark on local machine? Thu, 04 Aug, 06:54
Devi P.V What are the configurations needs to connect spark and ms-sql server? Mon, 08 Aug, 07:44
Devi P.V Spark MLlib:Collaborative Filtering Wed, 24 Aug, 08:28
Devi P.V Re: Spark MLlib:Collaborative Filtering Thu, 25 Aug, 04:43
Devi P.V Re: How to install spark with s3 on AWS? Fri, 26 Aug, 12:13
Dibyendu Bhattacharya Latest Release of Receiver based Kafka Consumer for Spark Streaming. Thu, 25 Aug, 11:33
Dibyendu Bhattacharya Re: Latest Release of Receiver based Kafka Consumer for Spark Streaming. Thu, 25 Aug, 13:45
Divya Gehlot Spark GraphFrames Tue, 02 Aug, 05:50
Divya Gehlot [Spark1.6]:compare rows and add new column based on lookup Fri, 05 Aug, 02:16
Divya Gehlot Re: [Spark1.6]:compare rows and add new column based on lookup Fri, 05 Aug, 02:48
Divya Gehlot [Spark1.6] Or (||) operator not working in DataFrame Sun, 07 Aug, 14:43
Divya Gehlot Re: [Spark1.6] Or (||) operator not working in DataFrame Mon, 08 Aug, 03:36
Divya Gehlot [Spark 1.6]-increment value column based on condition + Dataframe Tue, 09 Aug, 12:34
Divya Gehlot Re: Getting a TreeNode Exception while saving into Hadoop Thu, 18 Aug, 02:42
Diwakar Dhanuskodi Spark streaming not processing messages from partitioned topics Tue, 09 Aug, 20:47
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 03:20
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 04:51
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 04:56
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 07:07
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 14:40
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Wed, 10 Aug, 19:15
Diwakar Dhanuskodi Re: Spark streaming not processing messages from partitioned topics Thu, 11 Aug, 18:33
Diwakar Dhanuskodi KafkaUtils.createStream not picking smallest offset Fri, 12 Aug, 08:35
Diwakar Dhanuskodi Re: KafkaUtils.createStream not picking smallest offset Sun, 14 Aug, 01:52
Diwakar Dhanuskodi RE: [Spark 2.0] ClassNotFoundException is thrown when using Hive Thu, 18 Aug, 11:56
Diwakar Dhanuskodi createDirectStream parallelism Thu, 18 Aug, 14:22
Diwakar Dhanuskodi Spark streaming Fri, 19 Aug, 02:04
Diwakar Dhanuskodi Best way to read XML data from RDD Fri, 19 Aug, 20:07
Diwakar Dhanuskodi Re: Best way to read XML data from RDD Sat, 20 Aug, 04:41
Diwakar Dhanuskodi Re: Best way to read XML data from RDD Mon, 22 Aug, 10:49
Diwakar Dhanuskodi Re: Best way to read XML data from RDD Mon, 22 Aug, 10:52
Diwakar Dhanuskodi Re: Best way to read XML data from RDD Mon, 22 Aug, 10:53
Diwakar Dhanuskodi Re: Best way to read XML data from RDD Mon, 22 Aug, 15:29
Diwakar Dhanuskodi Spark build 1.6.2 error Tue, 30 Aug, 20:30
Dominik Safaric Spark Streaming fault tolerance benchmark Sat, 13 Aug, 14:50
Don Drake Spark 2.0 - Parquet data with fields containing periods "." Wed, 31 Aug, 17:48
Dragisa Krsmanovic [2.0.0] mapPartitions on DataFrame unable to find encoder Tue, 02 Aug, 20:55
Dragisa Krsmanovic Re: [2.0.0] mapPartitions on DataFrame unable to find encoder Tue, 02 Aug, 23:59
Dragisa Krsmanovic Re: [2.0.0] mapPartitions on DataFrame unable to find encoder Wed, 03 Aug, 16:35
Efe Selcuk Spark2 SBT Assembly Wed, 10 Aug, 17:39
Efe Selcuk Re: Spark2 SBT Assembly Wed, 10 Aug, 17:50
Efe Selcuk Re: Spark2 SBT Assembly Wed, 10 Aug, 21:59
Efe Selcuk Re: Spark2 SBT Assembly Thu, 11 Aug, 18:29
Efe Selcuk [Spark2] Error writing "complex" type to CSV Thu, 18 Aug, 21:32
Efe Selcuk Re: [Spark2] Error writing "complex" type to CSV Fri, 19 Aug, 00:27
Efe Selcuk Re: [Spark2] Error writing "complex" type to CSV Fri, 19 Aug, 17:54
Efe Selcuk "Schemaless" Spark Fri, 19 Aug, 21:54
Eike von Seggern Re: pyspark pickle error when using itertools.groupby Fri, 05 Aug, 09:24
Eric Ho how to do nested loops over 2 arrays but use Two RDDs instead ? Mon, 15 Aug, 18:12
Eric Ho How to do nested for-each loops across RDDs ? Mon, 15 Aug, 20:15
Eric Ho Re: How to do nested for-each loops across RDDs ? Mon, 15 Aug, 20:29
Eric Ho Do we still need to use Kryo serializer in Spark 1.6.2 ? Mon, 22 Aug, 18:00
Eric Ho Spark to Kafka communication encrypted ? Wed, 31 Aug, 07:03
Ethan Aubin Pyspark SQL 1.6.0 write problem Thu, 25 Aug, 15:00
Evan Chan [Community] Python support added to Spark Job Server Wed, 17 Aug, 17:04
Evan Zamir Re: How to add custom steps to Pipeline models? Mon, 15 Aug, 04:27
Everett Anderson Plans for improved Spark DataFrame/Dataset unit testing? Mon, 01 Aug, 16:02
Everett Anderson Re: Java and SparkSession Fri, 05 Aug, 18:03
Everett Anderson Submitting jobs to YARN from outside EMR -- config & S3 impl Mon, 15 Aug, 16:20
Everett Anderson Re: Plans for improved Spark DataFrame/Dataset unit testing? Fri, 19 Aug, 23:25
Everett Anderson Re: Plans for improved Spark DataFrame/Dataset unit testing? Sun, 21 Aug, 16:30
Everett Anderson S3A + EMR failure when writing Parquet? Sun, 28 Aug, 19:51
Everett Anderson Re: S3A + EMR failure when writing Parquet? Sun, 28 Aug, 23:19
Everett Anderson Re: S3A + EMR failure when writing Parquet? Mon, 29 Aug, 17:18
Everett Anderson Does Spark on YARN inherit or replace the Hadoop/YARN configs? Tue, 30 Aug, 17:38
Ewan Leith Re: Spark 2.0.0 - Apply schema on few columns of dataset Mon, 08 Aug, 05:56
Ewan Leith Re: zip for pyspark Mon, 08 Aug, 20:24
Felix Cheung Re: SparkR error when repartition is called Tue, 09 Aug, 09:15
Felix Cheung Re: GraphX build from JSON input Mon, 15 Aug, 21:36
Felix Cheung Re: UDF in SparkR Wed, 17 Aug, 11:12
Felix Cheung Re: pyspark unable to create UDF: java.lang.RuntimeException: org.apache.hadoop.fs.FileAlreadyExistsException: Parent path is not a directory: /tmp tmp Thu, 18 Aug, 22:37
Felix Cheung Re: pyspark unable to create UDF: java.lang.RuntimeException: org.apache.hadoop.fs.FileAlreadyExistsException: Parent path is not a directory: /tmp tmp Fri, 19 Aug, 00:16
Felix Cheung Re: Best way to read XML data from RDD Sat, 20 Aug, 04:19
Felix Cheung Re: Best way to read XML data from RDD Sat, 20 Aug, 05:05
Felix Cheung Re: Disable logger in SparkR Mon, 22 Aug, 17:40
Felix Cheung Re: spark.lapply in SparkR: Error in writeBin(batch, con, endian = "big") Mon, 22 Aug, 20:47
Felix Cheung Re: spark.lapply in SparkR: Error in writeBin(batch, con, endian = "big") Thu, 25 Aug, 12:34
Felix Cheung Re: spark.lapply in SparkR: Error in writeBin(batch, con, endian = "big") Thu, 25 Aug, 18:00
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · Next »Thread · Author · Date
Box list
Sep 202181
Aug 2021171
Jul 2021158
Jun 2021179
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137