spark-user mailing list archives: January 2015

Danny Yates Can Spark benefit from Hive-like partitions? Mon, 26 Jan, 13:40
Danny Yates Re: Can Spark benefit from Hive-like partitions? Mon, 26 Jan, 16:47
Danny Yates Re: Can Spark benefit from Hive-like partitions? Mon, 26 Jan, 23:25
Danny Yates ETL process design Wed, 28 Jan, 09:40
Darin McBeath Confused about shuffle read and shuffle write Tue, 20 Jan, 19:58
Darin McBeath Confused about shuffle read and shuffle write Wed, 21 Jan, 13:38
Darin McBeath Problems saving a large RDD (1 TB) to S3 as a sequence file Fri, 23 Jan, 20:04
Darin McBeath Re: Problems saving a large RDD (1 TB) to S3 as a sequence file Fri, 23 Jan, 22:28
Dave Error for first run from iPython Notebook Mon, 19 Jan, 18:50
Dave Re: Error for first run from iPython Notebook Tue, 20 Jan, 14:48
Dave Re: Error for first run from iPython Notebook Wed, 21 Jan, 14:56
David Jones Using Spark SQL with multiple (avro) files Wed, 14 Jan, 09:34
David Jones Re: Using Spark SQL with multiple (avro) files Wed, 14 Jan, 15:53
David Jones Re: Using Spark SQL with multiple (avro) files Thu, 15 Jan, 12:57
David McWhorter configuring spark.yarn.driver.memoryOverhead on Spark 1.2.0 Mon, 12 Jan, 16:01
David McWhorter Re: configuring spark.yarn.driver.memoryOverhead on Spark 1.2.0 Mon, 12 Jan, 16:37
David Rosenstrauch Re: no snappyjava in java.library.path Mon, 12 Jan, 20:40
Davies Liu Re: Shuffle Problems in 1.2.0 Tue, 06 Jan, 08:46
Davies Liu Re: Shuffle Problems in 1.2.0 Tue, 06 Jan, 20:29
Davies Liu Re: Shuffle Problems in 1.2.0 Wed, 07 Jan, 18:53
Davies Liu Re: Is It Feasible for Spark 1.1 Broadcast to Fully Utilize the Ethernet Card Throughput? Fri, 09 Jan, 18:59
Davies Liu Re: save spark streaming output to single file on hdfs Tue, 13 Jan, 18:15
Davies Liu Re: save spark streaming output to single file on hdfs Tue, 13 Jan, 19:08
Davies Liu Re: spark crashes on second or third call first() on file Thu, 15 Jan, 17:58
Davies Liu Re: Scala vs Python performance differences Fri, 16 Jan, 19:03
Davies Liu Re: Processing .wav files in PySpark Fri, 16 Jan, 23:48
Davies Liu Re: Can I save RDD to local file system and then read it back on spark cluster with multiple nodes? Wed, 21 Jan, 00:08
Davies Liu Re: Spark 1.1 (slow, working), Spark 1.2 (fast, freezing) Wed, 21 Jan, 06:17
Davies Liu Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 07:04
Davies Liu Re: Spark 1.1 (slow, working), Spark 1.2 (fast, freezing) Wed, 21 Jan, 07:21
Davies Liu Re: Spark 1.1 (slow, working), Spark 1.2 (fast, freezing) Wed, 21 Jan, 18:33
Davies Liu Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 20:33
Davies Liu Re: Spark 1.1 (slow, working), Spark 1.2 (fast, freezing) Thu, 22 Jan, 01:20
Davies Liu Re: Using third party libraries in pyspark Fri, 23 Jan, 05:09
Davies Liu Re: Large number of pyspark.daemon processes Sat, 24 Jan, 07:52
Davies Liu Re: [documentation] Update the python example ALS of the site? Tue, 27 Jan, 18:50
Davies Liu Re: NegativeArraySizeException in pyspark when loading an RDD pickleFile Tue, 27 Jan, 18:55
Davies Liu Re: NegativeArraySizeException in pyspark when loading an RDD pickleFile Wed, 28 Jan, 18:01
Davies Liu Re: Error when get data from hive table. Use python code. Fri, 30 Jan, 04:10
Davies Liu Re: Define size partitions Fri, 30 Jan, 19:22
Dean Wampler Re: Spark Project Fails to run multicore in local mode. Thu, 08 Jan, 19:57
Dean Wampler Re: [SQL] Self join with ArrayType columns problems Mon, 26 Jan, 13:44
Debasish Das Re: Low Level Kafka Consumer for Spark Sat, 17 Jan, 04:13
Deep Pradhan No Output Sat, 17 Jan, 09:10
Deep Pradhan Re: No Output Sun, 18 Jan, 09:15
Deep Pradhan Re: No Output Sun, 18 Jan, 09:22
Deep Pradhan Bind Exception Tue, 20 Jan, 04:11
Deep Pradhan Re: Bind Exception Tue, 20 Jan, 04:22
Deep Pradhan Re: Bind Exception Tue, 20 Jan, 04:27
Deep Pradhan Re: Bind Exception Tue, 20 Jan, 05:00
Deep Pradhan While Loop Sat, 24 Jan, 05:02
Deep Pradhan Spark on Gordon Sat, 31 Jan, 14:54
Denis Mikhalkin Analyzing data from non-standard data sources (e.g. AWS Redshift) Sat, 24 Jan, 11:43
Denis Mikhalkin Re: Analyzing data from non-standard data sources (e.g. AWS Redshift) Sun, 25 Jan, 09:19
Derrick Burns Re: spark challenge: zip with next??? Fri, 30 Jan, 20:47
Dhiraj Peechara Re: Elastic allocation(spark.dynamicAllocation.enabled) results in task never being executed. Thu, 08 Jan, 03:14
Dibyendu Bhattacharya Re: Low Level Kafka Consumer for Spark Fri, 16 Jan, 06:20
Dibyendu Bhattacharya Re: Low Level Kafka Consumer for Spark Fri, 16 Jan, 06:53
Dibyendu Bhattacharya Re: Low Level Kafka Consumer for Spark Sat, 17 Jan, 05:38
Dibyendu Bhattacharya Re: ReliableKafkaReceiver stopped receiving data after WriteAheadLogBasedBlockHandler throws TimeoutException Sun, 18 Jan, 17:32
Dibyendu Bhattacharya Re: Spark Streaming with Kafka Wed, 21 Jan, 08:32
Dilip Movva Re: Joining by values Sun, 04 Jan, 04:54
Dinesh Vallabhdas A spark newbie question Sun, 04 Jan, 16:28
Divyansh Jain Saving a mllib model in Spark SQL Tue, 20 Jan, 13:34
Divyansh Jain Re: Saving a mllib model in Spark SQL Thu, 22 Jan, 05:04
Dmitriy Lyubimov Task result deserialization error (1.1.0) Wed, 21 Jan, 02:36
Eduardo Alfaia R: Spark Streaming with Kafka Sun, 18 Jan, 17:57
Eduardo Costa Alfaia KafkaWordCount Fri, 30 Jan, 18:58
Eduardo Costa Alfaia Error Compiling Sat, 31 Jan, 00:00
Eduardo Cusa Play Scala Spark Exmaple Fri, 09 Jan, 13:47
Eduardo Cusa Re: Play Scala Spark Exmaple Mon, 12 Jan, 12:32
Edwin Apache Spark broadcast error: Error sending message as driverActor is null [message = UpdateBlockInfo(BlockManagerId Thu, 22 Jan, 17:52
Emre Sevinc Exception when using HttpSolrServer (httpclient) from within Spark Streaming: java.lang.NoSuchMethodError: org.apache.http.impl.conn.SchemeRegistryFactory.createSystemDefault()Lorg/apache/http/conn/scheme/SchemeRegistry Wed, 28 Jan, 12:59
Emre Sevinc Re: Exception when using HttpSolrServer (httpclient) from within Spark Streaming: java.lang.NoSuchMethodError: org.apache.http.impl.conn.SchemeRegistryFactory.createSystemDefault()Lorg/apache/http/conn/scheme/SchemeRegistry Wed, 28 Jan, 14:19
Emre Sevinc Re: Exception when using HttpSolrServer (httpclient) from within Spark Streaming: java.lang.NoSuchMethodError: org.apache.http.impl.conn.SchemeRegistryFactory.createSystemDefault()Lorg/apache/http/conn/scheme/SchemeRegistry Wed, 28 Jan, 16:24
Emre Sevinc Re: Exception when using HttpSolrServer (httpclient) from within Spark Streaming: java.lang.NoSuchMethodError: org.apache.http.impl.conn.SchemeRegistryFactory.createSystemDefault()Lorg/apache/http/conn/scheme/SchemeRegistry Thu, 29 Jan, 08:44
Enno Shioji Re: Big performance difference between "client" and "cluster" deployment mode; is this expected? Thu, 01 Jan, 07:52
Enno Shioji Better way of measuring custom application metrics Sat, 03 Jan, 23:47
Enno Shioji Re: Better way of measuring custom application metrics Sun, 04 Jan, 09:45
Enno Shioji TestSuiteBase based unit test using a sliding window join timesout Wed, 07 Jan, 11:16
Enno Shioji Re: Registering custom metrics Thu, 08 Jan, 16:30
Enno Shioji Re: Problems with Spark Core 1.2.0 SBT project in IntelliJ Tue, 13 Jan, 21:45
Enno Shioji Re: Problems with Spark Core 1.2.0 SBT project in IntelliJ Wed, 14 Jan, 07:52
Eric Zhen Driver hangs on running mllib word2vec Mon, 05 Jan, 07:18
Eric Zhen Re: Driver hangs on running mllib word2vec Tue, 06 Jan, 03:47
Eric Zhen Re: Driver hangs on running mllib word2vec Tue, 06 Jan, 06:59
Ethan Wolf Spark Framework handling of Mesos master change Mon, 12 Jan, 20:44
Evan R. Sparks Re: Spark on teradata? Thu, 08 Jan, 20:01
Federico Ragona Worker never used by our Spark applications Mon, 26 Jan, 08:58
Federico Ragona Worker never used by our Spark applications Mon, 26 Jan, 09:49
Felix C Re: Error for first run from iPython Notebook Tue, 20 Jan, 22:35
Felix C Re: Using third party libraries in pyspark Fri, 23 Jan, 06:53
Felix C RE: schemaRDD.saveAsParquetFile creates large number of small parquet files ... Fri, 30 Jan, 07:02
Fengyun RAO Re: How to share a NonSerializable variable among tasks in the same worker node? Wed, 21 Jan, 06:10
Fengyun RAO spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 06:39
Fengyun RAO Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 07:13
Fengyun RAO Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 08:30
Fengyun RAO Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 08:38
Fengyun RAO Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 08:41
Fengyun RAO Re: spark 1.2 three times slower than spark 1.1, why? Wed, 21 Jan, 09:40