spark-user mailing list archives: September 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · Next »Thread · Author · Date
Ted Yu Re: Why is 1 executor overworked and other sit idle? Tue, 22 Sep, 13:15
Adrian Tanase Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 13:15
Adrian Tanase Re: Invalid checkpoint url Tue, 22 Sep, 13:19
java8964 RE: Long GC pauses with Spark SQL 1.3.0 and billion row tables Tue, 22 Sep, 13:19
java8964 RE: Long GC pauses with Spark SQL 1.3.0 and billion row tables Tue, 22 Sep, 13:19
Adrian Tanase Re: Spark Streaming distributed job Tue, 22 Sep, 13:25
ayan guha Py4j issue with Python Kafka Module Tue, 22 Sep, 13:41
Daniel Haviv Re: spark-avro takes a lot time to load thousands of files Tue, 22 Sep, 13:54
sanderg Performance Spark SQL vs Dataframe API faster Tue, 22 Sep, 14:05
oggie spark on mesos gets killed by cgroups for too much memory Tue, 22 Sep, 14:19
Cheng, Hao RE: Performance Spark SQL vs Dataframe API faster Tue, 22 Sep, 14:29
srungarapu vamsi Re: Invalid checkpoint url Tue, 22 Sep, 14:34
gtinside Error while saving parquet Tue, 22 Sep, 14:37
Yana Kadiyska Help getting started with Kafka Tue, 22 Sep, 14:38
Philip Weaver Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 14:50
Cody Koeninger Re: Help getting started with Kafka Tue, 22 Sep, 14:50
java8964 RE: spark-avro takes a lot time to load thousands of files Tue, 22 Sep, 15:00
Yana Kadiyska Re: Help getting started with Kafka Tue, 22 Sep, 15:29
juljoin Apache Spark job in local[*] is slower than regular 1-thread Python program Tue, 22 Sep, 15:37
Sean Owen Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 15:38
Adrian Tanase Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 15:38
Philip Weaver Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 15:39
Sean Owen Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 15:54
Ruslan Dautkhanov Re: Spark Web UI + NGINX Tue, 22 Sep, 15:59
Philip Weaver Re: Remove duplicate keys by always choosing first in file. Tue, 22 Sep, 16:07
Thúy Hằng Lê Re: Using Spark for portfolio manager app Tue, 22 Sep, 16:12
Saisai Shao Re: Py4j issue with Python Kafka Module Tue, 22 Sep, 16:25
Cheng Lian Re: spark + parquet + schema name and metadata Tue, 22 Sep, 16:37
Luciano Resende Re: SparkR - calling as.vector() with rdd dataframe causes error Tue, 22 Sep, 17:06
Pulasthi Supun Wickramasinghe Re: Creating BlockMatrix with java API Tue, 22 Sep, 17:20
Pedro Rodriguez Re: How to speed up MLlib LDA? Tue, 22 Sep, 17:30
Michael Armbrust Re: Count for select not matching count for group by Tue, 22 Sep, 17:46
Cheng Lian Re: spark + parquet + schema name and metadata Tue, 22 Sep, 17:54
Charles Earl Re: How to speed up MLlib LDA? Tue, 22 Sep, 17:57
Jo Sunad Re: Has anyone used the Twitter API for location filtering? Tue, 22 Sep, 18:05
Marko Asplund Re: How to speed up MLlib LDA? Tue, 22 Sep, 18:07
Deenar Toraskar Spark 1.5 UDAF ArrayType Tue, 22 Sep, 18:13
Michael Armbrust Re: Spark 1.5 UDAF ArrayType Tue, 22 Sep, 18:28
Clément Frison How to share memory in a broadcast between tasks in the same executor? Tue, 22 Sep, 18:42
Utkarsh Sengar Re: How to share memory in a broadcast between tasks in the same executor? Tue, 22 Sep, 18:53
jeff saremi pyspark question: create RDD from csr_matrix Tue, 22 Sep, 19:02
Shiv Kandavelu Spark as standalone or with Hadoop stack. Tue, 22 Sep, 19:25
Sean Owen Re: Spark as standalone or with Hadoop stack. Tue, 22 Sep, 19:31
Adrian Tanase Re: Deploying spark-streaming application on production Tue, 22 Sep, 19:34
Stuart Layton unsubscribe Tue, 22 Sep, 19:53
Michael Armbrust Re: Spark 1.5 UDAF ArrayType Tue, 22 Sep, 19:59
Ted Yu Re: Spark as standalone or with Hadoop stack. Tue, 22 Sep, 20:03
Kali.tumm...@gmail.com KafkaProducer using Cassandra as source Tue, 22 Sep, 20:14
Deenar Toraskar Re: Spark 1.5 UDAF ArrayType Tue, 22 Sep, 20:42
Deenar Toraskar Re: How to share memory in a broadcast between tasks in the same executor? Tue, 22 Sep, 21:35
XIANDI Partitions on RDDs Tue, 22 Sep, 22:41
Krishna Sankar HDP 2.3 support for Spark 1.5.x Tue, 22 Sep, 22:42
Jacek Laskowski SPARK_WORKER_INSTANCES was detected (set to '2')…This is deprecated in Spark 1.0+ Tue, 22 Sep, 22:57
Jacek Laskowski Re: Spark as standalone or with Hadoop stack. Tue, 22 Sep, 23:02
Richard Eggert Re: Partitions on RDDs Tue, 22 Sep, 23:12
Tathagata Das Re: Invalid checkpoint url Tue, 22 Sep, 23:23
Zhan Zhang Re: HDP 2.3 support for Spark 1.5.x Tue, 22 Sep, 23:31
Michal Čizmazia Re: WAL on S3 Tue, 22 Sep, 23:53
Tathagata Das Re: WAL on S3 Tue, 22 Sep, 23:57
Richard Eggert Re: Apache Spark job in local[*] is slower than regular 1-thread Python program Wed, 23 Sep, 00:02
Richard Eggert Re: Why is 1 executor overworked and other sit idle? Wed, 23 Sep, 00:08
Michal Čizmazia Re: WAL on S3 Wed, 23 Sep, 00:15
Bryan Jeffrey Yarn Shutting Down Spark Processing Wed, 23 Sep, 00:49
Tathagata Das Re: WAL on S3 Wed, 23 Sep, 01:09
fightf...@163.com Spark standalone/Mesos on top of Ceph Wed, 23 Sep, 01:28
Jerry Lam Re: Spark standalone/Mesos on top of Ceph Wed, 23 Sep, 01:37
Zhiliang Zhu how to submit the spark job outside the cluster Wed, 23 Sep, 01:37
fightf...@163.com Re: Re: Spark standalone/Mesos on top of Ceph Wed, 23 Sep, 01:59
tridib Re: Long GC pauses with Spark SQL 1.3.0 and billion row tables Wed, 23 Sep, 02:07
Zhan Zhang Re: how to submit the spark job outside the cluster Wed, 23 Sep, 02:20
Jerry Lam Re: Re: Spark standalone/Mesos on top of Ceph Wed, 23 Sep, 02:21
Andy Huang Parallel collection in driver programs Wed, 23 Sep, 02:39
Tathagata Das Re: Streaming Receiver Imbalance Problem Wed, 23 Sep, 02:40
SLiZn Liu Re: Streaming Receiver Imbalance Problem Wed, 23 Sep, 02:45
Zhiliang Zhu Re: how to submit the spark job outside the cluster Wed, 23 Sep, 02:49
Zhan Zhang Re: how to submit the spark job outside the cluster Wed, 23 Sep, 03:08
Zhiliang Zhu Re: how to submit the spark job outside the cluster Wed, 23 Sep, 03:14
Zhan Zhang Re: how to submit the spark job outside the cluster Wed, 23 Sep, 03:29
Michal Čizmazia Re: WAL on S3 Wed, 23 Sep, 03:35
madhvi.gupta Re: SparkR for accumulo Wed, 23 Sep, 03:41
Chirag Dewan RE: Why is 1 executor overworked and other sit idle? Wed, 23 Sep, 03:45
satish chandra j Scala Limitation - Case Class definition with more than 22 arguments Wed, 23 Sep, 03:48
Zhiliang Zhu Re: how to submit the spark job outside the cluster Wed, 23 Sep, 03:55
Ted Yu Re: Scala Limitation - Case Class definition with more than 22 arguments Wed, 23 Sep, 04:07
ayan guha Re: Py4j issue with Python Kafka Module Wed, 23 Sep, 04:10
Andy Huang Re: Scala Limitation - Case Class definition with more than 22 arguments Wed, 23 Sep, 04:20
srungarapu vamsi Re: Invalid checkpoint url Wed, 23 Sep, 04:41
Saisai Shao Re: Py4j issue with Python Kafka Module Wed, 23 Sep, 04:45
Jonathan Coveney Re: Apache Spark job in local[*] is slower than regular 1-thread Python program Wed, 23 Sep, 04:48
Sabarish Sasidharan Re: Creating BlockMatrix with java API Wed, 23 Sep, 04:55
Pulasthi Supun Wickramasinghe Re: Creating BlockMatrix with java API Wed, 23 Sep, 05:14
satish chandra j JdbcRDD Constructor Wed, 23 Sep, 05:30
Deenar Toraskar Re: spark-avro takes a lot time to load thousands of files Wed, 23 Sep, 05:35
Yashwanth Kumar Re: SparkR vs R Wed, 23 Sep, 05:39
Yashwanth Kumar Re: Partitions on RDDs Wed, 23 Sep, 05:43
swetha How to make Group By/reduceByKey more efficient? Wed, 23 Sep, 05:43
Tathagata Das Re: Streaming Receiver Imbalance Problem Wed, 23 Sep, 05:52
Tathagata Das Re: Py4j issue with Python Kafka Module Wed, 23 Sep, 05:54
Tathagata Das Re: Invalid checkpoint url Wed, 23 Sep, 05:55
Tathagata Das Re: WAL on S3 Wed, 23 Sep, 06:10
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · Next »Thread · Author · Date
Box list
Jun 2021148
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137