spark-user mailing list archives: May 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · Next »Thread · Author · Date
Olivier Girardot Re: [SparkSQL 1.4.0] groupBy columns are always nullable? Mon, 18 May, 19:39
Davies Liu Re: pass configuration parameters to PySpark job Mon, 18 May, 21:05
Davies Liu Re: How to run multiple jobs in one sparkcontext from separate threads in pyspark? Mon, 18 May, 21:12
Imran Rashid Re: parallelism on binary file Mon, 18 May, 22:04
Tathagata Das Re: Spark groupByKey, does it always create at least 1 partition per key? Mon, 18 May, 22:24
Imran Rashid Re: Error communicating with MapOutputTracker Mon, 18 May, 22:25
Imran Rashid Re: applications are still in progress? Mon, 18 May, 22:39
Imran Rashid Re: com.esotericsoftware.kryo.KryoException: java.io.IOException: Stream is corrupted Mon, 18 May, 22:42
Bill Jay Partition number of Spark Streaming Kafka receiver-based approach Mon, 18 May, 23:46
Saisai Shao Re: Partition number of Spark Streaming Kafka receiver-based approach Tue, 19 May, 00:54
Imran Rashid Re: Spark on Yarn : Map outputs lifetime ? Tue, 19 May, 01:05
Joseph Bradley Re: Restricting the number of iterations in Mllib Kmeans Tue, 19 May, 01:24
Imran Rashid Re: Broadcast variables can be rebroadcast? Tue, 19 May, 01:24
Imran Rashid Re: spark log field clarification Tue, 19 May, 01:31
Joseph Bradley Re: Implementing custom metrics under MLPipeline's BinaryClassificationEvaluator Tue, 19 May, 01:35
Imran Rashid Re: FetchFailedException and MetadataFetchFailedException Tue, 19 May, 01:38
Imran Rashid Re: LogisticRegressionWithLBFGS with large feature set Tue, 19 May, 02:00
Justin Pihony TwitterUtils on Windows Tue, 19 May, 02:08
Justin Pihony Re: TwitterUtils on Windows Tue, 19 May, 02:44
Chandra Mohan, Ananda Vel Murugan RE: Spark sql error while writing Parquet file- Trying to write more fields than contained in row Tue, 19 May, 03:01
xiaohe lan Re: number of executors Tue, 19 May, 03:03
Sandy Ryza Re: number of executors Tue, 19 May, 03:06
xiaohe lan Re: number of executors Tue, 19 May, 03:17
Justin Pihony Re: TwitterUtils on Windows Tue, 19 May, 04:26
Dibyendu Bhattacharya Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 04:43
Fengyun RAO Re: --jars works in "yarn-client" but not "yarn-cluster" mode, why? Tue, 19 May, 06:14
Tathagata Das Re: Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 06:28
Tathagata Das Re: [SparkStreaming] Is it possible to delay the start of some DStream in the application? Tue, 19 May, 06:31
Dibyendu Bhattacharya Re: Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 06:59
Akhil Das Re: TwitterUtils on Windows Tue, 19 May, 07:05
Dibyendu Bhattacharya Re: Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 07:06
Akhil Das Re: org.apache.spark.shuffle.FetchFailedException :: Migration from Spark 1.2 to 1.3 Tue, 19 May, 07:17
Peer, Oded group by and distinct performance issue Tue, 19 May, 07:28
Sean Owen Re: Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 07:33
Dibyendu Bhattacharya Re: Spark Streaming graceful shutdown in Spark 1.4 Tue, 19 May, 07:51
Shushant Arora spark streaming doubt Tue, 19 May, 07:53
Akhil Das Re: group by and distinct performance issue Tue, 19 May, 07:56
Akhil Das Re: spark streaming doubt Tue, 19 May, 08:05
Pa Rö Re: Spark and Flink Tue, 19 May, 08:06
N B Re: Broadcast variables can be rebroadcast? Tue, 19 May, 08:06
Steve Loughran Re: TwitterUtils on Windows Tue, 19 May, 08:20
Guillermo Ortiz Re: Working with slides. How do I know how many times a RDD has been processed? Tue, 19 May, 08:22
Night Wolf Spark 1.3.1 Performance Tuning/Patterns for Data Generation Heavy/Throughput Jobs Tue, 19 May, 08:36
Ewan Leith AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? Tue, 19 May, 08:42
Shay Rojansky Re: py-files (and others?) not properly set up in cluster-mode Spark Yarn job? Tue, 19 May, 08:56
Akhil Das Re: spark streaming doubt Tue, 19 May, 09:08
donhoff_h How to use spark to access HBase with Security enabled Tue, 19 May, 09:41
Evo Eftimov RE: Spark 1.3.1 Performance Tuning/Patterns for Data Generation Heavy/Throughput Jobs Tue, 19 May, 09:49
Cheng Lian Re: AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? Tue, 19 May, 10:00
madhu phatak Spark SQL on large number of columns Tue, 19 May, 10:04
Ewan Leith RE: AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? Tue, 19 May, 10:07
Tapan Sharma Reading Binary files in Spark program Tue, 19 May, 10:27
ayan guha Re: Spark SQL on large number of columns Tue, 19 May, 10:29
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 10:35
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 10:53
Cheng Lian Re: AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? Tue, 19 May, 10:58
Ewan Leith RE: AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? Tue, 19 May, 10:59
Wangfei (X) Re: Spark SQL on large number of columns Tue, 19 May, 11:04
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 11:05
Ted Yu Re: How to use spark to access HBase with Security enabled Tue, 19 May, 11:55
donhoff_h 回复: How to use spark to access HBase with Security enabled Tue, 19 May, 12:23
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 12:27
Keerthi RE: Decision tree: categorical variables Tue, 19 May, 12:45
Todd Nist Re: Spark sql error while writing Parquet file- Trying to write more fields than contained in row Tue, 19 May, 12:46
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 12:53
Akhil Das Re: Reading Binary files in Spark program Tue, 19 May, 12:56
Todd Nist Re: group by and distinct performance issue Tue, 19 May, 13:03
Cody Koeninger Re: Reading Real Time Data only from Kafka Tue, 19 May, 13:13
Till Rohrmann Re: Spark and Flink Tue, 19 May, 13:15
Akhil Das Re: Reading Real Time Data only from Kafka Tue, 19 May, 13:23
Imran Rashid Re: Broadcast variables can be rebroadcast? Tue, 19 May, 13:25
Heisenberg Bb Hive in IntelliJ Tue, 19 May, 13:26
Imran Rashid Re: org.apache.spark.shuffle.FetchFailedException :: Migration from Spark 1.2 to 1.3 Tue, 19 May, 13:34
Ted Yu Re: How to use spark to access HBase with Security enabled Tue, 19 May, 13:54
madhu phatak Re: Spark SQL on large number of columns Tue, 19 May, 14:20
Shushant Arora Re: spark streaming doubt Tue, 19 May, 14:40
YaoPau Does Python 2.7 have to be installed on every cluster node? Tue, 19 May, 14:44
Muralidhar, Nikhil Re: PySpark Job throwing IOError Tue, 19 May, 14:59
Tomasz Fruboes Multi user setup and saving a DataFrame / RDD to a network exported file system Tue, 19 May, 15:15
Akhil Das Re: spark streaming doubt Tue, 19 May, 15:30
Justin Pihony Windows DOS bug in windows-utils.cmd Tue, 19 May, 15:41
Panagiotis Garefalakis Mesos Spark Tasks - Lost Tue, 19 May, 15:57
Dibyendu Bhattacharya Re: spark streaming doubt Tue, 19 May, 15:59
Ram Sriharsha Re: Decision tree: categorical variables Tue, 19 May, 15:59
N B Re: Broadcast variables can be rebroadcast? Tue, 19 May, 16:32
Thomas Dudziak Wish for 1.4: upper bound on # tasks in Mesos Tue, 19 May, 16:39
Cyrus Handy PanTera Big Data Visualization built with Spark Tue, 19 May, 17:05
Matei Zaharia Re: Wish for 1.4: upper bound on # tasks in Mesos Tue, 19 May, 17:05
Thomas Dudziak Re: Wish for 1.4: upper bound on # tasks in Mesos Tue, 19 May, 17:11
Shushant Arora Re: spark streaming doubt Tue, 19 May, 17:27
Joseph Bradley Re: Getting the best parameter set back from CrossValidatorModel Tue, 19 May, 17:31
Matei Zaharia Re: Wish for 1.4: upper bound on # tasks in Mesos Tue, 19 May, 17:34
Bill Jay Spark Streaming + Kafka failure recovery Tue, 19 May, 17:42
Cody Koeninger Re: Spark Streaming + Kafka failure recovery Tue, 19 May, 17:58
fdmitriy Problem querying RDD using HiveThriftServer2.startWithContext functionality Tue, 19 May, 18:00
Ricardo Goncalves da Silva Code error Tue, 19 May, 18:59
Xiangrui Meng Re: MLlib libsvm isssues with data Tue, 19 May, 19:04
Xiangrui Meng Re: RandomSplit with Spark-ML and Dataframe Tue, 19 May, 19:08
Xiangrui Meng Re: User Defined Type (UDT) Tue, 19 May, 19:13
Stephen Boesch Re: Code error Tue, 19 May, 19:23
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · Next »Thread · Author · Date
Box list
Jun 2021137
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137