spark-user mailing list archives: November 2016

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · Next »Thread · Author · Date
Shreya Agarwal RE: Re:RE: how to merge dataframe write output files Thu, 10 Nov, 08:28
Elkhan Dadashov Re: SparkLauncer 2.0.1 version working incosistently in yarn-client mode Thu, 10 Nov, 08:31
Shreya Agarwal RE: Strongly Connected Components Thu, 10 Nov, 08:44
Mendelson, Assaf RE: Re:RE: how to merge dataframe write output files Thu, 10 Nov, 08:49
Mich Talebzadeh Re: importing data into hdfs/spark using Informatica ETL tool Thu, 10 Nov, 09:01
Cody Koeninger Re: Akka Stream as the source for Spark Streaming. Please advice... Thu, 10 Nov, 13:47
Prashant Sharma Re: If we run sc.textfile(path,xxx) many times, will the elements be the same in each partition Thu, 10 Nov, 14:20
codlife will spark aggregate and treeaggregate case a shuflle action? Thu, 10 Nov, 15:45
Ivan von Nagy Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Thu, 10 Nov, 16:26
Manish Malhotra Spark Streaming: question on sticky session across batches ? Thu, 10 Nov, 16:42
Yang type-safe join in the new DataSet API? Thu, 10 Nov, 18:44
Perttu Ranta-aho UDF with column value comparison fails with PySpark Thu, 10 Nov, 19:14
Michael Armbrust Re: type-safe join in the new DataSet API? Thu, 10 Nov, 19:18
Davies Liu Re: UDF with column value comparison fails with PySpark Thu, 10 Nov, 19:19
Perttu Ranta-aho Re: UDF with column value comparison fails with PySpark Thu, 10 Nov, 19:47
KhajaAsmath Mohammed Re: Access_Remote_Kerberized_Cluster_Through_Spark Thu, 10 Nov, 20:11
shyla deshpande Anyone using ProtoBuf for Kafka messages with Spark Streaming for processing? Thu, 10 Nov, 20:20
Mohammad Tariq Re: Correct SparkLauncher usage Thu, 10 Nov, 22:43
Stuart White Joining to a large, pre-sorted file Thu, 10 Nov, 22:45
Marcelo Vanzin Re: Correct SparkLauncher usage Thu, 10 Nov, 22:49
Mohammad Tariq Re: Correct SparkLauncher usage Thu, 10 Nov, 22:57
Mohammad Tariq Re: Correct SparkLauncher usage Thu, 10 Nov, 23:00
Marcelo Vanzin Re: Correct SparkLauncher usage Thu, 10 Nov, 23:05
Mohammad Tariq Re: Correct SparkLauncher usage Thu, 10 Nov, 23:07
Jörn Franke Re: Joining to a large, pre-sorted file Fri, 11 Nov, 01:33
Stuart White Re: Joining to a large, pre-sorted file Fri, 11 Nov, 01:39
Shixiong(Ryan) Zhu Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Fri, 11 Nov, 01:43
Silvio Fiorito Re: Joining to a large, pre-sorted file Fri, 11 Nov, 02:14
jggg777 Re: Newbie question - Best way to bootstrap with Spark Fri, 11 Nov, 02:55
Felix Cheung Re: Strongly Connected Components Fri, 11 Nov, 03:49
Shreya Agarwal RE: Strongly Connected Components Fri, 11 Nov, 04:15
Jorge Sánchez Re: how to merge dataframe write output files Fri, 11 Nov, 07:45
Sean Owen Re: TallSkinnyQR Fri, 11 Nov, 11:55
Xiaomeng Wan load large number of files from s3 Fri, 11 Nov, 13:08
Shawn Wan load large number of files from s3 Fri, 11 Nov, 13:12
Mich Talebzadeh Possible DR solution Fri, 11 Nov, 14:56
kant kodali How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 16:04
Yong Zhang Re: How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 16:07
kant kodali Re: How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 16:11
kant kodali Re: How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 16:14
Raghav Kafka Producer within a docker Instance Fri, 11 Nov, 16:19
Mudit Kumar RE: Possible DR solution Fri, 11 Nov, 16:43
kant kodali Re: How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 16:51
Russell Spitzer Re: How to use Spark SQL to connect to Cassandra from Spark-Shell? Fri, 11 Nov, 17:09
Mich Talebzadeh Re: Possible DR solution Fri, 11 Nov, 17:10
Deepak Sharma Re: Possible DR solution Fri, 11 Nov, 17:11
Mich Talebzadeh Re: Possible DR solution Fri, 11 Nov, 17:12
Deepak Sharma Re: Possible DR solution Fri, 11 Nov, 17:14
Aniket Bhatnagar Dataset API | Setting number of partitions during join/groupBy Fri, 11 Nov, 17:22
Mich Talebzadeh Re: Possible DR solution Fri, 11 Nov, 17:24
Gerard Casey RDD to HDFS - Kerberos - authentication error - RetryInvocationHandler Fri, 11 Nov, 17:48
Anil Langote DataSet is not able to handle 50,000 columns to sum Fri, 11 Nov, 17:57
Iman Mohtashemi Re: TallSkinnyQR Fri, 11 Nov, 17:59
Shreya Agarwal RE: Dataset API | Setting number of partitions during join/groupBy Fri, 11 Nov, 18:27
Nicholas Sharkey Finding a Spark Equivalent for Pandas' get_dummies Fri, 11 Nov, 18:27
nsharkey Finding a Spark Equivalent for Pandas' get_dummies Fri, 11 Nov, 18:32
Aniket Bhatnagar Re: Dataset API | Setting number of partitions during join/groupBy Fri, 11 Nov, 18:35
Shreya Agarwal RE: Strongly Connected Components Fri, 11 Nov, 18:39
Nick Pentreath Re: Finding a Spark Equivalent for Pandas' get_dummies Fri, 11 Nov, 19:00
Cody Koeninger Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Fri, 11 Nov, 20:12
Nicholas Sharkey Re: Finding a Spark Equivalent for Pandas' get_dummies Fri, 11 Nov, 21:21
Mich Talebzadeh Re: Possible DR solution Fri, 11 Nov, 22:19
Elkhan Dadashov appHandle.kill(), SparkSubmit Process, JVM questions related to SparkLauncher design and Spark Driver Fri, 11 Nov, 22:49
ayan guha Re: DataSet is not able to handle 50,000 columns to sum Sat, 12 Nov, 00:10
SamPenrose pyspark: accept unicode column names in DataFrame.corr and cov Sat, 12 Nov, 00:36
Daniel Darabos Re: Strongly Connected Components Sat, 12 Nov, 00:58
Elkhan Dadashov SparkDriver memory calculation mismatch Sat, 12 Nov, 02:18
Elkhan Dadashov Exception not failing Python applications (in yarn client mode) - SparkLauncher says app succeeded, where app actually has failed Sat, 12 Nov, 03:32
Anil Langote Re: DataSet is not able to handle 50,000 columns to sum Sat, 12 Nov, 03:33
Shreya Agarwal RE: Strongly Connected Components Sat, 12 Nov, 03:39
Sean Owen Re: SparkDriver memory calculation mismatch Sat, 12 Nov, 08:24
Elkhan Dadashov Re: SparkDriver memory calculation mismatch Sat, 12 Nov, 09:13
Sean Owen Re: SparkDriver memory calculation mismatch Sat, 12 Nov, 09:40
vincent gromakowski Re: Possible DR solution Sat, 12 Nov, 09:52
Elkhan Dadashov Re: SparkDriver memory calculation mismatch Sat, 12 Nov, 09:59
Elkhan Dadashov Re: Correct SparkLauncher usage Sat, 12 Nov, 10:26
ayan guha Re: Exception not failing Python applications (in yarn client mode) - SparkLauncher says app succeeded, where app actually has failed Sat, 12 Nov, 11:00
Mich Talebzadeh Re: Possible DR solution Sat, 12 Nov, 11:04
Rohit Verma Spark joins using row id Sat, 12 Nov, 11:11
Jörn Franke Re: Possible DR solution Sat, 12 Nov, 11:17
Hyukjin Kwon Re: pyspark: accept unicode column names in DataFrame.corr and cov Sat, 12 Nov, 11:43
dev loper Spark Streaming- ReduceByKey not removing Duplicates for the same key in a Batch Sat, 12 Nov, 12:36
Stuart White Re: Joining to a large, pre-sorted file Sat, 12 Nov, 13:40
Rohit Verma Re: Spark joins using row id Sat, 12 Nov, 13:54
Jacek Laskowski Re: Akka Stream as the source for Spark Streaming. Please advice... Sat, 12 Nov, 14:43
Mich Talebzadeh Re: Possible DR solution Sat, 12 Nov, 15:05
Luciano Resende Re: Akka Stream as the source for Spark Streaming. Please advice... Sat, 12 Nov, 15:07
Shushant Arora spark streaming with kinesis Sat, 12 Nov, 15:08
Silvio Fiorito Re: Joining to a large, pre-sorted file Sat, 12 Nov, 15:34
Stuart White Re: Joining to a large, pre-sorted file Sat, 12 Nov, 16:20
Cody Koeninger Re: Spark Streaming- ReduceByKey not removing Duplicates for the same key in a Batch Sat, 12 Nov, 16:25
dev loper Re: Spark Streaming- ReduceByKey not removing Duplicates for the same key in a Batch Sat, 12 Nov, 16:29
Jacek Laskowski Re: Akka Stream as the source for Spark Streaming. Please advice... Sat, 12 Nov, 16:42
Timur Shenkao Re: Possible DR solution Sat, 12 Nov, 17:17
Ivan von Nagy Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Sat, 12 Nov, 18:14
deepak.subhramanian Re: Possible DR solution Sat, 12 Nov, 19:27
Sean McKibben Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Sat, 12 Nov, 19:46
Koert Kuipers Re: Strongly Connected Components Sat, 12 Nov, 19:48
Koert Kuipers Re: Strongly Connected Components Sat, 12 Nov, 20:01
Ivan von Nagy Re: Instability issues with Spark 2.0.1 and Kafka 0.10 Sat, 12 Nov, 20:15
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · Next »Thread · Author · Date
Box list
Jun 202198
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137