spark-user mailing list archives: October 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next »Thread · Author · Date
Kristina Rogale Plazonic Where to put import sqlContext.implicits._ to be able to work on DataFrames in another file? Mon, 05 Oct, 17:05
Kristina Rogale Plazonic Re: Spark SQL running totals Thu, 15 Oct, 18:56
Krot Viacheslav correct and fast way to stop streaming application Mon, 26 Oct, 15:28
Krot Viacheslav Re: correct and fast way to stop streaming application Tue, 27 Oct, 09:13
Krot Viacheslav Re: correct and fast way to stop streaming application Tue, 27 Oct, 17:47
Krzysztof Zarzycki Re: Store DStreams into Hive using Hive Streaming Mon, 05 Oct, 09:21
Krzysztof Zarzycki Re: Notification on Spark Streaming job failure Wed, 07 Oct, 05:28
Lan Jiang How to access lost executor log file Thu, 01 Oct, 17:30
Lan Jiang Re: How to access lost executor log file Thu, 01 Oct, 18:46
Lan Jiang "java.io.IOException: Filesystem closed" on executors Thu, 01 Oct, 19:41
Lan Jiang Re: "java.io.IOException: Filesystem closed" on executors Mon, 05 Oct, 15:25
Lan Jiang Spark cache memory storage Tue, 06 Oct, 19:15
Lan Jiang failed spark job reports on YARN as successful Thu, 08 Oct, 18:16
Lan Jiang Re: How to increase Spark partitions for the DataFrame? Thu, 08 Oct, 18:25
Lan Jiang Re: How to increase Spark partitions for the DataFrame? Thu, 08 Oct, 19:13
Lan Jiang Re: "java.io.IOException: Filesystem closed" on executors Thu, 15 Oct, 01:10
Lan Jiang Re: Ahhhh... Spark creates >30000 partitions... What can I do? Tue, 20 Oct, 14:04
Lan Jiang Re: Ahhhh... Spark creates >30000 partitions... What can I do? Tue, 20 Oct, 22:02
Langston, Jim localhost webui port Tue, 13 Oct, 12:47
Lars Albertsson Re: [SPARK STREAMING] polling based operation instead of event based operation Fri, 23 Oct, 08:05
Lei Wu TaskMemoryManager. cleanUpAllAllocatedMemory -> Memory leaks ??? Mon, 12 Oct, 07:28
Lei Wu [No Subject] Thu, 15 Oct, 08:34
Lei Wu Design doc for Spark task scheduling Thu, 15 Oct, 08:38
Lij Tapel Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver Mon, 19 Oct, 18:26
Lij Tapel Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver Mon, 19 Oct, 19:02
Lij Tapel Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver Mon, 19 Oct, 20:23
Lin Zhao "Failed to bind to" error with spark-shell on CDH5 and YARN Fri, 23 Oct, 23:46
Lin Zhao Re: "Failed to bind to" error with spark-shell on CDH5 and YARN Sun, 25 Oct, 21:26
LinQili Is there a way to create multiple streams in spark streaming? Tue, 20 Oct, 10:20
Luciano Resende Re: SF Spark Office Hours Experiment - Friday Afternoon Wed, 21 Oct, 20:18
Luciano Resende Re: spark streaming 1.51. uses very old version of twitter4j Wed, 21 Oct, 21:14
MEETHU MATHEW Re: Define new stage in pipeline Wed, 21 Oct, 04:30
Maheshakya Wijewardena Using a variable (a column name) in an IF statement in Spark SQL Thu, 08 Oct, 13:13
Maheshakya Wijewardena Re: Using a variable (a column name) in an IF statement in Spark SQL Fri, 09 Oct, 02:28
Marcelo Vanzin Re: Pyspark: "Error: No main class set in JAR; please specify one with --class" Thu, 01 Oct, 17:07
Marcelo Vanzin Re: How does FAIR job scheduler work in Standalone cluster mode? Sat, 03 Oct, 00:20
Marcelo Vanzin Re: How does FAIR job scheduler work in Standalone cluster mode? Sat, 03 Oct, 00:48
Marcelo Vanzin Re: compatibility issue with Jersey2 Tue, 06 Oct, 18:40
Marcelo Vanzin Re: compatibility issue with Jersey2 Tue, 06 Oct, 19:20
Marcelo Vanzin Re: compatibility issue with Jersey2 Wed, 07 Oct, 18:26
Marcelo Vanzin Re: Spark shuffle service does not work in stand alone Tue, 13 Oct, 16:13
Marcelo Vanzin Re: Spark shuffle service does not work in stand alone Tue, 13 Oct, 17:35
Marcelo Vanzin Re: Programmatically connect to remote YARN in yarn-client mode Wed, 14 Oct, 17:13
Marcelo Vanzin Re: Programmatically connect to remote YARN in yarn-client mode Wed, 14 Oct, 17:32
Marcelo Vanzin Re: [Spark-SQL]: Unable to propagate hadoop configuration after SparkContext is initialized Tue, 27 Oct, 18:05
Marcelo Vanzin Re: [Spark-SQL]: Unable to propagate hadoop configuration after SparkContext is initialized Tue, 27 Oct, 18:30
Marco Mistroni Problem installing Sparck on Windows 8 Mon, 12 Oct, 22:11
Marco Mistroni Re: Problem installing Sparck on Windows 8 Wed, 14 Oct, 19:56
Marco Mistroni Re: Problem installing Sparck on Windows 8 Thu, 15 Oct, 22:40
Marco Mistroni Re: Problem installing Sparck on Windows 8 Sat, 17 Oct, 21:51
Mark Bonnekessel Apache Spark on Raspberry Pi Cluster with Docker Wed, 28 Oct, 13:20
Mark Bonnekessel Apache Spark on Raspberry Pi Cluster with Docker Wed, 28 Oct, 14:23
Mark Hamstra Re: foreachPartition Fri, 30 Oct, 23:45
Mark Luk Re: Worker node timeout exception Thu, 01 Oct, 07:00
Mark Vervuurt Spark-Testing-Base Q/A Wed, 21 Oct, 10:16
Mark Vervuurt Re: Spark-Testing-Base Q/A Wed, 21 Oct, 11:37
Martin Senne Why is no predicate pushdown performed, when using Hive (HiveThriftServer2) ? Wed, 28 Oct, 12:32
Martin Senne Sorry, but Nabble and ML suck Sat, 31 Oct, 16:19
Martin Senne Re: Sorry, but Nabble and ML suck Sat, 31 Oct, 16:40
Martin Senne Re: Sorry, but Nabble and ML suck Sat, 31 Oct, 16:43
Martin Senne Why does predicate pushdown not work on HiveContext (concrete HiveThriftServer2) ? Sat, 31 Oct, 16:50
Matei Zaharia Re: How to compile Spark with customized Hadoop? Fri, 09 Oct, 21:31
Matej Holec How to set memory for SparkR with master="local[*]" Fri, 23 Oct, 11:43
Matt Narrell Re: laziness in textFile reading from HDFS? Sun, 04 Oct, 04:50
Matt Narrell Re: laziness in textFile reading from HDFS? Tue, 06 Oct, 22:32
Matt Narrell Re: laziness in textFile reading from HDFS? Tue, 06 Oct, 23:08
Matthias Niehoff Dynamic Resource Allocation with Spark Streaming (Standalone Cluster, Spark 1.5.1) Mon, 26 Oct, 20:00
Meihua Wu Re: [SPARK MLLIB] could not understand the wrong and inscrutable result of Linear Regression codes Mon, 26 Oct, 05:48
Meihua Wu Spark Implementation of XGBoost Mon, 26 Oct, 18:42
Meihua Wu Re: Spark Implementation of XGBoost Tue, 27 Oct, 03:37
Meihua Wu Re: Spark Implementation of XGBoost Tue, 27 Oct, 03:46
Meihua Wu Re: Spark Implementation of XGBoost Wed, 28 Oct, 00:04
Michael Albert are functions deserialized once per task? Fri, 02 Oct, 16:33
Michael Armbrust Re: How to use registered Hive UDF in Spark DataFrame? Fri, 02 Oct, 14:53
Michael Armbrust Re: SparkSQL: Reading data from hdfs and storing into multiple paths Fri, 02 Oct, 14:54
Michael Armbrust Re: How to use registered Hive UDF in Spark DataFrame? Fri, 02 Oct, 21:06
Michael Armbrust Re: performance difference between Thrift server and SparkSQL? Sat, 03 Oct, 20:26
Michael Armbrust Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 19:44
Michael Armbrust Re: String operation in filter with a special character Mon, 05 Oct, 19:50
Michael Armbrust Re: Spark context on thrift server Mon, 05 Oct, 19:52
Michael Armbrust Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 20:25
Michael Armbrust Re: Spark SQL "SELECT ... LIMIT" scans the entire Hive table? Mon, 05 Oct, 21:35
Michael Armbrust Re: ORC files created by Spark job can't be accessed using hive table Tue, 06 Oct, 17:24
Michael Armbrust Re: Does feature parity exist between Spark and PySpark Wed, 07 Oct, 17:03
Michael Armbrust Re: SparkSQL: First query execution is always slower than subsequent queries Thu, 08 Oct, 01:47
Michael Armbrust Re: Using Sqark SQL mapping over an RDD Thu, 08 Oct, 17:16
Michael Armbrust Re: Using a variable (a column name) in an IF statement in Spark SQL Thu, 08 Oct, 17:18
Michael Armbrust Re: Default size of a datatype in SparkSQL Thu, 08 Oct, 17:19
Michael Armbrust Re: Using Sqark SQL mapping over an RDD Thu, 08 Oct, 17:51
Michael Armbrust Re: How to register udf with Any or generic Type in spark Thu, 08 Oct, 17:52
Michael Armbrust Re: Dataframes - sole data structure for parallel computations? Thu, 08 Oct, 17:56
Michael Armbrust Re: RowNumber in HiveContext returns null or negative values Thu, 08 Oct, 17:57
Michael Armbrust Re: RowNumber in HiveContext returns null or negative values Thu, 08 Oct, 18:31
Michael Armbrust Re: Using a variable (a column name) in an IF statement in Spark SQL Fri, 09 Oct, 18:52
Michael Armbrust Re: error in sparkSQL 1.5 using count(1) in nested queries Fri, 09 Oct, 19:01
Michael Armbrust Re: How to calculate percentile of a column of DataFrame? Fri, 09 Oct, 19:03
Michael Armbrust Re: How to calculate percentile of a column of DataFrame? Fri, 09 Oct, 19:50
Michael Armbrust Re: Spark DataFrame GroupBy into List Tue, 13 Oct, 18:11
Michael Armbrust Re: Question about data frame partitioning in Spark 1.3.0 Wed, 14 Oct, 17:11
Michael Armbrust Re: Spark DataFrame GroupBy into List Wed, 14 Oct, 17:15
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next »Thread · Author · Date
Box list
Nov 201955
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137