spark-user mailing list archives: July 2016

Site index · List index
Message list1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · Next »Thread · Author · Date
Sea Bug about reading parquet files Fri, 08 Jul, 08:33
cj read parquetfile in spark-sql error Mon, 25 Jul, 11:08
cj Re: read parquetfile in spark-sql error Tue, 26 Jul, 04:11
ناهید بهجتی نجف آبادی spark Wed, 27 Jul, 10:45
Ismaël Mejía Re: SparkWebUI and Master URL on EC2 Fri, 22 Jul, 05:52
Bruckwald Tamás Read Kafka topic in a Spark batch job Tue, 05 Jul, 12:15
Bruckwald Tamás Re: Read Kafka topic in a Spark batch job Tue, 05 Jul, 12:40
Christophe Préaud Re: Inode for STS Wed, 13 Jul, 07:25
Maciej Bryński Re: transtition SQLContext to SparkSession Tue, 19 Jul, 16:47
Maciej Bryński Re: MultiThreading in Spark 1.6.0 Wed, 20 Jul, 21:52
Maximiliano Patricio Méndez Problems initializing SparkUI Thu, 28 Jul, 21:37
Mikael Ståldal Why is KafkaUtils.createRDD offsetRanges an Array rather than a Seq? Fri, 08 Jul, 09:42
Mikael Ståldal Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 09:43
Mikael Ståldal Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 12:56
Mikael Ståldal Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 13:26
Sergio Fernández ml models distribution Fri, 22 Jul, 09:49
Sergio Fernández Re: ml models distribution Fri, 22 Jul, 13:04
Yann-Aël Le Borgne Spark R 2.0 dapply very slow Sun, 31 Jul, 13:14
Sea 回复: Bug about reading parquet files Fri, 08 Jul, 12:44
Sea 回复: Spark hangs at "Removed broadcast_*" Wed, 13 Jul, 03:04
cj 回复: read parquetfile in spark-sql error Tue, 26 Jul, 04:13
focus Re:run spark apps in linux crontab Wed, 20 Jul, 10:11
另一片天 where is open source Distributed service framework use for spark?? Wed, 06 Jul, 06:24
喜之郎 回复: Enforcing shuffle hash join Tue, 05 Jul, 08:59
接立骞 [SPARK-3586][streaming]Support nested directories in Spark Sat, 30 Jul, 17:26
陆巍|Wei Lu(RD) many 'activity' job are pending Fri, 15 Jul, 16:17
Jörn Franke Re: Processing json document Thu, 07 Jul, 06:42
Jörn Franke Re: Memory grows exponentially Fri, 08 Jul, 07:44
Jörn Franke Re: Processing json document Fri, 08 Jul, 07:54
Jörn Franke Re: Using Spark on Hive with Hive also using Spark as its execution engine Mon, 11 Jul, 15:59
Jörn Franke Re: Using Spark on Hive with Hive also using Spark as its execution engine Tue, 12 Jul, 12:31
Jörn Franke Re: Custom InputFormat (SequenceFileInputFormat vs FileInputFormat) Fri, 15 Jul, 23:01
Jörn Franke Re: Little idea needed Tue, 19 Jul, 19:57
Jörn Franke Re: ORC v/s Parquet for Spark 2.0 Tue, 26 Jul, 09:09
Jörn Franke Re: ORC v/s Parquet for Spark 2.0 Wed, 27 Jul, 13:31
Jörn Franke Re: ORC v/s Parquet for Spark 2.0 Wed, 27 Jul, 20:30
Jörn Franke Re: ORC v/s Parquet for Spark 2.0 Thu, 28 Jul, 08:18
Jörn Franke Re: Custom Image RDD and Sequence Files Fri, 29 Jul, 05:46
A.W. Covert III PySpark 2.0 Structured Streaming Question Wed, 20 Jul, 20:22
AC24 SparkR | Exception in invokeJava: SparkR + Windows standalone cluster Wed, 06 Jul, 14:08
ANDREA SPINA Re: Issue with Spark on 25 nodes cluster Wed, 13 Jul, 09:20
Aakash Basu Little idea needed Tue, 19 Jul, 19:27
Aakash Basu Re: Little idea needed Wed, 20 Jul, 20:23
Aakash Basu Re: Little idea needed Wed, 20 Jul, 20:26
Aaron Ilovici Re: Error in running JavaALSExample example from spark examples Fri, 22 Jul, 17:34
Aaron Ilovici Re: Error in running JavaALSExample example from spark examples Fri, 22 Jul, 18:01
Aaron Jackson Heavy Stage Concentration - Ends With Failure Wed, 20 Jul, 00:16
Abhishek Anand Concatenate the columns in dataframe to create new collumns using Java Mon, 18 Jul, 10:45
Abhishek Anand Re: Concatenate the columns in dataframe to create new collumns using Java Mon, 18 Jul, 12:14
Abhishek Anand Re: Concatenate the columns in dataframe to create new collumns using Java Mon, 18 Jul, 13:23
Ajay Srivastava SPARK-8813 - combining small files in spark sql Thu, 07 Jul, 06:53
Ajinkya Kale Saving a pyspark.ml.feature.PCA model Tue, 19 Jul, 22:54
Ajinkya Kale Re: Saving a pyspark.ml.feature.PCA model Wed, 20 Jul, 03:22
Ajinkya Kale Re: Saving a pyspark.ml.feature.PCA model Wed, 20 Jul, 18:14
Akhil Das Re: Remote RPC client disassociated Fri, 01 Jul, 10:38
Akhil Das Re: RDD to DataFrame question with JsValue in the mix Fri, 01 Jul, 10:42
Akhil Das Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Fri, 01 Jul, 10:46
Akhil Das Re: Remote RPC client disassociated Fri, 01 Jul, 10:55
Akmal Abbasov How Spark HA works Tue, 05 Jul, 08:34
Alex Nastetsky spark sql aggregate function "Nth" Tue, 26 Jul, 14:57
Alex Nastetsky Re: spark sql aggregate function "Nth" Tue, 26 Jul, 16:05
Alexander Pivovarov Re: ORC v/s Parquet for Spark 2.0 Thu, 28 Jul, 22:15
Alexey Pechorin Re: Ideas to put a Spark ML model in production Sun, 03 Jul, 09:06
Ameen Akel Spark 2.0.0 RC 5 -- java.lang.AssertionError: assertion failed: Block rdd_[*] is not locked for reading Sun, 24 Jul, 17:00
Amit Dutta Call http request from within Spark Thu, 14 Jul, 14:52
Amit Sela Re: Aggregator (Spark 2.0) skips aggregation is zero(0 returns null Fri, 01 Jul, 21:04
Amit Sela init() and cleanup() for Spark map functions Thu, 21 Jul, 14:11
Andreas Bauer Re: Is Spark suited for replacing a batch job using many database tables? Wed, 06 Jul, 19:39
Andreas Bauer Re: Is Spark suited for replacing a batch job using many database tables? Wed, 06 Jul, 19:54
Andreas Bauer Re: Is Spark suited for replacing a batch job using many database tables? Wed, 06 Jul, 20:21
Andreas Bauer Re: Is Spark suited for replacing a batch job using many database tables? Wed, 06 Jul, 20:24
Andreas Bauer Re: Is Spark suited for replacing a batch job using many database tables? Wed, 06 Jul, 21:29
Andrew Ash Re: How do I download 2.0? The main download page isn't showing it? Thu, 28 Jul, 00:35
Andrew Ehrlich Spark performance testing Sat, 09 Jul, 03:40
Andrew Ehrlich Re: Spark performance testing Sat, 09 Jul, 04:28
Andrew Ehrlich Re: Building standalone spark application via sbt Tue, 19 Jul, 14:53
Andrew Ehrlich Re: spark worker continuously trying to connect to master and failed in standalone mode Wed, 20 Jul, 03:12
Andrew Ehrlich Re: Heavy Stage Concentration - Ends With Failure Wed, 20 Jul, 03:20
Andrew Ehrlich Re: Is it good choice to use DAO to store results generated by spark application? Wed, 20 Jul, 03:27
Andrew Ehrlich Re: the spark job is so slow - almost frozen Wed, 20 Jul, 03:35
Andrew Ehrlich Re: Spark Job trigger in production Wed, 20 Jul, 03:37
Andrew Ehrlich Re: How to give name to Spark jobs shown in Spark UI Sat, 23 Jul, 18:55
Andrew Ehrlich Re: Error in collecting RDD as a Map - IOException in collectAsMap Sat, 23 Jul, 18:57
Andrew Ehrlich Re: spark and plot data Sat, 23 Jul, 19:01
Andrew Ehrlich Re: How to generate a sequential key in rdd across executors Sun, 24 Jul, 04:24
Andrew Ehrlich Re: Size exceeds Integer.MAX_VALUE Sun, 24 Jul, 04:31
Andrew Ehrlich Re: Size exceeds Integer.MAX_VALUE Mon, 25 Jul, 01:05
Andrew Ehrlich Re: Bzip2 to Parquet format Mon, 25 Jul, 03:00
Andy Davidson spark streaming: how come I have scheduling delay when processing time is less then batch windowing size Thu, 07 Jul, 17:47
Andy Davidson Re: spark streaming: how come I have scheduling delay when processing time is less then batch windowing size Thu, 07 Jul, 20:09
Andy Davidson is dataframe.write() async? Streaming performance problem Thu, 07 Jul, 20:59
Andy Davidson Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 22:00
Andy Davidson can I use ExectorService in my driver? was: is dataframe.write() async? Streaming performance problem Fri, 08 Jul, 17:29
Andy Davidson spark UI what does storage memory x/y mean Mon, 11 Jul, 15:52
Andy Davidson WARN FileOutputCommitter: Failed to delete the temporary output directory of task: attempt_201607111453_128606_m_000000_0 - s3n:// Mon, 11 Jul, 16:00
Andy Davidson trouble accessing driver log files using rest-api Mon, 11 Jul, 20:14
Andy Davidson Re: Spark Streaming - Direct Approach Mon, 11 Jul, 21:30
Andy Davidson /spark-ec2 script: trouble using ganglia web ui spark 1.6.1 Mon, 11 Jul, 21:49
Andy Davidson Re: Trouble while running spark at ec2 cluster Mon, 18 Jul, 16:11
Andy Davidson Re: Role-based S3 access outside of EMR Tue, 19 Jul, 21:47
Message list1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · Next »Thread · Author · Date
Box list
May 2019280
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137