spark-dev mailing list archives: August 2016

Jean-Baptiste Onofré Re: Aggregations with scala pairs Thu, 18 Aug, 06:35
马晓宇 SQL Based Authorization for SparkSQL Wed, 03 Aug, 01:40
Abel Rincón Spark Kerberos proxy user Thu, 25 Aug, 10:10
Abel Rincón Re: Spark Kerberos proxy user Tue, 30 Aug, 10:02
Herman van Hövell tot Westerflier Re: Result code of whole stage codegen Fri, 05 Aug, 08:06
Herman van Hövell tot Westerflier Re: Welcoming Felix Cheung as a committer Mon, 08 Aug, 22:04
Maciej Bryński Result code of whole stage codegen Fri, 05 Aug, 07:55
Maciej Bryński Re: Spark SQL and Kryo registration Fri, 05 Aug, 08:07
Maciej Bryński Re: Result code of whole stage codegen Fri, 05 Aug, 08:18
Maciej Bryński Re: GraphFrames 0.2.0 released Wed, 24 Aug, 17:11
Maciej Bryński Tree for SQL Query Wed, 24 Aug, 19:31
Maciej Bryński Re: Tree for SQL Query Thu, 25 Aug, 06:23
Maciej Bryński Re: Performance of loading parquet files into case classes in Spark Sat, 27 Aug, 20:32
Maciej Bryński Cache'ing performance Sat, 27 Aug, 20:39
Maciej Bryński Re: Performance of loading parquet files into case classes in Spark Sun, 28 Aug, 20:12
Tomasz Gawęda Real time streaming in Spark Mon, 29 Aug, 20:13
Amit Sela Re: Spark SQL and Kryo registration Thu, 04 Aug, 15:41
Artur Sukhenko Remaining folders in .sparkStaging directory after app was killed Mon, 29 Aug, 16:06
Benjamin Fradet Re: Use cases around image/video processing in spark Wed, 10 Aug, 16:27
Bryan Cutler AccumulatorV2 += operator Tue, 02 Aug, 20:46
Bryan Cutler Re: AccumulatorV2 += operator Wed, 03 Aug, 15:05
Chris Fregly Re: Spark 2.0.1 / 2.1.0 on Maven Tue, 09 Aug, 18:52
Chris Fregly Re: Serving Spark ML models via a regular Python web app Thu, 11 Aug, 16:35
Chris Fregly Re: Serving Spark ML models via a regular Python web app Thu, 11 Aug, 16:42
Cody Koeninger Re: sampling operation for DStream Mon, 01 Aug, 16:43
Cody Koeninger Re: sampling operation for DStream Mon, 01 Aug, 21:01
Cody Koeninger Re: How does MapWithStateRDD distribute the data Wed, 03 Aug, 16:34
Cody Koeninger Re: Kafka Support new topic subscriptions without requiring restart of the streaming context Mon, 08 Aug, 13:20
Cody Koeninger Re: Structured Streaming with Kafka sources/sinks Tue, 16 Aug, 02:26
Cody Koeninger Re: Structured Streaming with Kafka sources/sinks Tue, 30 Aug, 15:09
Cody Koeninger Re: Structured Streaming with Kafka sources/sinks Tue, 30 Aug, 16:12
Cody Koeninger Re: Model abstract class in spark ml Wed, 31 Aug, 14:32
Daniel Darabos Re: is the Lineage of RDD stored as a byte code in memory or a file? Wed, 24 Aug, 13:02
Deepak Sharma Use cases around image/video processing in spark Wed, 10 Aug, 15:20
Denis Bolshakov spark roadmap Mon, 29 Aug, 08:23
Denny Lee Re: Welcoming Felix Cheung as a committer Tue, 09 Aug, 05:53
Dibyendu Bhattacharya Latest Release of Receiver based Kafka Consumer for Spark Streaming. Thu, 25 Aug, 11:33
Dongjoon Hyun Re: Welcoming Felix Cheung as a committer Mon, 08 Aug, 18:17
Dongjoon Hyun Re: Mesos is now a maven module Tue, 30 Aug, 16:56
Dongjoon Hyun Re: Mesos is now a maven module Tue, 30 Aug, 17:11
Dongjoon Hyun Re: Mesos is now a maven module Tue, 30 Aug, 20:56
Eric Liang Re: Scaling partitioned Hive table support Mon, 08 Aug, 19:51
Ewan Leith Re: How to resolve the SparkExecption : Size exceeds Integer.MAX_VALUE Mon, 15 Aug, 19:04
Fang Zhang Saving less data to improve Pregel performance in GraphX? Tue, 30 Aug, 01:46
Felix Cheung Re: Welcoming Felix Cheung as a committer Tue, 09 Aug, 04:44
Felix Cheung Re: Spark R - Loading Third Party R Library in YARN Executors Wed, 17 Aug, 11:16
Fred Reiss Source API requires unbounded distributed storage? Thu, 04 Aug, 23:38
Fred Reiss Re: Source API requires unbounded distributed storage? Tue, 09 Aug, 02:24
Fred Reiss Re: Structured Streaming with Kafka sources/sinks Mon, 29 Aug, 19:39
Georgios Samaras Fwd: KMeans calls takeSample() twice? Tue, 30 Aug, 16:50
Georgios Samaras Re: KMeans calls takeSample() twice? Tue, 30 Aug, 17:31
Georgios Samaras Re: KMeans calls takeSample() twice? Wed, 31 Aug, 16:29
Guo, Chenzhao Structured Streaming with Kafka sources/sinks Tue, 16 Aug, 02:12
Hao Ren [MLlib] Term Frequency in TF-IDF seems incorrect Mon, 01 Aug, 22:29
Hao Ren [SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected Sun, 07 Aug, 21:31
Hao Ren Re: [SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected Mon, 08 Aug, 08:02
Hao Ren Re: [SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected Mon, 08 Aug, 21:05
Holden Karau Re: AccumulatorV2 += operator Tue, 02 Aug, 21:52
Holden Karau Re: AccumulatorV2 += operator Wed, 03 Aug, 17:02
Holden Karau Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? Fri, 05 Aug, 19:43
Holden Karau Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? Fri, 05 Aug, 21:22
Holden Karau Early Draft Structured Streaming Machine Learning Thu, 18 Aug, 19:33
Holden Karau Re: Persisting PySpark ML Pipelines that include custom Transformers Fri, 19 Aug, 19:16
Hyukjin Kwon Inquery about Spark's behaviour for configurations in Hadoop configuration instance via read/write.options() Fri, 05 Aug, 01:26
Hyukjin Kwon Re: Welcoming Felix Cheung as a committer Tue, 09 Aug, 00:49
Hyukjin Kwon Re: Sorting within partitions is not maintained in parquet? Thu, 11 Aug, 11:27
Hyukjin Kwon Inconsistency for nullvalue handling CSV: see SPARK-16462, SPARK-16460, SPARK-15144, SPARK-17290 and SPARK-16903 Tue, 30 Aug, 02:55
Ignacio Zendejas Re: RFC: Remote "HBaseTest" from examples? Thu, 18 Aug, 20:43
Jacek Laskowski Re: Spark SQL and Kryo registration Thu, 04 Aug, 15:14
Jacek Laskowski Re: Welcoming Felix Cheung as a committer Sat, 13 Aug, 01:45
Jacek Laskowski Re: Spark 2.0.1 / 2.1.0 on Maven Mon, 15 Aug, 01:11
Jacek Laskowski Re: Spark 2.0.1 / 2.1.0 on Maven Tue, 16 Aug, 00:41
Jacek Laskowski Re: GraphFrames 0.2.0 released Wed, 17 Aug, 01:18
Jacek Laskowski [master] ERROR RetryingHMSHandler: AlreadyExistsException(message:Database default already exists) Wed, 17 Aug, 02:33
Jacek Laskowski Re: [master] ERROR RetryingHMSHandler: AlreadyExistsException(message:Database default already exists) Wed, 17 Aug, 06:06
Jacek Laskowski How is mapped LogicalPlan to RDDs eventually if ever? How about Dataset? Wed, 17 Aug, 22:00
Jacek Laskowski Found a typo in Catalyst's exception and want to write a test -- help needed Thu, 18 Aug, 06:46
Jacek Laskowski Why is isStreaming naming-inconsistent with analyzed and resolved in LogicalPlan? Mon, 22 Aug, 09:02
Jacek Laskowski Analyzer.resolver a duplicate of CatalystConf.resolver? Mon, 22 Aug, 09:28
Jacek Laskowski Re: Spark dev-setup Wed, 24 Aug, 10:38
Jacek Laskowski Re: Spark dev-setup Wed, 24 Aug, 13:35
Jacek Laskowski Re: Mesos is now a maven module Fri, 26 Aug, 21:14
Jacek Laskowski 3Ps for Datasets not available?! (=Parquet Predicate Pushdown) Tue, 30 Aug, 08:20
Jacek Laskowski Re: 3Ps for Datasets not available?! (=Parquet Predicate Pushdown) Tue, 30 Aug, 09:44
Jacek Laskowski Re: Reynold on vacation next two weeks Tue, 30 Aug, 15:36
Jason Moore Sorting within partitions is not maintained in parquet? Thu, 11 Aug, 06:23
Jeff Zhang Re: Welcoming Felix Cheung as a committer Tue, 09 Aug, 01:14
Jeremy Smith Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? Fri, 05 Aug, 20:14
Jeremy Smith Parquet partitioning / appends Thu, 18 Aug, 20:01
Jerry Lam Broadcast Variable Life Cycle Sun, 21 Aug, 17:07
Jerry Lam Re: Broadcast Variable Life Cycle Mon, 29 Aug, 15:30
Jerry Lam Re: Broadcast Variable Life Cycle Tue, 30 Aug, 15:43
Jerry Lam Re: Broadcast Variable Life Cycle Tue, 30 Aug, 16:12
Jim Pivarski Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? Fri, 05 Aug, 21:18
Jim Pivarski Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? Fri, 05 Aug, 22:53
Joseph Bradley Re: Welcoming Felix Cheung as a committer Tue, 16 Aug, 22:51
Joseph Bradley Re: GraphFrames 0.2.0 released Sat, 27 Aug, 00:10
Julien Dumazert Performance of loading parquet files into case classes in Spark Sat, 27 Aug, 13:27
Julien Dumazert Re: Performance of loading parquet files into case classes in Spark Sun, 28 Aug, 19:27
Julien Dumazert Re: Performance of loading parquet files into case classes in Spark Mon, 29 Aug, 19:58