Jean-Baptiste Onofré |
Re: Aggregations with scala pairs |
Thu, 18 Aug, 06:35 |
马晓宇 |
SQL Based Authorization for SparkSQL |
Wed, 03 Aug, 01:40 |
Abel Rincón |
Spark Kerberos proxy user |
Thu, 25 Aug, 10:10 |
Abel Rincón |
Re: Spark Kerberos proxy user |
Tue, 30 Aug, 10:02 |
Herman van Hövell tot Westerflier |
Re: Result code of whole stage codegen |
Fri, 05 Aug, 08:06 |
Herman van Hövell tot Westerflier |
Re: Welcoming Felix Cheung as a committer |
Mon, 08 Aug, 22:04 |
Maciej Bryński |
Result code of whole stage codegen |
Fri, 05 Aug, 07:55 |
Maciej Bryński |
Re: Spark SQL and Kryo registration |
Fri, 05 Aug, 08:07 |
Maciej Bryński |
Re: Result code of whole stage codegen |
Fri, 05 Aug, 08:18 |
Maciej Bryński |
Re: GraphFrames 0.2.0 released |
Wed, 24 Aug, 17:11 |
Maciej Bryński |
Tree for SQL Query |
Wed, 24 Aug, 19:31 |
Maciej Bryński |
Re: Tree for SQL Query |
Thu, 25 Aug, 06:23 |
Maciej Bryński |
Re: Performance of loading parquet files into case classes in Spark |
Sat, 27 Aug, 20:32 |
Maciej Bryński |
Cache'ing performance |
Sat, 27 Aug, 20:39 |
Maciej Bryński |
Re: Performance of loading parquet files into case classes in Spark |
Sun, 28 Aug, 20:12 |
Tomasz Gawęda |
Real time streaming in Spark |
Mon, 29 Aug, 20:13 |
Amit Sela |
Re: Spark SQL and Kryo registration |
Thu, 04 Aug, 15:41 |
Artur Sukhenko |
Remaining folders in .sparkStaging directory after app was killed |
Mon, 29 Aug, 16:06 |
Benjamin Fradet |
Re: Use cases around image/video processing in spark |
Wed, 10 Aug, 16:27 |
Bryan Cutler |
AccumulatorV2 += operator |
Tue, 02 Aug, 20:46 |
Bryan Cutler |
Re: AccumulatorV2 += operator |
Wed, 03 Aug, 15:05 |
Chris Fregly |
Re: Spark 2.0.1 / 2.1.0 on Maven |
Tue, 09 Aug, 18:52 |
Chris Fregly |
Re: Serving Spark ML models via a regular Python web app |
Thu, 11 Aug, 16:35 |
Chris Fregly |
Re: Serving Spark ML models via a regular Python web app |
Thu, 11 Aug, 16:42 |
Cody Koeninger |
Re: sampling operation for DStream |
Mon, 01 Aug, 16:43 |
Cody Koeninger |
Re: sampling operation for DStream |
Mon, 01 Aug, 21:01 |
Cody Koeninger |
Re: How does MapWithStateRDD distribute the data |
Wed, 03 Aug, 16:34 |
Cody Koeninger |
Re: Kafka Support new topic subscriptions without requiring restart of the streaming context |
Mon, 08 Aug, 13:20 |
Cody Koeninger |
Re: Structured Streaming with Kafka sources/sinks |
Tue, 16 Aug, 02:26 |
Cody Koeninger |
Re: Structured Streaming with Kafka sources/sinks |
Tue, 30 Aug, 15:09 |
Cody Koeninger |
Re: Structured Streaming with Kafka sources/sinks |
Tue, 30 Aug, 16:12 |
Cody Koeninger |
Re: Model abstract class in spark ml |
Wed, 31 Aug, 14:32 |
Daniel Darabos |
Re: is the Lineage of RDD stored as a byte code in memory or a file? |
Wed, 24 Aug, 13:02 |
Deepak Sharma |
Use cases around image/video processing in spark |
Wed, 10 Aug, 15:20 |
Denis Bolshakov |
spark roadmap |
Mon, 29 Aug, 08:23 |
Denny Lee |
Re: Welcoming Felix Cheung as a committer |
Tue, 09 Aug, 05:53 |
Dibyendu Bhattacharya |
Latest Release of Receiver based Kafka Consumer for Spark Streaming. |
Thu, 25 Aug, 11:33 |
Dongjoon Hyun |
Re: Welcoming Felix Cheung as a committer |
Mon, 08 Aug, 18:17 |
Dongjoon Hyun |
Re: Mesos is now a maven module |
Tue, 30 Aug, 16:56 |
Dongjoon Hyun |
Re: Mesos is now a maven module |
Tue, 30 Aug, 17:11 |
Dongjoon Hyun |
Re: Mesos is now a maven module |
Tue, 30 Aug, 20:56 |
Eric Liang |
Re: Scaling partitioned Hive table support |
Mon, 08 Aug, 19:51 |
Ewan Leith |
Re: How to resolve the SparkExecption : Size exceeds Integer.MAX_VALUE |
Mon, 15 Aug, 19:04 |
Fang Zhang |
Saving less data to improve Pregel performance in GraphX? |
Tue, 30 Aug, 01:46 |
Felix Cheung |
Re: Welcoming Felix Cheung as a committer |
Tue, 09 Aug, 04:44 |
Felix Cheung |
Re: Spark R - Loading Third Party R Library in YARN Executors |
Wed, 17 Aug, 11:16 |
Fred Reiss |
Source API requires unbounded distributed storage? |
Thu, 04 Aug, 23:38 |
Fred Reiss |
Re: Source API requires unbounded distributed storage? |
Tue, 09 Aug, 02:24 |
Fred Reiss |
Re: Structured Streaming with Kafka sources/sinks |
Mon, 29 Aug, 19:39 |
Georgios Samaras |
Fwd: KMeans calls takeSample() twice? |
Tue, 30 Aug, 16:50 |
Georgios Samaras |
Re: KMeans calls takeSample() twice? |
Tue, 30 Aug, 17:31 |
Georgios Samaras |
Re: KMeans calls takeSample() twice? |
Wed, 31 Aug, 16:29 |
Guo, Chenzhao |
Structured Streaming with Kafka sources/sinks |
Tue, 16 Aug, 02:12 |
Hao Ren |
[MLlib] Term Frequency in TF-IDF seems incorrect |
Mon, 01 Aug, 22:29 |
Hao Ren |
[SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected |
Sun, 07 Aug, 21:31 |
Hao Ren |
Re: [SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected |
Mon, 08 Aug, 08:02 |
Hao Ren |
Re: [SPARK-2.0][SQL] UDF containing non-serializable object does not work as expected |
Mon, 08 Aug, 21:05 |
Holden Karau |
Re: AccumulatorV2 += operator |
Tue, 02 Aug, 21:52 |
Holden Karau |
Re: AccumulatorV2 += operator |
Wed, 03 Aug, 17:02 |
Holden Karau |
Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? |
Fri, 05 Aug, 19:43 |
Holden Karau |
Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? |
Fri, 05 Aug, 21:22 |
Holden Karau |
Early Draft Structured Streaming Machine Learning |
Thu, 18 Aug, 19:33 |
Holden Karau |
Re: Persisting PySpark ML Pipelines that include custom Transformers |
Fri, 19 Aug, 19:16 |
Hyukjin Kwon |
Inquery about Spark's behaviour for configurations in Hadoop configuration instance via read/write.options() |
Fri, 05 Aug, 01:26 |
Hyukjin Kwon |
Re: Welcoming Felix Cheung as a committer |
Tue, 09 Aug, 00:49 |
Hyukjin Kwon |
Re: Sorting within partitions is not maintained in parquet? |
Thu, 11 Aug, 11:27 |
Hyukjin Kwon |
Inconsistency for nullvalue handling CSV: see SPARK-16462, SPARK-16460, SPARK-15144, SPARK-17290 and SPARK-16903 |
Tue, 30 Aug, 02:55 |
Ignacio Zendejas |
Re: RFC: Remote "HBaseTest" from examples? |
Thu, 18 Aug, 20:43 |
Jacek Laskowski |
Re: Spark SQL and Kryo registration |
Thu, 04 Aug, 15:14 |
Jacek Laskowski |
Re: Welcoming Felix Cheung as a committer |
Sat, 13 Aug, 01:45 |
Jacek Laskowski |
Re: Spark 2.0.1 / 2.1.0 on Maven |
Mon, 15 Aug, 01:11 |
Jacek Laskowski |
Re: Spark 2.0.1 / 2.1.0 on Maven |
Tue, 16 Aug, 00:41 |
Jacek Laskowski |
Re: GraphFrames 0.2.0 released |
Wed, 17 Aug, 01:18 |
Jacek Laskowski |
[master] ERROR RetryingHMSHandler: AlreadyExistsException(message:Database default already exists) |
Wed, 17 Aug, 02:33 |
Jacek Laskowski |
Re: [master] ERROR RetryingHMSHandler: AlreadyExistsException(message:Database default already exists) |
Wed, 17 Aug, 06:06 |
Jacek Laskowski |
How is mapped LogicalPlan to RDDs eventually if ever? How about Dataset? |
Wed, 17 Aug, 22:00 |
Jacek Laskowski |
Found a typo in Catalyst's exception and want to write a test -- help needed |
Thu, 18 Aug, 06:46 |
Jacek Laskowski |
Why is isStreaming naming-inconsistent with analyzed and resolved in LogicalPlan? |
Mon, 22 Aug, 09:02 |
Jacek Laskowski |
Analyzer.resolver a duplicate of CatalystConf.resolver? |
Mon, 22 Aug, 09:28 |
Jacek Laskowski |
Re: Spark dev-setup |
Wed, 24 Aug, 10:38 |
Jacek Laskowski |
Re: Spark dev-setup |
Wed, 24 Aug, 13:35 |
Jacek Laskowski |
Re: Mesos is now a maven module |
Fri, 26 Aug, 21:14 |
Jacek Laskowski |
3Ps for Datasets not available?! (=Parquet Predicate Pushdown) |
Tue, 30 Aug, 08:20 |
Jacek Laskowski |
Re: 3Ps for Datasets not available?! (=Parquet Predicate Pushdown) |
Tue, 30 Aug, 09:44 |
Jacek Laskowski |
Re: Reynold on vacation next two weeks |
Tue, 30 Aug, 15:36 |
Jason Moore |
Sorting within partitions is not maintained in parquet? |
Thu, 11 Aug, 06:23 |
Jeff Zhang |
Re: Welcoming Felix Cheung as a committer |
Tue, 09 Aug, 01:14 |
Jeremy Smith |
Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? |
Fri, 05 Aug, 20:14 |
Jeremy Smith |
Parquet partitioning / appends |
Thu, 18 Aug, 20:01 |
Jerry Lam |
Broadcast Variable Life Cycle |
Sun, 21 Aug, 17:07 |
Jerry Lam |
Re: Broadcast Variable Life Cycle |
Mon, 29 Aug, 15:30 |
Jerry Lam |
Re: Broadcast Variable Life Cycle |
Tue, 30 Aug, 15:43 |
Jerry Lam |
Re: Broadcast Variable Life Cycle |
Tue, 30 Aug, 16:12 |
Jim Pivarski |
Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? |
Fri, 05 Aug, 21:18 |
Jim Pivarski |
Re: Apache Arrow data in buffer to RDD/DataFrame/Dataset? |
Fri, 05 Aug, 22:53 |
Joseph Bradley |
Re: Welcoming Felix Cheung as a committer |
Tue, 16 Aug, 22:51 |
Joseph Bradley |
Re: GraphFrames 0.2.0 released |
Sat, 27 Aug, 00:10 |
Julien Dumazert |
Performance of loading parquet files into case classes in Spark |
Sat, 27 Aug, 13:27 |
Julien Dumazert |
Re: Performance of loading parquet files into case classes in Spark |
Sun, 28 Aug, 19:27 |
Julien Dumazert |
Re: Performance of loading parquet files into case classes in Spark |
Mon, 29 Aug, 19:58 |