spark-user mailing list archives: September 2019

R Nair Partitioning query Fri, 13 Sep, 19:47
Jörn Franke Re: Control Sqoop job from Spark job Tue, 03 Sep, 07:29
Jörn Franke Re: Conflicting PySpark Storage Level Defaults? Mon, 16 Sep, 07:02
Abdeali Kothari Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 03:28
Abdeali Kothari Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 14:52
Abdeali Kothari Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 17:40
Abhinesh Hada [Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" Fri, 13 Sep, 15:43
Abhinesh Hada Re: [Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" Sat, 14 Sep, 20:41
Ahn, Daniel [Spark SS] Spark-23541 Backward Compatibility on 2.3.2 Thu, 26 Sep, 19:39
Alex Landa Re: Monitor Spark Applications Fri, 13 Sep, 05:17
Alex Landa Re: Monitor Spark Applications Sun, 15 Sep, 09:00
Ankit Khettry OOM Error Fri, 06 Sep, 23:33
Ankit Khettry Re: OOM Error Sat, 07 Sep, 06:18
Ankit Khettry Re: OOM Error Sat, 07 Sep, 09:19
Ankit Khettry Re: OOM Error Sat, 07 Sep, 09:56
Ankit Khettry Re: OOM Error Sat, 07 Sep, 13:56
Arun Mahadevan Re: custom rdd - do I need a hadoop input format? Tue, 17 Sep, 17:46
Bin Fan Re: Can I set the Alluxio WriteType in Spark applications? Thu, 19 Sep, 17:43
Bin Fan Re: Low cache hit ratio when running Spark on Alluxio Thu, 19 Sep, 18:02
Bryan Cutler Re: question about pyarrow.Table to pyspark.DataFrame conversion Tue, 10 Sep, 19:17
Burak Yavuz Re: Spark Kafka Streaming making progress but there is no data to be consumed Wed, 11 Sep, 22:12
Burak Yavuz Re: Spark Kafka Streaming making progress but there is no data to be consumed Thu, 12 Sep, 01:29
Charles vinodh Spark Kafka Streaming making progress but there is no data to be consumed Wed, 11 Sep, 21:38
Charles vinodh Re: Spark Kafka Streaming making progress but there is no data to be consumed Wed, 11 Sep, 22:24
Charles vinodh Re: Spark Kafka Streaming making progress but there is no data to be consumed Thu, 12 Sep, 01:08
Charles vinodh Re: Spark Kafka Streaming making progress but there is no data to be consumed Thu, 12 Sep, 02:49
Chee Yee Lim Re: Efficient cosine similarity computation Tue, 24 Sep, 01:14
Chetan Khatri Re: Control Sqoop job from Spark job Mon, 02 Sep, 11:11
Chetan Khatri Re: Control Sqoop job from Spark job Mon, 02 Sep, 11:12
Chris Teoh Re: Control Sqoop job from Spark job Mon, 02 Sep, 21:43
Chris Teoh Re: OOM Error Sat, 07 Sep, 07:35
Chris Teoh Re: OOM Error Sat, 07 Sep, 09:26
Chris Teoh Re: OOM Error Sat, 07 Sep, 10:35
David Zhou Re: Start point to read source codes Thu, 05 Sep, 20:33
David Zhou Re: how to refresh the loaded non-streaming dataframe for each steaming batch ? Fri, 06 Sep, 18:10
David Zhou Question on streaming job wait and re-run Fri, 06 Sep, 21:07
David Zhou Re: how to refresh the loaded non-streaming dataframe for each steaming batch ? Fri, 06 Sep, 21:18
Dean Arnold Inconsistent dataset behavior between file and in-memory versions Thu, 12 Sep, 18:41
Dhaval Patel Re: Spark Kafka Streaming making progress but there is no data to be consumed Thu, 12 Sep, 02:03
Dhaval Patel Re: [Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" Sat, 14 Sep, 21:58
Dhrubajyoti Hati script running in jupyter 6-7x faster than spark submit Tue, 10 Sep, 18:32
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 03:05
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 03:25
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 04:15
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 07:17
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 14:03
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 16:58
Dhrubajyoti Hati Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 17:02
Dhrubajyoti Hati Collections passed from driver to executors Fri, 20 Sep, 06:22
Dhrubajyoti Hati Re: Collections passed from driver to executors Tue, 24 Sep, 03:17
Dhrubajyoti Hati Re: Collections passed from driver to executors Tue, 24 Sep, 04:04
Dongjoon Hyun [ANNOUNCE] Announcing Apache Spark 2.4.4 Sun, 01 Sep, 21:54
Fangyuan Liu [Ask for help] How to manually submit offsetRanges Fri, 20 Sep, 12:23
Femi Anthony PySpark with custom transformer project organization Mon, 23 Sep, 21:13
G R Unable to verify in-transit encryption Mon, 16 Sep, 18:25
Gabor Somogyi Re: [Spark Streaming Kafka 0-10] - What was the reason for adding "spark-executor-" prefix to group id in executor configurations Thu, 05 Sep, 16:13
Gabor Somogyi Re: [Spark Streaming Kafka 0-10] - What was the reason for adding "spark-executor-" prefix to group id in executor configurations Fri, 06 Sep, 07:38
Gabor Somogyi Re: Access all of the custom streaming query listeners that were registered to spark session Wed, 11 Sep, 07:55
Georg Heiler [No Subject] Thu, 19 Sep, 09:14
Gourav Sengupta Re: Structured Streaming: How to add a listener for when a batch is complete Wed, 04 Sep, 15:02
Hichame El Khalfi Re: Start point to read source codes Thu, 05 Sep, 20:30
Himali Patel Test mail Thu, 05 Sep, 10:20
Himali Patel Tune hive query launched thru spark-yarn job. Thu, 05 Sep, 12:10
Himali Patel Re: Tune hive query launched thru spark-yarn job. Thu, 05 Sep, 17:32
Holden Karau Re: Announcing .NET for Apache Spark 0.5.0 Mon, 30 Sep, 16:39
Hyukjin Kwon Re: [ANNOUNCE] Announcing Apache Spark 2.4.4 Mon, 02 Sep, 05:16
Hyukjin Kwon Re: DataSourceV2: pushFilters() is not invoked for each read call - spark 2.3.2 Fri, 06 Sep, 07:20
Jack Kolokasis Shuffle Spill to Disk Sat, 28 Sep, 19:45
Jerry Vinokurov Re: intermittent Kryo serialization failures in Spark Tue, 17 Sep, 14:37
Jerry Vinokurov Re: intermittent Kryo serialization failures in Spark Wed, 18 Sep, 14:37
Jerry Vinokurov Re: intermittent Kryo serialization failures in Spark Thu, 26 Sep, 02:32
Julien Laurenceau Re: Parquet read performance for different schemas Fri, 20 Sep, 13:57
Julien Laurenceau Re: intermittent Kryo serialization failures in Spark Fri, 20 Sep, 14:00
Jungtaek Lim Kafka offset committer tool for structured streaming query Mon, 23 Sep, 15:59
Kamal7.Ku...@ril.com spark 2.x design docs Thu, 19 Sep, 06:04
Kamal7.Ku...@ril.com RE: [External]Re: spark 2.x design docs Thu, 19 Sep, 07:16
Kazuaki Ishizaki [ANNOUNCE] Announcing Apache Spark 2.3.4 Tue, 10 Sep, 04:37
Kevin Mellott Re: Exception when reading multiline JSON file Thu, 12 Sep, 22:55
Kumaresh AK Exception when reading multiline JSON file Thu, 12 Sep, 17:03
Marcelo Valle custom rdd - do I need a hadoop input format? Tue, 17 Sep, 15:28
Marcelo Valle Re: custom rdd - do I need a hadoop input format? Tue, 17 Sep, 15:47
Marcelo Valle Re: custom rdd - do I need a hadoop input format? Wed, 18 Sep, 09:16
Marcin Tustin Re: Collecting large dataset Thu, 05 Sep, 18:27
Mario Amatucci unsubscribe Thu, 19 Sep, 07:18
Mark Zhao Can I set the Alluxio WriteType in Spark applications? Tue, 17 Sep, 14:52
Mich Talebzadeh Re: Control Sqoop job from Spark job Mon, 02 Sep, 18:05
Mich Talebzadeh Google Cloud and Spark in the docker consideration for rreal time streaming data Mon, 23 Sep, 19:04
Natalie Ruiz Structured Streaming: How to add a listener for when a batch is complete Tue, 03 Sep, 22:25
Natalie Ruiz Access all of the custom streaming query listeners that were registered to spark session Tue, 10 Sep, 20:18
Nathan Kronenfeld Problem upgrading from 2.3.1 to 2.4.3 with gradle Mon, 09 Sep, 20:48
Nathan Kronenfeld Re: [Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" Fri, 13 Sep, 19:28
Nicolas Paris graphx vs graphframes Sun, 22 Sep, 20:17
Nilkanth Patel Standalone Spark, How to find (driver's ) final status for an application Thu, 26 Sep, 05:42
Patrick McCarthy Re: script running in jupyter 6-7x faster than spark submit Tue, 10 Sep, 19:14
Patrick McCarthy Re: script running in jupyter 6-7x faster than spark submit Wed, 11 Sep, 13:36
Patrick McCarthy Re: [Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" Fri, 13 Sep, 17:20
Peter Liu Re: read image or binary files / spark 2.3 Thu, 05 Sep, 18:13
Peter Liu Re: read binary files (for stream reader) / spark 2.3 Mon, 09 Sep, 14:07
Praful Rana How to integrates MLeap to Spark Structured Streaming Tue, 17 Sep, 13:32
Praful Rana How to Integrate Spark mllib Streaming Training Models To Spark Structured Streaming Tue, 17 Sep, 13:38