spark-user mailing list archives: December 2018

15313776907 Re: Spark Sql group by less performant Tue, 11 Dec, 01:09
15313776907 Re: how to generate a larg dataset paralleled Fri, 14 Dec, 08:39
大啊 Re:Re: [Spark SQL]use zstd, No enum constant parquet.hadoop.metadata.CompressionCodecName.ZSTD Fri, 21 Dec, 06:18
大啊 Re:running updates using SPARK Fri, 21 Dec, 06:20
大啊 Re:Re: running updates using SPARK Mon, 24 Dec, 02:17
大啊 Re:jdbc spark streaming Fri, 28 Dec, 02:29
大啊 Re:Re: Async action in Dataframe Sat, 29 Dec, 03:23
Chang.Wu Executor launched but no tasks is submitted Mon, 03 Dec, 13:11
☼ R Nair Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 15:36
李斌松 spark2.4 arrow enabled true,error log not returned Sat, 15 Dec, 06:39
李斌松 [Spark SQL]use zstd, No enum constant parquet.hadoop.metadata.CompressionCodecName.ZSTD Thu, 20 Dec, 03:38
李斌松 [spark-sql] Hive failing on insert empty array into parquet table Sat, 29 Dec, 08:08
李斌松 Re: [spark-sql] Hive failing on insert empty array into parquet table Sat, 29 Dec, 11:54
Andrés Ivaldi Spark version performance Wed, 12 Dec, 01:57
Andrés Ivaldi Re: Spark Core - Embed in other application Wed, 12 Dec, 02:53
朱 婧迪 how to register UDF when scala code invoke python Fri, 07 Dec, 08:56
Jörn Franke Re: Spark Scala reading from Google Cloud BigQuery table throws error Tue, 18 Dec, 11:16
Jörn Franke Re: [SPARK SQL] Difference between 'Hive on spark' and Spark SQL Thu, 20 Dec, 08:34
Jörn Franke Re: spark application takes significant some time to succeed even after all jobs are completed Tue, 25 Dec, 12:00
Jörn Franke Re: spark application takes significant some time to succeed even after all jobs are completed Tue, 25 Dec, 12:47
Aakash Basu Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 06:28
Aakash Basu Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 07:50
Aakash Basu Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 08:51
Aakash Basu Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 12:17
Aakash Basu Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 13:46
Aakash Basu Re: Connection issue with AWS S3 from PySpark 2.3.1 Fri, 21 Dec, 16:23
Abdeali Kothari PicklingError - Can't pickle py4j.protocol.Py4JJavaError - it's not the same object Sun, 02 Dec, 12:53
Abdeali Kothari Identifying cause of exception in PySpark Mon, 10 Dec, 03:40
Abhijeet Kumar Join happening after watermark time Thu, 06 Dec, 08:41
Abhijeet Kumar Why does join use rows that were sent after watermark of 20 seconds? Mon, 10 Dec, 11:53
Abhijeet Kumar Re: Why does join use rows that were sent after watermark of 20 seconds? Tue, 11 Dec, 05:52
Abhijeet Kumar Spark not working with Hadoop 4mc compression Thu, 20 Dec, 06:03
Affan Syed OData compliant API for Spark Wed, 05 Dec, 05:14
Affan Syed Re: OData compliant API for Spark Thu, 06 Dec, 08:21
Akshay Mendole Spark executors exceeding heap space allocated Fri, 21 Dec, 15:47
Akshay Mendole spark application takes significant some time to succeed even after all jobs are completed Tue, 25 Dec, 11:51
Akshay Mendole Tuning G1GC params for aggressive garbage collection? Tue, 25 Dec, 11:57
Akshay Mendole Re: spark application takes significant some time to succeed even after all jobs are completed Tue, 25 Dec, 12:08
Akshay Mendole Re: Tuning G1GC params for aggressive garbage collection? Wed, 26 Dec, 02:11
Alchemist Spark Streaming job is missing Streaming tab from the UI on Ambari Thu, 06 Dec, 03:54
Alchemist How to fix spark streaming missing tab Thu, 06 Dec, 13:20
Alexander Chermenin State size on joining two streams Tue, 18 Dec, 08:58
Alexey Spark jdbc postgres numeric array Mon, 31 Dec, 15:13
Anastasios Zouzias Re: Packaging kafka certificates in uber jar Tue, 25 Dec, 13:26
Andrew Melo Questions about caching Tue, 11 Dec, 17:13
Andrew Melo Re: What are the alternatives to nested DataFrames? Sat, 29 Dec, 01:47
Andrew Old Dataset experimental interfaces Tue, 18 Dec, 20:54
Antoine DUBOIS Using spark and mesos container with host_path volume Mon, 03 Dec, 15:44
Ascot Moss Powered By Spark Sat, 22 Dec, 08:13
Bin Fan Re: Questions about caching Tue, 25 Dec, 05:20
Cheikh_SOW [Spark cluster standalone v2.4.0] - problems with reverse proxy functionnality regarding submitted applications in cluster mode and the spark history server ui Thu, 20 Dec, 16:42
Chris Teoh Re: Convert RDD[Iterrable[MyCaseClass]] to RDD[MyCaseClass] Sat, 01 Dec, 10:09
Chris Teoh Re: Convert RDD[Iterrable[MyCaseClass]] to RDD[MyCaseClass] Sat, 01 Dec, 11:17
Chunpeng Wang SGD for pyspark Tue, 11 Dec, 16:29
Colin Williams Packaging kafka certificates in uber jar Mon, 24 Dec, 20:29
Colin Williams Re: Packaging kafka certificates in uber jar Wed, 26 Dec, 12:16
Colin Williams Corrupt record handling in spark structured streaming and from_json function Wed, 26 Dec, 21:55
Colin Williams Re: Corrupt record handling in spark structured streaming and from_json function Wed, 26 Dec, 22:42
Colin Williams Re: Corrupt record handling in spark structured streaming and from_json function Thu, 27 Dec, 02:01
Conrad Lee Re: Job hangs in blocked task in final parquet write stage Tue, 04 Dec, 08:45
Conrad Lee Re: Job hangs in blocked task in final parquet write stage Tue, 11 Dec, 08:05
Daniel O' Shaughnessy [No Subject] Wed, 19 Dec, 13:59
David Markovitz Run SQL on files directly Sat, 08 Dec, 17:39
David Markovitz RE: Run SQL on files directly Sat, 08 Dec, 21:55
Davide.Mandrini Re: Driver Memory taken up by BlockManager Fri, 14 Dec, 10:19
Debajyoti Roy Spark Dataset transformations for time based events Wed, 26 Dec, 07:34
Devender Yadav Add column value in the dataset on the basis of a condition Tue, 18 Dec, 13:47
Devender Yadav Re: Add column value in the dataset on the basis of a condition Tue, 18 Dec, 15:17
Etienne Chauchot Re: [Apache Beam] Custom DataSourceV2 instanciation: parameters passing and Encoders Tue, 18 Dec, 16:09
Fawze Abujaber Re: How to clean up logs-dirs and local-dirs of running spark streaming in yarn cluster mode Wed, 26 Dec, 03:26
FengYu Cao How do you set POSIX rlimit on mesos Fri, 28 Dec, 03:04
Gabor Somogyi Re: "failed to get records for spark-executor after polling for ***" error Mon, 03 Dec, 10:11
Gaurav Gupta Getting FileNotFoundException and LeaseExpired Exception while writing a df to hdfs path Mon, 24 Dec, 20:04
Georg Heiler Re: Spark Sql group by less performant Tue, 11 Dec, 07:44
Gerard Maas Re: Convert RDD[Iterrable[MyCaseClass]] to RDD[MyCaseClass] Mon, 03 Dec, 16:48
Gezim Sejdiu SANSA 0.5 (Scalable Semantic Analytics Stack) Released Fri, 14 Dec, 08:28
Gezim Sejdiu Re: Connecting to Cassandra from Zeppelin on EMR cluster Sat, 22 Dec, 19:41
GmailLiang Unsubscribe Tue, 04 Dec, 12:41
Gourav Sengupta Re: How to track batch jobs in spark ? Thu, 06 Dec, 22:13
Gourav Sengupta Re: How to fix spark streaming missing tab Thu, 06 Dec, 22:18
Gourav Sengupta running updates using SPARK Thu, 20 Dec, 22:05
Gourav Sengupta Re: running updates using SPARK Fri, 21 Dec, 08:55
Gourav Sengupta Re: Connection issue with AWS S3 from PySpark 2.3.1 Mon, 24 Dec, 14:22
Gourav Sengupta Python 3.x Sun, 30 Dec, 15:25
Hans Fischer Recommended Node Usage Tue, 04 Dec, 20:17
Holden Karau Re: How to preserve event order per key in Structured Streaming Repartitioning By Key? Wed, 12 Dec, 04:58
JF Chen "failed to get records for spark-executor after polling for ***" error Mon, 03 Dec, 09:32
JF Chen how to change temp directory when spark write data ? Wed, 05 Dec, 08:11
JF Chen Re: how to change temp directory when spark write data ? Wed, 05 Dec, 13:50
JF Chen How to set Spark Streaming batch start time? Wed, 12 Dec, 02:00
James Starks Re: Convert RDD[Iterrable[MyCaseClass]] to RDD[MyCaseClass] Mon, 03 Dec, 12:53
James Starks Parallel read parquet file, write to postgresql Mon, 03 Dec, 13:40
Jean Georges Perrin Re: OData compliant API for Spark Wed, 05 Dec, 13:50
Jean Georges Perrin Re: how to generate a larg dataset paralleled Fri, 14 Dec, 03:10
Jean Georges Perrin Multiple sessions in one application? Wed, 19 Dec, 11:12
JiaTao Tao Async action in Dataframe Sat, 22 Dec, 08:47
JiaTao Tao About LocalProperty in sqlConf Sat, 22 Dec, 09:16
Jiaan Geng Re: Spark 2.2.1 - Operation not allowed: alter table replace columns Wed, 19 Dec, 11:15
Jiaan Geng Re: Read Time from a remote data source Thu, 20 Dec, 03:49
Jiaan Geng Re: Multiple sessions in one application? Fri, 21 Dec, 02:02
Box list
May 2019: 280
Apr 2019: 263
Mar 2019: 248
Feb 2019: 186
Jan 2019: 244
Dec 2018: 202
Nov 2018: 235
Oct 2018: 275
Sep 2018: 235
Aug 2018: 262
Jul 2018: 309
Jun 2018: 377
May 2018: 386
Apr 2018: 410
Mar 2018: 444
Feb 2018: 383
Jan 2018: 332
Dec 2017: 350
Nov 2017: 267
Oct 2017: 410
Sep 2017: 452
Aug 2017: 525
Jul 2017: 520
Jun 2017: 645
May 2017: 549
Apr 2017: 564
Mar 2017: 621
Feb 2017: 744
Jan 2017: 889
Dec 2016: 865
Nov 2016: 1118
Oct 2016: 1115
Sep 2016: 1402
Aug 2016: 1564
Jul 2016: 1684
Jun 2016: 1457
May 2016: 1496
Apr 2016: 1411
Mar 2016: 2044
Feb 2016: 1799
Jan 2016: 1740
Dec 2015: 1870
Nov 2015: 1541
Oct 2015: 2041
Sep 2015: 2125
Aug 2015: 1978
Jul 2015: 2343
Jun 2015: 2366
May 2015: 1864
Apr 2015: 2314
Mar 2015: 2577
Feb 2015: 2187
Jan 2015: 2152
Dec 2014: 1937
Nov 2014: 2024
Oct 2014: 2244
Sep 2014: 2094
Aug 2014: 1949
Jul 2014: 2389
Jun 2014: 1773
May 2014: 1397
Apr 2014: 1459
Mar 2014: 1286
Feb 2014: 1029
Jan 2014: 925
Dec 2013: 611
Nov 2013: 558
Oct 2013: 505
Sep 2013: 235
Aug 2013: 97
Jul 2013: 7