spark-user mailing list archives: July 2019

15313776907 Re: Core allocation is scattered Fri, 26 Jul, 01:35
Guillermo Ortiz Fernández Parse RDD[Seq[String]] to DataFrame with types. Mon, 15 Jul, 22:52
José Luis Pedrosa Spark 2.4.3 with hadoop 3.2 docker image. Thu, 04 Jul, 17:13
xiaobo NoSuchMethodError: org.apache.spark.network.util.AbstractFileRegion.transferred Tue, 16 Jul, 04:03
xiaobo Re: NoSuchMethodError: org.apache.spark.network.util.AbstractFileRegion.transferred Tue, 16 Jul, 04:17
Jörn Franke Re: Spark SaveMode Sat, 20 Jul, 05:40
Jörn Franke Re: Logistic Regression Iterations causing High GC in Spark 2.3 Mon, 29 Jul, 07:07
Aayush Ranaut Re: Spark Write method not ignoring double quotes in the csv file Fri, 12 Jul, 04:04
Aayush Ranaut Re: Long-Running Spark application doesn't clean old shuffle data correctly Sun, 21 Jul, 06:07
Abdeali Kothari Re: [pyspark 2.3+] CountDistinct Tue, 02 Jul, 04:20
Abdeali Kothari Usage of PyArrow in Spark Wed, 17 Jul, 04:18
Abdeali Kothari Re: Usage of PyArrow in Spark Thu, 18 Jul, 07:01
Abhishek Somani New Spark Datasource for Hive ACID tables Fri, 26 Jul, 12:37
Abhishek Somani Re: New Spark Datasource for Hive ACID tables Fri, 26 Jul, 14:47
Abhishek Somani Re: New Spark Datasource for Hive ACID tables Sun, 28 Jul, 02:20
Abhishek Somani Re: New Spark Datasource for Hive ACID tables Sun, 28 Jul, 03:14
Alex A. Reda Re: Learning Spark Fri, 05 Jul, 13:49
Alex Landa Long-Running Spark application doesn't clean old shuffle data correctly Sun, 21 Jul, 06:01
Alex Landa Re: Long-Running Spark application doesn't clean old shuffle data correctly Sun, 21 Jul, 07:19
Alex Landa Re: Long-Running Spark application doesn't clean old shuffle data correctly Wed, 24 Jul, 05:34
Alexander Czech How to use HDFS >3.1.1 with spark 2.3.3 to output parquet files to S3? Sun, 14 Jul, 22:10
Amit Sharma Dynamic allocation not working Tue, 09 Jul, 01:57
Amit Sharma Re: spark standalone mode problem about executor add and removed again and again! Thu, 18 Jul, 01:56
Amit Sharma spark dataset.cache is not thread safe Sun, 21 Jul, 23:18
Amit Sharma Re: spark dataset.cache is not thread safe Tue, 23 Jul, 02:08
Amit Sharma Core allocation is scattered Thu, 25 Jul, 12:23
Anil Kulkarni Spark CSV Quote only NOT NULL Thu, 11 Jul, 20:45
Anil Kulkarni Re: Spark CSV Quote only NOT NULL Thu, 11 Jul, 23:09
Artur Sukhenko Re: event log directory(spark-history) filled by large .inprogress files for spark streaming applications Wed, 17 Jul, 15:03
Arwin Tio Parquet 'bucketBy' creates a ton of files Thu, 04 Jul, 07:22
Aslan Bakirov Unsubscribe Fri, 19 Jul, 09:40
Balakumar iyer S Spark 2.3 Dataframe Grouby operation throws IllegalArgumentException on Large dataset Mon, 22 Jul, 10:57
Balakumar iyer S Re: Spark 2.3 Dataframe Grouby operation throws IllegalArgumentException on Large dataset Wed, 24 Jul, 04:15
Bartek Dobija Re: Spark and Oozie Fri, 19 Jul, 07:23
Bill Bejeck unsubscribe Thu, 11 Jul, 18:18
Bobby Evans Re: [Beginner] Run compute on large matrices and return the result in seconds? Wed, 17 Jul, 14:06
Bobby Evans Re: Spark 2.3 Dataframe Grouby operation throws IllegalArgumentException on Large dataset Mon, 22 Jul, 13:35
Bryan Cutler Re: Usage of PyArrow in Spark Thu, 18 Jul, 17:49
Chris Teoh Re: Implementing Upsert logic Through Streaming Mon, 01 Jul, 11:10
Chris Teoh Re: Map side join without broadcast Mon, 01 Jul, 21:12
Chris Teoh Re: Learning Spark Fri, 05 Jul, 10:08
Chris Teoh Re: Attempting to avoid a shuffle on join Sat, 06 Jul, 06:37
Chris Teoh Re: Spark 2.3 Dataframe Grouby operation throws IllegalArgumentException on Large dataset Wed, 24 Jul, 10:04
Conor Begley unsubscribe Fri, 12 Jul, 08:00
Danni Wu Seeking help of UDF number-float converting Mon, 01 Jul, 22:25
Danni Wu Seeking help of UDF number-float converting Mon, 08 Jul, 22:22
Dennis Suhari Spark and Oozie Fri, 19 Jul, 07:08
Dhrubajyoti Hati Logistic Regression Iterations causing High GC in Spark 2.3 Mon, 29 Jul, 06:22
Dhrubajyoti Hati Re: Logistic Regression Iterations causing High GC in Spark 2.3 Mon, 29 Jul, 09:20
Dhrubajyoti Hati Re: Logistic Regression Iterations causing High GC in Spark 2.3 Mon, 29 Jul, 14:03
Dhrubajyoti Hati Re: Logistic Regression Iterations causing High GC in Spark 2.3 Mon, 29 Jul, 14:53
Dongjoon Hyun Release Apache Spark 2.4.4 before 3.0.0 Tue, 09 Jul, 16:15
Dongjoon Hyun Re: Release Apache Spark 2.4.4 before 3.0.0 Tue, 09 Jul, 17:11
Dongjoon Hyun Re: Release Apache Spark 2.4.4 before 3.0.0 Thu, 11 Jul, 17:31
Dongjoon Hyun Re: Release Apache Spark 2.4.4 before 3.0.0 Fri, 12 Jul, 22:18
Dongjoon Hyun Re: Release Apache Spark 2.4.4 before 3.0.0 Mon, 15 Jul, 16:04
Dongjoon Hyun Re: Re: Release Apache Spark 2.4.4 before 3.0.0 Tue, 16 Jul, 16:24
Federico D'Ambrosio State of support for dynamic allocation on K8s and possible CMs Mon, 01 Jul, 07:41
Felix Cheung Re: [PySpark] [SparkR] Is it possible to invoke a PySpark function with a SparkR DataFrame? Tue, 16 Jul, 16:11
Femi Anthony Pass row to UDF and select column based on pattern match Tue, 09 Jul, 18:25
Fiske, Danny [PySpark] [SparkR] Is it possible to invoke a PySpark function with a SparkR DataFrame? Mon, 15 Jul, 13:58
Gautham Acharya [Beginner] Run compute on large matrices and return the result in seconds? Tue, 09 Jul, 23:22
Gautham Acharya RE: [Beginner] Run compute on large matrices and return the result in seconds? Thu, 11 Jul, 16:24
Gautham Acharya RE: [Beginner] Run compute on large matrices and return the result in seconds? Wed, 17 Jul, 19:13
Gautham Acharya RE: [Beginner] Run compute on large matrices and return the result in seconds? Wed, 17 Jul, 19:42
Gautham Acharya RE: [Beginner] Run compute on large matrices and return the result in seconds? Wed, 17 Jul, 21:11
Gourav Sengupta Re: Learning Spark Fri, 05 Jul, 13:23
Gourav Sengupta Re: Pass row to UDF and select column based on pattern match Wed, 10 Jul, 07:10
Gourav Sengupta Re: Parquet 'bucketBy' creates a ton of files Wed, 10 Jul, 07:13
Gourav Sengupta Re: Set TimeOut and continue with other tasks Wed, 10 Jul, 07:16
Gourav Sengupta Re: Help: What's the biggest length of SQL that's supported in SparkSQL? Fri, 12 Jul, 16:33
Gourav Sengupta Re: Spark CSV Quote only NOT NULL Sat, 13 Jul, 19:17
Gourav Sengupta repartitionByRange and number of tasks Tue, 30 Jul, 01:05
Hieu Nguyen [No Subject] Mon, 22 Jul, 04:16
Hyukjin Kwon Re: Usage of PyArrow in Spark Wed, 17 Jul, 11:17
Information Technologies Looking for a developer to help us with a small ETL project using Spark and Kubernetes Thu, 18 Jul, 22:47
Jacek Laskowski Re: Release Apache Spark 2.4.4 before 3.0.0 Thu, 11 Jul, 20:30
Jack Kolokasis Spark and Java10 Sat, 06 Jul, 15:52
James Pirz [Spark SQL] dependencies to use test helpers Wed, 24 Jul, 22:38
Jerry Vinokurov intermittent Kryo serialization failures in Spark Wed, 10 Jul, 16:50
Jerry Vinokurov Re: Spark Newbie question Thu, 11 Jul, 17:48
Jerry Vinokurov Re: [Beginner] Run compute on large matrices and return the result in seconds? Wed, 17 Jul, 20:27
Joevu unsubscribe Thu, 18 Jul, 09:07
Julien Laurenceau Re: Spark 2.4.3 with hadoop 3.2 docker image. Sat, 06 Jul, 18:22
Kamalanathan Venkatesan Spark structural streaming sinks output late Tue, 09 Jul, 13:54
Kamalanathan Venkatesan RE: Spark structural streaming sinks output late Wed, 10 Jul, 07:20
Kazuaki Ishizaki Re: Re: Release Apache Spark 2.4.4 before 3.0.0 Tue, 16 Jul, 10:59
Keith Chapman Re: Sorting tuples with byte key and byte value Tue, 16 Jul, 00:49
Keith Chapman Re: Long-Running Spark application doesn't clean old shuffle data correctly Sun, 21 Jul, 19:49
Kurt Fehlhauer Re: Learning Spark Fri, 05 Jul, 05:02
Kurt Fehlhauer Re: Learning Spark Fri, 05 Jul, 07:37
Latha Appanna [spark standalone mode] force spark to launch driver in a specific worker in cluster mode Fri, 26 Jul, 04:43
Luca Borin Apache Spark Log4j logging applicationId Wed, 24 Jul, 05:05
Magnus Nilsson Re: Spark structural streaming sinks output late Wed, 10 Jul, 08:30
Magnus Nilsson CPU:s per task Wed, 17 Jul, 11:29
Mario Amatucci RE: Avro large binary read memory problem Tue, 23 Jul, 17:10
Matt Cheah Re: k8s orchestrating Spark service Mon, 01 Jul, 22:26
Matt Cheah Re: k8s orchestrating Spark service Mon, 01 Jul, 23:45
Matt Cheah Re: k8s orchestrating Spark service Tue, 02 Jul, 00:14
Mich Talebzadeh Re: Spark dataset to explode json string Fri, 19 Jul, 21:26