spark-user mailing list archives: April 2019

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
ch...@inaccel.com How to speedup your Spark ML training Mon, 15 Apr, 12:21
M Bilal [GraphX] Preserving Partitions when reading from HDFS Mon, 15 Apr, 15:28
Manu Zhang   Re: [GraphX] Preserving Partitions when reading from HDFS Tue, 16 Apr, 05:00
M Bilal     Re: [GraphX] Preserving Partitions when reading from HDFS Thu, 25 Apr, 09:17
Eugene Koifman JvmPauseMonitor Mon, 15 Apr, 16:52
Arun Mahadevan   Re: JvmPauseMonitor Mon, 15 Apr, 22:21
Nikhil Chinnapa K8s-Spark client mode : Executor image not able to download application jar from driver Tue, 16 Apr, 08:20
Stavros Kontopoulos   Re: K8s-Spark client mode : Executor image not able to download application jar from driver Sat, 20 Apr, 00:01
Nikhil Chinnapa     Re: K8s-Spark client mode : Executor image not able to download application jar from driver Sun, 28 Apr, 04:03
Stavros Kontopoulos       Re: K8s-Spark client mode : Executor image not able to download application jar from driver Sun, 28 Apr, 19:07
Nikhil Chinnapa         Re: K8s-Spark client mode : Executor image not able to download application jar from driver Mon, 29 Apr, 05:26
Fwd: Issue with spark while reading from avro file
Prateek Rajput   Fwd: Issue with spark while reading from avro file Tue, 16 Apr, 11:38
Prateek Rajput     Re: Issue with spark while reading from avro file Wed, 24 Apr, 08:02
Gorka Bravo Martinez Reading RDD by (key, data) from s3 Tue, 16 Apr, 12:47
yujhe.li   Re: Reading RDD by (key, data) from s3 Wed, 17 Apr, 01:58
Rishikesh Gawade How to use same SparkSession in another app? Tue, 16 Apr, 17:57
Jacek Laskowski   Re: How to use same SparkSession in another app? Wed, 17 Apr, 02:45
Femi Anthony   Re: [External Sender] How to use same SparkSession in another app? Wed, 17 Apr, 02:56
purna pradeep Dynamic executor scaling spark/Kubernetes Tue, 16 Apr, 21:20
Balakumar iyer S An alternative logic to collaborative filtering works fine but we are facing run time issues in executing the job Wed, 17 Apr, 04:12
Ankit Khettry   Re: An alternative logic to collaborative filtering works fine but we are facing run time issues in executing the job Wed, 17 Apr, 05:05
Gorka Bravo Martinez Boto3 library send to pyspark Wed, 17 Apr, 07:11
Gourav Sengupta   Re: Boto3 library send to pyspark Wed, 17 Apr, 08:06
Gorka Bravo Martinez   RE: Boto3 library send to pyspark Wed, 17 Apr, 10:42
Sebastian Schere     Re: Boto3 library send to pyspark Wed, 17 Apr, 12:26
Gourav Sengupta     Re: Boto3 library send to pyspark Wed, 17 Apr, 20:42
rajat kumar Spark job running for long time Wed, 17 Apr, 13:22
Yeikel   Re: Spark job running for long time Wed, 17 Apr, 18:05
rajat kumar     Re: Spark job running for long time Wed, 17 Apr, 18:11
Yeikel       Re: Spark job running for long time Wed, 17 Apr, 18:13
rajat kumar         Re: Spark job running for long time Sun, 21 Apr, 09:48
Re: cache table vs. parquet table performance
Bin Fan   Re: cache table vs. parquet table performance Thu, 18 Apr, 05:34
Mike Chan autoBroadcastJoinThreshold not working as expected Thu, 18 Apr, 09:44
Mike Chan   Fwd: autoBroadcastJoinThreshold not working as expected Wed, 24 Apr, 04:38
Mike Chan     Fwd: autoBroadcastJoinThreshold not working as expected Wed, 24 Apr, 08:32
Juho Autio [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 18 Apr, 13:45
Juho Autio   Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 07:01
vincent gromakowski     Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 07:12
Khare, Ankit       Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 07:47
Juho Autio       Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 13:10
vincent gromakowski         Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 14:46
Xiao Li           Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 15:16
Juho Autio           Re: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Thu, 25 Apr, 15:16
van den Heever, Christian CC             RE: [Spark SQL]: Slow insertInto overwrite if target table has many partitions Fri, 26 Apr, 10:14
Subash Prabakar Difference between Checkpointing and Persist Thu, 18 Apr, 17:49
Jack Kolokasis   Re: Difference between Checkpointing and Persist Thu, 18 Apr, 17:57
Vadim Semenov   Re: Difference between Checkpointing and Persist Thu, 18 Apr, 18:09
Gene Pang   Re: Difference between Checkpointing and Persist Fri, 19 Apr, 20:58
Mann Du Spark-submit and no java log file generated Thu, 18 Apr, 23:20
Jason Dai BigDL and Analytics Zoo talks at upcoming Spark+AI Summit and Strata London Thu, 18 Apr, 23:35
Khare, Ankit   Re: BigDL and Analytics Zoo talks at upcoming Spark+AI Summit and Strata London Fri, 19 Apr, 13:55
Re: Error: NoSuchFieldError: HIVE_STATS_JDBC_TIMEOUT while running a Spark-Hive Job
rajiv shah   Re: Error: NoSuchFieldError: HIVE_STATS_JDBC_TIMEOUT while running a Spark-Hive Job Fri, 19 Apr, 18:05
swastik mittal Not able to convert Image binary to an image Sat, 20 Apr, 04:51
Subash Prabakar Feature engineering ETL for machine learning Sat, 20 Apr, 14:13
kanchan tewary toDebugString - RDD Logical Plan Sat, 20 Apr, 17:40
Dylan Guedes   Re: toDebugString - RDD Logical Plan Sat, 20 Apr, 18:41
kanchan tewary     Re: toDebugString - RDD Logical Plan Tue, 23 Apr, 15:48
kumar.rajat20del repartition in df vs partitionBy in df Sat, 20 Apr, 18:48
rajat kumar   Re: repartition in df vs partitionBy in df Thu, 25 Apr, 04:50
moqi     Re: repartition in df vs partitionBy in df Thu, 25 Apr, 05:01
rajat kumar       Re: repartition in df vs partitionBy in df Thu, 25 Apr, 05:10
moqi         Re: repartition in df vs partitionBy in df Thu, 25 Apr, 05:29
moqi         Re: repartition in df vs partitionBy in df Thu, 25 Apr, 05:35
Stephen Boesch How to execute non-timestamp-based aggregations in spark structured streaming? Sat, 20 Apr, 21:17
Tathagata Das   Re: How to execute non-timestamp-based aggregations in spark structured streaming? Mon, 22 Apr, 17:10
Re: Difference between 'cores' config params: spark submit on k8s
Li Gao   Re: Difference between 'cores' config params: spark submit on k8s Sat, 20 Apr, 21:43
Mich Talebzadeh Writing to Aerospike from Spark with bulk write with user authentication fails Sun, 21 Apr, 10:30
Mich Talebzadeh   Re: Writing to Aerospike from Spark with bulk write with user authentication fails Sun, 21 Apr, 17:59
Chetan Khatri Usage of Explicit Future in Spark program Sun, 21 Apr, 18:58
Rishi Shah Use derived column for other derived column in the same statement Mon, 22 Apr, 03:15
Shraddha Shah   Re: Use derived column for other derived column in the same statement Mon, 22 Apr, 03:25
Vipul Rajan     Re: Use derived column for other derived column in the same statement Mon, 22 Apr, 16:18
Rishi Shah   RDD vs Dataframe & when to persist Thu, 25 Apr, 02:50
shicheng31...@gmail.com Structured Streaming initialized with cached data or others Mon, 22 Apr, 10:00
Vipul Rajan   Re: Structured Streaming initialized with cached data or others Mon, 22 Apr, 16:03
Rishikesh Gawade Connecting to Spark cluster remotely Mon, 22 Apr, 14:52
Rishikesh Gawade   Re: Connecting to Spark cluster remotely Mon, 22 Apr, 21:25
Andrew Melo     Re: Connecting to Spark cluster remotely Mon, 22 Apr, 21:41
Chetan Khatri Update / Delete records in Parquet Mon, 22 Apr, 19:01
Jason Nerothin   Re: Update / Delete records in Parquet Mon, 22 Apr, 21:32
Chetan Khatri     Re: Update / Delete records in Parquet Tue, 23 Apr, 03:56
Khare, Ankit       Re: Update / Delete records in Parquet Tue, 23 Apr, 08:35
Qian He Spark LogisticRegression got stuck on dataset with millions of columns Tue, 23 Apr, 00:02
Weichen Xu   Re: Spark LogisticRegression got stuck on dataset with millions of columns Tue, 23 Apr, 22:15
Qian He     Re: Spark LogisticRegression got stuck on dataset with millions of columns Tue, 23 Apr, 23:10
Weichen Xu       Re: Spark LogisticRegression got stuck on dataset with millions of columns Tue, 23 Apr, 23:35
yutaochina can't download 2.4.1 sourcecode Tue, 23 Apr, 03:54
Andrew Melo   Re: can't download 2.4.1 sourcecode Tue, 23 Apr, 03:56
1101300123     Re: can't download 2.4.1 sourcecode Tue, 23 Apr, 04:10
Koert Kuipers spark 2.4.1 -> 3.0.0-SNAPSHOT mllib Tue, 23 Apr, 22:38
Shyam P spark stddev() giving '?' as output how to handle it ? i.e replace null/0 Wed, 24 Apr, 06:28
Shyam P   Re: spark stddev() giving '?' as output how to handle it ? i.e replace null/0 Wed, 24 Apr, 06:48
kanchan tewary Handle empty partitions in pyspark Wed, 24 Apr, 06:31
Shubham Chaurasia DataFrameWriter does not adjust spark.sql.session.timeZone offset while writing orc files Wed, 24 Apr, 10:29
Wenchen Fan   Re: DataFrameWriter does not adjust spark.sql.session.timeZone offset while writing orc files Wed, 24 Apr, 12:53
Shubham Chaurasia     Re: DataFrameWriter does not adjust spark.sql.session.timeZone offset while writing orc files Wed, 24 Apr, 13:18
Wenchen Fan       Re: DataFrameWriter does not adjust spark.sql.session.timeZone offset while writing orc files Wed, 24 Apr, 15:32
SNEHASISH DUTTA Handle Null Columns in Spark Structured Streaming Kafka Wed, 24 Apr, 14:24
Shixiong(Ryan) Zhu   Re: Handle Null Columns in Spark Structured Streaming Kafka Mon, 29 Apr, 22:56
Jason Nerothin     Re: Handle Null Columns in Spark Structured Streaming Kafka Mon, 29 Apr, 23:27
Jason Nerothin       Re: Handle Null Columns in Spark Structured Streaming Kafka Mon, 29 Apr, 23:28
SNEHASISH DUTTA         Re: Handle Null Columns in Spark Structured Streaming Kafka Tue, 30 Apr, 10:28
kineret M 'No plan for EventTimeWatermark' error while using structured streaming with column pruning (spark 2.3.1) Wed, 24 Apr, 14:30
Jeff Evans Is it possible to obtain the full command to be invoked by SparkLauncher? Wed, 24 Apr, 20:54
Marcelo Vanzin   Re: Is it possible to obtain the full command to be invoked by SparkLauncher? Wed, 24 Apr, 20:58
Marcelo Vanzin     Re: Is it possible to obtain the full command to be invoked by SparkLauncher? Wed, 24 Apr, 21:18
Jeff Evans       Re: Is it possible to obtain the full command to be invoked by SparkLauncher? Thu, 25 Apr, 02:01
Sebastian Piu   Re: Is it possible to obtain the full command to be invoked by SparkLauncher? Wed, 24 Apr, 21:08
Rishi Shah [pyspark] Use output of one aggregated function for another aggregated function within the same groupby Thu, 25 Apr, 03:07
Georg Heiler   Re: [pyspark] Use output of one aggregated function for another aggregated function within the same groupby Thu, 25 Apr, 04:00
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Sep 202180
Aug 2021171
Jul 2021158
Jun 2021179
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137