spark-user mailing list archives: September 2020

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Albert Butterscotch Error while getting RDD partitions for a parquet dataframe in Spark 3 Tue, 01 Sep, 13:39
dwgw value col is not a member of org.apache.spark.rdd.RDD Wed, 02 Sep, 04:39
Filipa Sousa Adding isolation level when reading from DB2 with spark.read Wed, 02 Sep, 14:34
Luca Canali RE: Adding isolation level when reading from DB2 with spark.read Wed, 02 Sep, 15:10
Filipa Sousa RE: Adding isolation level when reading from DB2 with spark.read Wed, 02 Sep, 17:24
Jörg Strebel Re: Adding isolation level when reading from DB2 with spark.read Wed, 02 Sep, 19:23
Eric Beabes Submitting Spark Job thru REST API? Wed, 02 Sep, 20:58
Breno Arosa Re: Submitting Spark Job thru REST API? Wed, 02 Sep, 21:53
Amit Joshi Re: Submitting Spark Job thru REST API? Thu, 03 Sep, 02:46
tianlangstudio 回复:Submitting Spark Job thru REST API? Thu, 03 Sep, 07:06
András Kolbert Spark Streaming Checkpointing Thu, 03 Sep, 09:41
Eric Beabes Re: Submitting Spark Job thru REST API? Thu, 03 Sep, 18:47
Michael Segel Re: Merging Parquet Files Thu, 03 Sep, 18:52
Devi P.V Iterating all columns in a pyspark dataframe Fri, 04 Sep, 07:11
Gabor Somogyi Re: Spark Streaming Checkpointing Fri, 04 Sep, 12:09
Sean Owen Re: Iterating all columns in a pyspark dataframe Fri, 04 Sep, 12:27
András Kolbert Re: Spark Streaming Checkpointing Fri, 04 Sep, 12:42
Hamish Whittal Keeping track of how long something has been in a queue Fri, 04 Sep, 14:02
Hamish Whittal Re: Keeping track of how long something has been in a queue Fri, 04 Sep, 14:21
Ivan Petrov Spark Application REST API, looking for a way to kill specific task or executor Sat, 05 Sep, 14:41
Sandeep Patra Re: Spark Application REST API, looking for a way to kill specific task or executor Sat, 05 Sep, 15:42
Ivan Petrov Re: Spark Application REST API, looking for a way to kill specific task or executor Sat, 05 Sep, 17:15
Ankur Das Query about Spark Sun, 06 Sep, 13:30
☼ R Nair Re: Query about Spark Sun, 06 Sep, 13:43
☼ R Nair Re: Query about Spark Sun, 06 Sep, 13:45
Ankur Das Re: Query about Spark Mon, 07 Sep, 02:09
Jungtaek Lim Re: Keeping track of how long something has been in a queue Mon, 07 Sep, 06:40
jainshasha Elastic Search sink showing -1 for numOutputRows Mon, 07 Sep, 07:20
Enrico Minack Re: Query about Spark Mon, 07 Sep, 12:29
☼ R Nair Re: Query about Spark Mon, 07 Sep, 15:06
jainshasha Re: Elastic Search sink showing -1 for numOutputRows Mon, 07 Sep, 20:07
Jungtaek Lim Re: Elastic Search sink showing -1 for numOutputRows Mon, 07 Sep, 21:57
jainshasha Re: Elastic Search sink showing -1 for numOutputRows Tue, 08 Sep, 01:48
Ankur Das Re: Query about Spark Tue, 08 Sep, 04:00
Georg Heiler (TU Vienna) arbitrary state handling in python API Tue, 08 Sep, 11:21
Tom Scott [Spark Core] makeRDD() preferredLocations do not appear to be considered Tue, 08 Sep, 21:11
Joan subscribe user@spark.apache.org Wed, 09 Sep, 08:22
Ruijing Li Missing / Duplicate Data when Spark retries Thu, 10 Sep, 05:03
Rao, Abhishek (Nokia - IN/Bangalore) RE: Spark 3.0 using S3 taking long time for some set of TPC DS Queries Thu, 10 Sep, 07:26
Sean Owen Re: Missing / Duplicate Data when Spark retries Thu, 10 Sep, 13:01
Ruijing Li Re: Missing / Duplicate Data when Spark retries Thu, 10 Sep, 17:29
郑瑞峰 [ANNOUNCE] Announcing Apache Spark 3.0.1 Fri, 11 Sep, 08:52
Sean Owen Re: [DISCUSS] Spark cannot identify the problem executor Fri, 11 Sep, 12:42
Dongjoon Hyun Re: [ANNOUNCE] Announcing Apache Spark 3.0.1 Fri, 11 Sep, 12:49
Yi Wu Re: [DISCUSS] Spark cannot identify the problem executor Fri, 11 Sep, 13:24
Takeshi Yamamuro Re: [ANNOUNCE] Announcing Apache Spark 3.0.1 Fri, 11 Sep, 13:50
Gengliang Wang Re: [ANNOUNCE] Announcing Apache Spark 3.0.1 Fri, 11 Sep, 15:09
Wenchen Fan Re: [ANNOUNCE] Announcing Apache Spark 3.0.1 Fri, 11 Sep, 15:50
Teja Re: LiveListenerBus is occupying most of the Driver Memory and frequent GC is degrading the performance Fri, 11 Sep, 17:18
Teja Re: LiveListenerBus is occupying most of the Driver Memory and frequent GC is degrading the performance Fri, 11 Sep, 17:29
Tom Scott Re: [Spark Core] makeRDD() preferredLocations do not appear to be considered Sat, 12 Sep, 15:36
Yi Wu Re: [DISCUSS] Spark cannot identify the problem executor Mon, 14 Sep, 02:54
Eric Beabes Re: Submitting Spark Job thru REST API? Mon, 14 Sep, 21:50
Tarun Rajput Query /Bug Spark Streaming / Context Cleaner/ GC question Tue, 15 Sep, 21:05
Ivan Petrov Is there any good Docker container / compose with spark 2.4+ and YARN 2.8.2+ Wed, 16 Sep, 10:49
German Schiavon Structured Streaming Checkpoint Error Wed, 16 Sep, 14:11
jianyangusa Re: Spark Kafka Streaming With Transactional Messages Wed, 16 Sep, 18:39
Ricardo Martinelli de Oliveira Re: Is there any good Docker container / compose with spark 2.4+ and YARN 2.8.2+ Wed, 16 Sep, 18:44
Rishi Shah [pyspark 2.4] broadcasting DataFrame throws error Thu, 17 Sep, 04:13
Harsh Re: Spark structured streaming: periodically refresh static data frame Thu, 17 Sep, 09:46
Gabor Somogyi Re: Structured Streaming Checkpoint Error Thu, 17 Sep, 09:51
German Schiavon Re: Structured Streaming Checkpoint Error Thu, 17 Sep, 09:55
Kaden Cho unsubscribe Thu, 17 Sep, 10:22
roseyrathod456 Re: [DISCUSS] Spark cannot identify the problem executor Thu, 17 Sep, 11:12
Amit Joshi Re: [pyspark 2.4] broadcasting DataFrame throws error Fri, 18 Sep, 03:35
Shubham Chaurasia Pre query execution hook for custom datasources Fri, 18 Sep, 08:17
Vibhor Banga ( Engineering - VS) Spark streaming job not able to launch more number of executors Fri, 18 Sep, 12:19
Debabrata Ghosh Spark : Very simple query failing [Needed help please] Fri, 18 Sep, 13:10
Rishi Shah Re: [pyspark 2.4] broadcasting DataFrame throws error Sat, 19 Sep, 01:09
李继先 how to integrate hbase and hive in spark3.0.1? Sat, 19 Sep, 03:40
Amit Joshi Re: [pyspark 2.4] broadcasting DataFrame throws error Sat, 19 Sep, 05:31
mykidong UnknownHostException is thrown when spark job whose jar files will be uploaded to s3 object storage via https is submitted to kubernetes Sun, 20 Sep, 05:18
Ömer Ölmez Apache Spark Error. Sun, 20 Sep, 17:33
Hitesh Tiwari Re: UnknownHostException is thrown when spark job whose jar files will be uploaded to s3 object storage via https is submitted to kubernetes Sun, 20 Sep, 19:43
adilerman Exporting spark custom metrics via prometheus jmx exporter Mon, 21 Sep, 08:26
Rishi Shah Re: [pyspark 2.4] broadcasting DataFrame throws error Mon, 21 Sep, 15:55
Lyx 【Spark ML】How to get access of the MLlib's LogisticRegressionWithSGD after 3.0.0? Tue, 22 Sep, 05:53
Sean Owen Re: 【Spark ML】How to get access of the MLlib's LogisticRegressionWithSGD after 3.0.0? Tue, 22 Sep, 12:10
ER Spark Submit processes hanging & leaking memory Tue, 22 Sep, 19:15
Arya Ketan Is RDD.persist honoured if multiple actions are executed in parallel Wed, 23 Sep, 07:44
Sean Owen Re: Is RDD.persist honoured if multiple actions are executed in parallel Wed, 23 Sep, 12:35
Breno Arosa Bloom Filter to filter huge dataframes with PySpark Wed, 23 Sep, 14:58
Sergey Oboguev Spark watermarked aggregation query and append output mode Wed, 23 Sep, 20:47
German Schiavon Re: Spark watermarked aggregation query and append output mode Wed, 23 Sep, 21:39
Sergey Oboguev Re: Spark watermarked aggregation query and append output mode Wed, 23 Sep, 22:19
Arya Ketan Re: Is RDD.persist honoured if multiple actions are executed in parallel Thu, 24 Sep, 02:40
Marco Sassarini Edge AI with Spark Thu, 24 Sep, 07:19
ayan guha Re: Edge AI with Spark Thu, 24 Sep, 07:41
Gourav Sengupta Re: Edge AI with Spark Thu, 24 Sep, 09:42
Pedro Cardoso Distribute entire columns to executors Thu, 24 Sep, 09:51
Deepak Sharma Re: Edge AI with Spark Thu, 24 Sep, 13:16
Lalwani, Jayesh Re: Distribute entire columns to executors Thu, 24 Sep, 13:44
Gang Li Let multiple jobs share one rdd? Thu, 24 Sep, 13:52
Jeff Evans Re: Distribute entire columns to executors Thu, 24 Sep, 17:27
Andrew Mullins [Pyspark 3 Debug] Date values reset to Unix epoch Thu, 24 Sep, 18:02
Khalid Mammadov Re: Let multiple jobs share one rdd? Thu, 24 Sep, 18:17
EveLiao Re: [Pyspark 3 Debug] Date values reset to Unix epoch Thu, 24 Sep, 18:21
Andrew Mullins Re: [Pyspark 3 Debug] Date values reset to Unix epoch Thu, 24 Sep, 18:21
Michael Mior Re: Is RDD.persist honoured if multiple actions are executed in parallel Thu, 24 Sep, 18:26
javaguy Java A simple example that demonstrates that a Spark distributed cluster is faster than Spark Local Standalone Thu, 24 Sep, 18:43
Message list1 · 2 · Next »Thread · Author · Date
Box list
Jan 2021202
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137