spark-user mailing list archives: January 2018

Message list: 1 · 2 · 3 · 4 · Next » | Thread · Author · Date
446463...@qq.com use kafka streams API aggregate ? Tue, 30 Jan, 14:48
446463...@qq.com Reply: Re: use kafka streams API aggregate ? Tue, 30 Jan, 15:08
☼ R Nair (रविशंकर नायर) Re: Spark Tuning Tool Tue, 23 Jan, 23:34
张万新 spark.sql.adaptive.enabled has no effect Tue, 30 Jan, 12:26
郭鹏飞 Re: use kafka streams API aggregate ? Tue, 30 Jan, 15:06
韩盼 unsubscribe Mon, 29 Jan, 06:28
Sara Galindo Martínez Logback + Spark 2.2.0 Thu, 11 Jan, 15:02
Onur EKİNCİ Run jobs in parallel in standalone mode Tue, 16 Jan, 08:00
강민우 Is there alternative HiveStoragePredicateHandler#decomposePredicate? Fri, 12 Jan, 05:43
Onur EKİNCİ RE: Run jobs in parallel in standalone mode Tue, 16 Jan, 09:01
Onur EKİNCİ RE: Run jobs in parallel in standalone mode Tue, 16 Jan, 10:06
Onur EKİNCİ RE: Run jobs in parallel in standalone mode Tue, 16 Jan, 11:16
Onur EKİNCİ RE: Run jobs in parallel in standalone mode Tue, 16 Jan, 12:19
Onur EKİNCİ RE: Run jobs in parallel in standalone mode Tue, 16 Jan, 13:01
namesuperwood uncontinuous offset in kafka will cause the spark streaming failure Wed, 24 Jan, 05:48
namesuperwood Re: uncontinuous offset in kafka will cause the spark streaming failure Wed, 24 Jan, 06:45
namesuperwood Re: uncontinuous offset in kafka will cause the spark streaming failure Wed, 24 Jan, 07:19
Jörn Franke Re: [Spark SQL] How to run a custom meta query for `ANALYZE TABLE` Wed, 03 Jan, 06:22
Jörn Franke Re: 3rd party hadoop input formats for EDI formats Mon, 15 Jan, 18:42
Jörn Franke Re: [Spark ML] Positive-Only Training Classification in Scala Mon, 15 Jan, 19:04
Jörn Franke Re: Spark Streaming not reading missed data Tue, 16 Jan, 21:16
Jörn Franke Re: Saving each line of RDD as a separate file with key as the file name Sat, 20 Jan, 22:03
Jörn Franke Re: Reading Hive RCFiles? Sat, 20 Jan, 23:55
Jörn Franke Re: Processing huge amount of data from paged API Sun, 21 Jan, 21:26
Jörn Franke Re: run spark job in yarn cluster mode as specified user Mon, 22 Jan, 13:28
Jörn Franke Re: S3 token times out during data frame "write.csv" Tue, 23 Jan, 23:03
Jörn Franke Re: S3 token times out during data frame "write.csv" Sun, 28 Jan, 08:20
Aakash Basu Spark MLLib vs. SciKitLearn Fri, 19 Jan, 13:42
Aakash Basu Re: Spark MLLib vs. SciKitLearn Sat, 20 Jan, 17:44
Aakash Basu Is there any Spark ML or MLLib API for GINI for Model Evaluation? Please help! [EOM] Mon, 22 Jan, 07:04
Aakash Basu [Help] Converting a Python Numpy code into Spark using RDD Mon, 22 Jan, 07:37
Aakash Basu [Doubt] GridSearch for Hyperparameter Tuning in Spark Tue, 30 Jan, 12:31
Alex Nastetsky Dataset API inconsistencies Wed, 10 Jan, 00:45
Alonso Isidoro Roman Re: Testing Spark-Cassandra Wed, 17 Jan, 16:19
Andrew Ash Re: Palantir replease under org.apache.spark? Tue, 09 Jan, 19:27
Antoine Bonnin Optimize sort merge join Sat, 27 Jan, 14:17
Anton Puzanov Current way of using functions.window with Java Tue, 02 Jan, 14:05
Anton Puzanov Using window function works extremely slowly Mon, 22 Jan, 08:59
Anu B Nair Java heap space OutOfMemoryError in pyspark spark-submit (spark version:2.2) Fri, 05 Jan, 06:08
Anu B Nair unsubscribe Wed, 17 Jan, 05:20
Anu B Nair Unsubscribe Fri, 19 Jan, 06:11
Arnav kumar Type Casting Error in Spark Data Frame Mon, 29 Jan, 21:26
Arnav kumar Issue with Cast in Spark Sql Wed, 31 Jan, 02:48
Bill Schwanitz Re: Run jobs in parallel in standalone mode Tue, 16 Jan, 12:39
Biplob Biswas Prefer Structured Streaming over Spark Streaming (DStreams)? Wed, 31 Jan, 10:35
Bogdan Cojocar Spark structured streaming time series forecasting Mon, 08 Jan, 15:04
Bryan Cutler Re: Timestamp changing while writing Mon, 15 Jan, 21:57
Bryan Cutler Re: ML:One vs Rest with crossValidator for multinomial in logistic regression Tue, 30 Jan, 22:10
CCInCharge Custom Catalyst Optimizer Strategy for DataFrame Writes? Sat, 27 Jan, 23:17
Chandu Spark Standalone Mode, application runs, but executor is killed Fri, 26 Jan, 03:09
Chandu Re: Best active groups, forums or contacts for Spark ? Fri, 26 Jan, 13:18
Chandu Re: Spark Standalone Mode, application runs, but executor is killed Fri, 26 Jan, 14:34
Chandu Re: Spark Standalone Mode, application runs, but executor is killed Fri, 26 Jan, 14:35
Chandu Re: Spark Standalone Mode, application runs, but executor is killed Fri, 26 Jan, 14:36
Chandu Re: Spark Standalone Mode, application runs, but executor is killed Fri, 26 Jan, 14:41
Chandu Re: Best active groups, forums or contacts for Spark ? Fri, 26 Jan, 14:42
Christiaan Ras Re: [Spark structured streaming] Use of (flat)mapgroupswithstate takes long time Tue, 23 Jan, 10:45
Christiaan Ras [Structured streaming] Merging streaming with semi-static datasets Tue, 23 Jan, 11:32
Christopher Piggott binaryFiles() on directory full of directories Mon, 08 Jan, 15:03
Christopher Piggott Spark MakeRDD preferred workers Mon, 08 Jan, 20:51
Cody Koeninger Re: "Got wrong record after seeking to offset" issue Thu, 18 Jan, 04:12
Cody Koeninger Re: "Got wrong record after seeking to offset" issue Thu, 18 Jan, 20:39
Cody Koeninger Re: uncontinuous offset in kafka will cause the spark streaming failure Wed, 24 Jan, 17:59
Cody Koeninger Re: Providing Kafka configuration as Map of Strings Wed, 24 Jan, 22:31
Conconscious Spark querying C* in Scala Mon, 22 Jan, 13:43
Conconscious Re: Spark querying C* in Scala Tue, 23 Jan, 12:22
Conconscious Custom build - missing images on MasterWebUI Thu, 25 Jan, 18:09
David Rosenstrauch How to hold some data in memory while processing rows in a DataFrame? Tue, 23 Jan, 03:24
David Rosenstrauch Re: How to hold some data in memory while processing rows in a DataFrame? Tue, 23 Jan, 18:52
David Rosenstrauch Re: How to hold some data in memory while processing rows in a DataFrame? Tue, 23 Jan, 18:52
Deepak Sharma CI/CD for spark and scala Thu, 25 Jan, 03:52
Dibyendu Bhattacharya why groupByKey still shuffle if SQL does "Distribute By" on same columns ? Wed, 31 Jan, 03:51
Divya Gehlot Spark Structured Streaming for Twitter Streaming data Wed, 31 Jan, 07:26
Dongjoon Hyun Vectorized ORC Reader in Apache Spark 2.3 with Apache ORC 1.4.1. Wed, 10 Jan, 19:14
Dongjoon Hyun Re: Vectorized ORC Reader in Apache Spark 2.3 with Apache ORC 1.4.1. Sun, 28 Jan, 18:40
Donni Khan Singular Value Decomposition (SVD) in Spark Java Wed, 31 Jan, 13:55
Esa Heikkinen Spark and CEP type examples Mon, 22 Jan, 11:38
Esa Heikkinen Best active groups, forums or contacts for Spark ? Fri, 26 Jan, 11:15
Eyal Zituny Re: Run jobs in parallel in standalone mode Tue, 16 Jan, 09:12
Eyal Zituny Re: Run jobs in parallel in standalone mode Tue, 16 Jan, 12:29
Fawze Abujaber Re: Spark job failing on jackson dependencies Sat, 06 Jan, 20:50
Fawze Abujaber Re: Spark application on yarn cluster clarification Thu, 18 Jan, 08:20
Fawze Abujaber Scala version changed in spark job Thu, 25 Jan, 06:40
Felix Cheung Re: Is Apache Spark-2.2.1 compatible with Hadoop-3.0.0 Mon, 08 Jan, 23:33
Felix Cheung Re: py4j.protocol.Py4JJavaError: An error occurred while calling o794.parquet Thu, 11 Jan, 00:43
Fernando Pereira End of Stream errors in shuffle Mon, 15 Jan, 10:32
Franco Victorio Semi-supervised learning in MLlib Sat, 27 Jan, 17:29
Gengliang Wang Re: Inner join with the table itself Mon, 15 Jan, 10:27
Georg Heiler Re: [Spark ML] Positive-Only Training Classification in Scala Mon, 15 Jan, 19:55
Georg Heiler Re: [Spark ML] Positive-Only Training Classification in Scala Mon, 15 Jan, 20:07
Gerard Maas Re: can HDFS be a streaming source like Kafka in Spark 2.2.0? Mon, 15 Jan, 23:45
Gevorg Hari Spark MLlib Question - Online Scoring of PipelineModel Fri, 05 Jan, 18:23
Gourav Sengupta Re: Spark on EMR suddenly stalling Tue, 02 Jan, 10:14
Gourav Sengupta Re: Spark Monitoring using Jolokia Tue, 09 Jan, 13:18
Gourav Sengupta Re: PIG to Spark Tue, 09 Jan, 13:31
Gourav Sengupta Re: can HDFS be a streaming source like Kafka in Spark 2.2.0? Tue, 16 Jan, 03:47
Gourav Sengupta Re: can HDFS be a streaming source like Kafka in Spark 2.2.0? Tue, 16 Jan, 05:11
Gourav Sengupta Re: Get broadcast (set in one method) in another method Thu, 25 Jan, 23:00
Gourav Sengupta Re: S3 token times out during data frame "write.csv" Sun, 28 Jan, 05:49
Guillermo Ortiz Testing Spark-Cassandra Wed, 17 Jan, 15:48