spark-user mailing list archives: July 2016

RE: Spark jobs
Joaquin Alzola   RE: Spark jobs Fri, 01 Jul, 00:36
RE: Spark 2.0 Continuous Processing
kmat   RE: Spark 2.0 Continuous Processing Fri, 01 Jul, 01:01
johnzeng Looking for help about stackoverflow in spark Fri, 01 Jul, 02:03
Chanh Le   Re: Looking for help about stackoverflow in spark Fri, 01 Jul, 02:15
Re: One map per folder in spark or Hadoop
Sun Rui   Re: One map per folder in spark or Hadoop Fri, 01 Jul, 02:16
Balachandar R.A.     Re: One map per folder in spark or Hadoop Fri, 01 Jul, 02:27
Balachandar R.A.     Re: One map per folder in spark or Hadoop Thu, 07 Jul, 13:43
Deepak Sharma       Re: One map per folder in spark or Hadoop Thu, 07 Jul, 14:15
Chanh Le Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 02:29
Deepak Sharma   Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 04:01
Chanh Le     Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 04:04
Deepak Sharma       Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 04:30
Ted Yu         Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 04:38
Chanh Le           Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 07:07
Ted Yu             Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 01 Jul, 10:31
Chanh Le               Re: Why so many parquet file part when I store data in Alluxio or File? Mon, 04 Jul, 03:02
Gene Pang                 Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 08 Jul, 13:33
Chanh Le                   Re: Why so many parquet file part when I store data in Alluxio or File? Fri, 08 Jul, 14:32
HiveContext
manish jaiswal   HiveContext Fri, 01 Jul, 03:38
manish jaiswal   HiveContext Fri, 01 Jul, 10:00
Mich Talebzadeh     Re: HiveContext Fri, 01 Jul, 14:35
shiv4nsh How spark makes partition when we insert data using the Sql query, and how the permissions to the partitions is assigned.? Fri, 01 Jul, 07:43
Mich Talebzadeh   Re: How spark makes partition when we insert data using the Sql query, and how the permissions to the partitions is assigned.? Fri, 01 Jul, 08:40
manoop JavaStreamingContext.stop() hangs Fri, 01 Jul, 08:12
chandan prakash   Re: JavaStreamingContext.stop() hangs Fri, 01 Jul, 11:22
Re: Remote RPC client disassociated
Akhil Das   Re: Remote RPC client disassociated Fri, 01 Jul, 10:38
Joaquin Alzola     RE: Remote RPC client disassociated Fri, 01 Jul, 10:45
Akhil Das       Re: Remote RPC client disassociated Fri, 01 Jul, 10:55
Re: RDD to DataFrame question with JsValue in the mix
Akhil Das   Re: RDD to DataFrame question with JsValue in the mix Fri, 01 Jul, 10:42
D...@ODDO     Re: RDD to DataFrame question with JsValue in the mix Fri, 01 Jul, 12:02
Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests
Akhil Das   Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Fri, 01 Jul, 10:46
Lars Albertsson   Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Mon, 04 Jul, 12:20
swetha kasireddy     Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Thu, 07 Jul, 00:02
swetha kasireddy     Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Thu, 07 Jul, 01:14
Lars Albertsson       Re: How to spin up Kafka using docker and use for Spark Streaming Integration tests Sun, 10 Jul, 09:58
Rishabh Bhardwaj Deploying ML Pipeline Model Fri, 01 Jul, 11:54
Steve Goodman   Re: Deploying ML Pipeline Model Fri, 01 Jul, 12:44
Silvio Fiorito   Re: Deploying ML Pipeline Model Fri, 01 Jul, 14:36
Jacek Laskowski   Re: Deploying ML Pipeline Model Fri, 01 Jul, 17:23
Nick Pentreath     Re: Deploying ML Pipeline Model Fri, 01 Jul, 17:47
Jacek Laskowski       Re: Deploying ML Pipeline Model Fri, 01 Jul, 18:15
Nick Pentreath         Re: Deploying ML Pipeline Model Fri, 01 Jul, 19:16
Sean Owen           Re: Deploying ML Pipeline Model Fri, 01 Jul, 19:40
Nick Pentreath             Re: Deploying ML Pipeline Model Tue, 05 Jul, 09:36
Saurabh Sardeshpande       Re: Deploying ML Pipeline Model Fri, 01 Jul, 19:59
Nick Pentreath         Re: Deploying ML Pipeline Model Tue, 05 Jul, 09:42
Paolo Patierno Spark 2.0.0-preview ... problem with jackson core version Fri, 01 Jul, 14:24
ppatie...@live.com   Re: Spark 2.0.0-preview ... problem with jackson core version Fri, 01 Jul, 16:48
Sean Owen     Re: Spark 2.0.0-preview ... problem with jackson core version Fri, 01 Jul, 18:59
Charles Allen       Re: Spark 2.0.0-preview ... problem with jackson core version Fri, 01 Jul, 23:46
Sean Owen         Re: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 07:34
Paolo Patierno           RE: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 12:35
Sean Owen             Re: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 13:32
Paolo Patierno               RE: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 14:20
Paolo Patierno             RE: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 14:04
Sean Owen               Re: Spark 2.0.0-preview ... problem with jackson core version Sat, 02 Jul, 14:12
emiretsk How are threads created in SQL Executor? Fri, 01 Jul, 15:42
Takeshi Yamamuro   Re: How are threads created in SQL Executor? Fri, 01 Jul, 17:15
Re: Random Forest Classification
Rich Tarro   Re: Random Forest Classification Fri, 01 Jul, 16:24
Bryan Cutler     Re: Random Forest Classification Fri, 08 Jul, 17:54
Egor Pahomov Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 16:32
Jeff Zhang   Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 17:10
Egor Pahomov     Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 17:47
Takeshi Yamamuro       Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 17:53
Jeff Zhang       Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 17:54
Egor Pahomov         Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 17:59
Jeff Zhang           Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 18:12
Egor Pahomov             Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 18:24
Egor Pahomov               Re: Thrift JDBC server - why only one per machine and only yarn-client Fri, 01 Jul, 20:38
Takeshi Yamamuro                 Re: Thrift JDBC server - why only one per machine and only yarn-client Sat, 02 Jul, 07:49
Ashic Mahtab Cluster mode deployment from jar in S3 Fri, 01 Jul, 16:45
Ashic Mahtab   RE: Cluster mode deployment from jar in S3 Mon, 04 Jul, 09:36
Lohith Samaga M     RE: Cluster mode deployment from jar in S3 Mon, 04 Jul, 09:50
Ashic Mahtab   RE: Cluster mode deployment from jar in S3 Mon, 04 Jul, 10:30
Steve Loughran     Re: Cluster mode deployment from jar in S3 Mon, 11 Jul, 16:11
Ashic Mahtab   RE: Cluster mode deployment from jar in S3 Mon, 04 Jul, 16:43
Re: Aggregator (Spark 2.0) skips aggregation is zero(0 returns null
Amit Sela   Re: Aggregator (Spark 2.0) skips aggregation is zero(0 returns null Fri, 01 Jul, 21:04
Koert Kuipers     Re: Aggregator (Spark 2.0) skips aggregation is zero(0 returns null Sat, 02 Jul, 04:34
Raajen Spark driver assigning splits to incorrect workers Fri, 01 Jul, 21:46
Ted Yu   Re: Spark driver assigning splits to incorrect workers Fri, 01 Jul, 22:03
Raajen Patel     Re: Spark driver assigning splits to incorrect workers Mon, 04 Jul, 15:07
Re: output part files max size
Kali.tumm...@gmail.com   Re: output part files max size Fri, 01 Jul, 23:26
Re: Best way to merge final output part files created by Spark job
Kali.tumm...@gmail.com   Re: Best way to merge final output part files created by Spark job Fri, 01 Jul, 23:36
Lalitha MV Enforcing shuffle hash join Fri, 01 Jul, 23:56
Takeshi Yamamuro   Re: Enforcing shuffle hash join Sat, 02 Jul, 07:58
Lalitha MV     Re: Enforcing shuffle hash join Mon, 04 Jul, 19:23
Takeshi Yamamuro       Re: Enforcing shuffle hash join Tue, 05 Jul, 05:17
Lalitha MV         Re: Enforcing shuffle hash join Tue, 05 Jul, 05:28
Takeshi Yamamuro           Re: Enforcing shuffle hash join Tue, 05 Jul, 05:29
Sun Rui           Re: Enforcing shuffle hash join Tue, 05 Jul, 05:56
Lalitha MV             Re: Enforcing shuffle hash join Tue, 05 Jul, 06:44
喜之郎               Re: Enforcing shuffle hash join Tue, 05 Jul, 08:59
Kali.tumm...@gmail.com spark parquet too many small files ? Sat, 02 Jul, 00:17
nsalian   Re: spark parquet too many small files ? Sat, 02 Jul, 00:35
Kali.tumm...@gmail.com     Re: spark parquet too many small files ? Sat, 02 Jul, 01:04
Kali.tumm...@gmail.com     Re: spark parquet too many small files ? Sat, 02 Jul, 02:39
Takeshi Yamamuro       Re: spark parquet too many small files ? Sat, 02 Jul, 07:53
sri hari kali charan Tummala         Re: spark parquet too many small files ? Sat, 02 Jul, 17:36
Re: Spark ML - Java implementation of custom Transformer
Yanbo Liang   Re: Spark ML - Java implementation of custom Transformer Sat, 02 Jul, 07:23
Re: Custom Optimizer
Yanbo Liang   Re: Custom Optimizer Sat, 02 Jul, 07:28
Re: Ideas to put a Spark ML model in production
Yanbo Liang   Re: Ideas to put a Spark ML model in production Sat, 02 Jul, 07:45
Alexey Pechorin     Re: Ideas to put a Spark ML model in production Sun, 03 Jul, 09:06
Re: Get both feature importance and ROC curve from a random forest classifier
Yanbo Liang   Re: Get both feature importance and ROC curve from a random forest classifier Sat, 02 Jul, 08:04
Mathieu D     Re: Get both feature importance and ROC curve from a random forest classifier Wed, 06 Jul, 13:26
Re: Trainning a spark ml linear regresion model fail after migrating from 1.5.2 to 1.6.1
Yanbo Liang   Re: Trainning a spark ml linear regresion model fail after migrating from 1.5.2 to 1.6.1 Sat, 02 Jul, 08:19
Re: Several questions about how pyspark.ml works
Yanbo Liang   Re: Several questions about how pyspark.ml works Sat, 02 Jul, 08:30
Gil Vernik Spark-13979: issues with hadoopConf Sat, 02 Jul, 12:06
Biplob Biswas Working of Streaming Kmeans Sat, 02 Jul, 14:48
Biplob Biswas   Re: Working of Streaming Kmeans Sun, 03 Jul, 15:11
Holden Karau     Working of Streaming Kmeans Tue, 05 Jul, 19:59
latest version of Spark to work OK as Hive engine
Ashok Kumar   latest version of Spark to work OK as Hive engine Sat, 02 Jul, 18:30
Renxia Wang Bootstrap Action to Install Spark 2.0 on EMR? Sun, 03 Jul, 05:46
Holden Karau   Re: Bootstrap Action to Install Spark 2.0 on EMR? Tue, 05 Jul, 20:00
Paolo Patierno AMQP extension for Apache Spark Streaming (messaging/IoT) Sun, 03 Jul, 08:41
Darren Govoni   RE: AMQP extension for Apache Spark Streaming (messaging/IoT) Sun, 03 Jul, 12:31
Re: 'numBins' property not honoured in BinaryClassificationMetrics class when spark.default.parallelism is not set to 1
sneha29shukla   Re: 'numBins' property not honoured in BinaryClassificationMetrics class when spark.default.parallelism is not set to 1 Sun, 03 Jul, 10:04
Sean Owen   Re: 'numBins' property not honoured in BinaryClassificationMetrics class when spark.default.parallelism is not set to 1 Sun, 03 Jul, 11:36
Joaquin Alzola JAr files into python3 Sun, 03 Jul, 20:01
Mich Talebzadeh Saving parquet table as uncompressed with write.mode("overwrite"). Sun, 03 Jul, 21:42
Ted Yu   Re: Saving parquet table as uncompressed with write.mode("overwrite"). Sun, 03 Jul, 22:21
Mich Talebzadeh     Re: Saving parquet table as uncompressed with write.mode("overwrite"). Sun, 03 Jul, 22:39
Mich Talebzadeh       Re: Saving parquet table as uncompressed with write.mode("overwrite"). Sun, 03 Jul, 22:55
Arun Patel Graphframe Error Sun, 03 Jul, 22:48
Yanbo Liang   Re: Graphframe Error Mon, 04 Jul, 08:37
Felix Cheung     Re: Graphframe Error Mon, 04 Jul, 09:33
Arun Patel       Re: Graphframe Error Tue, 05 Jul, 12:37
Felix Cheung       Re: Graphframe Error Wed, 06 Jul, 04:45
Arun Patel         Re: Graphframe Error Thu, 07 Jul, 11:13
Felix Cheung         Re: Graphframe Error Fri, 08 Jul, 09:03
Pedro Rodriguez Custom RDD: Report Size of Partition in Bytes to Spark Mon, 04 Jul, 02:46
Takeshi Yamamuro   Re: Custom RDD: Report Size of Partition in Bytes to Spark Mon, 04 Jul, 04:31
Pedro Rodriguez     Re: Custom RDD: Report Size of Partition in Bytes to Spark Mon, 04 Jul, 14:32
Chanh Le How to struct data in parquet format? Mon, 04 Jul, 03:28