spark-user mailing list archives: May 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · Next »Thread · Author · Date
Def_Os Pandas timezone problems Thu, 21 May, 22:16
jstripit Re: NegativeArraySizeException when doing joins on skewed data Thu, 21 May, 22:47
Xiangrui Meng Re: Pandas timezone problems Fri, 22 May, 00:06
Ruslan Dautkhanov Re: PySpark Logs location Fri, 22 May, 00:22
Davies Liu Re: [pyspark] Starting workers in a virtualenv Fri, 22 May, 01:15
anneywarlord Kmeans Labeled Point RDD Fri, 22 May, 01:19
Krishna Sankar Re: Kmeans Labeled Point RDD Fri, 22 May, 01:30
Burak Yavuz Re: foreach plus accumulator Vs mapPartitions performance Fri, 22 May, 01:31
邓刚[技术中心] task all finished, while the stage marked finish long time later problem Fri, 22 May, 02:31
Dibyendu Bhattacharya Re: Spark Streaming with Tachyon : Data Loss on Receiver Failure due to WAL error Fri, 22 May, 03:35
Dani Qiu LDA prediction on new document Fri, 22 May, 03:48
donhoff_h 回复: How to use spark to access HBase with Security enabled Fri, 22 May, 06:11
SLiZn Liu Re: DataFrame Column Alias problem Fri, 22 May, 06:22
Ken Geis Re: LDA prediction on new document Fri, 22 May, 06:23
Reynold Xin Re: DataFrame Column Alias problem Fri, 22 May, 06:47
Reynold Xin Re: rdd.sample() methods very slow Fri, 22 May, 06:52
Karlson Re: [pyspark] Starting workers in a virtualenv Fri, 22 May, 06:56
Dani Qiu Re: LDA prediction on new document Fri, 22 May, 07:07
SLiZn Liu Re: DataFrame Column Alias problem Fri, 22 May, 07:21
Ted Yu Re: 回复: How to use spark to access HBase with Security enabled Fri, 22 May, 07:25
swaranga Spark Memory management Fri, 22 May, 07:31
donhoff_h 回复: 回复: How to use spark to access HBase with Security enabled Fri, 22 May, 07:33
Akhil Das Re: Spark Memory management Fri, 22 May, 07:54
董帅阳 Re: Spark Memory management Fri, 22 May, 08:00
SparknewUser MLlib: how to get the best model with only the most significant explanatory variables in LogisticRegressionWithLBFGS or LogisticRegressionWithSGD ? Fri, 22 May, 08:19
lucas Spark with cassandra Fri, 22 May, 08:21
Gautam Bajaj Re: Storing spark processed output to Database asynchronously. Fri, 22 May, 08:25
董帅阳 Re: Official Docker container for Spark Fri, 22 May, 08:32
Antonio Giambanco Spark Streaming and Drools Fri, 22 May, 08:43
Ritesh Kumar Singh Re: Official Docker container for Spark Fri, 22 May, 08:45
Karlson Partitioning of Dataframes Fri, 22 May, 09:03
Skanda Re: Issues with constants in Spark HiveQL queries Fri, 22 May, 09:06
Skanda Re: Issues with constants in Spark HiveQL queries Fri, 22 May, 09:20
Rok Roskar Re: FetchFailedException and MetadataFetchFailedException Fri, 22 May, 09:40
Evo Eftimov RE: Spark Streaming and Drools Fri, 22 May, 10:00
Antonio Giambanco Re: Spark Streaming and Drools Fri, 22 May, 10:07
Evo Eftimov RE: Spark Streaming and Drools Fri, 22 May, 10:19
Evo Eftimov RE: Spark Streaming and Drools Fri, 22 May, 10:22
Dibyendu Bhattacharya Re: Spark Streaming and Drools Fri, 22 May, 10:22
Evo Eftimov RE: Spark Streaming and Drools Fri, 22 May, 10:24
gtanguy DataFrame groupBy vs RDD groupBy Fri, 22 May, 10:35
ayan guha Re: Partitioning of Dataframes Fri, 22 May, 10:55
Ted Yu Re: 回复: 回复: How to use spark to access HBase with Security enabled Fri, 22 May, 11:25
Guillermo Ortiz Trying to connect to many topics with several DirectConnect Fri, 22 May, 11:50
Karlson Re: Partitioning of Dataframes Fri, 22 May, 12:48
Silvio Fiorito Re: Partitioning of Dataframes Fri, 22 May, 13:12
Hugo Ferreira Parallel parameter tuning: distributed execution of MLlib algorithms Fri, 22 May, 13:15
Charles Earl LDA prediction on new document Fri, 22 May, 13:28
Karlson Re: Partitioning of Dataframes Fri, 22 May, 13:57
Cesar Flores partitioning after extracting from a hive table? Fri, 22 May, 14:02
Cody Koeninger Re: Trying to connect to many topics with several DirectConnect Fri, 22 May, 14:12
Frank Staszak Re: How to use spark to access HBase with Security enabled Fri, 22 May, 15:16
Ted Yu Re: Partitioning of Dataframes Fri, 22 May, 16:11
Shay Seng Performance degradation between spark 0.9.3 and 1.3.1 Fri, 22 May, 16:43
Shay Seng Help reading Spark UI tea leaves.. Fri, 22 May, 16:59
Josh Rosen Re: Performance degradation between spark 0.9.3 and 1.3.1 Fri, 22 May, 17:23
Tristan107 How to share a (spring) singleton service with Spark? Fri, 22 May, 17:26
Justin Pihony Why is RDD to PairRDDFunctions only via implicits? Fri, 22 May, 17:26
Xin Liu Re: Compare LogisticRegression results using Mllib with those using other libraries (e.g. statsmodel) Fri, 22 May, 17:45
ayan guha Re: partitioning after extracting from a hive table? Fri, 22 May, 18:06
Reynold Xin Re: Why is RDD to PairRDDFunctions only via implicits? Fri, 22 May, 18:44
Justin Pihony Re: Why is RDD to PairRDDFunctions only via implicits? Fri, 22 May, 19:08
Imran Rashid Re: FetchFailedException and MetadataFetchFailedException Fri, 22 May, 19:29
DB Tsai Re: Compare LogisticRegression results using Mllib with those using other libraries (e.g. statsmodel) Fri, 22 May, 19:45
Andrew Otto HiveContext fails when querying large external Parquet tables Fri, 22 May, 19:51
Tathagata Das Re: Storing spark processed output to Database asynchronously. Fri, 22 May, 19:54
DB Tsai Re: MLlib: how to get the best model with only the most significant explanatory variables in LogisticRegressionWithLBFGS or LogisticRegressionWithSGD ? Fri, 22 May, 20:07
Imran Rashid Re: Help reading Spark UI tea leaves.. Fri, 22 May, 20:10
Mike Trienis Spark Streaming: all tasks running on one executor (Kinesis + Mongodb) Fri, 22 May, 20:24
yana RE: HiveContext fails when querying large external Parquet tables Fri, 22 May, 20:24
Andrew Otto Re: HiveContext fails when querying large external Parquet tables Fri, 22 May, 20:37
Evo Eftimov RE: Storing spark processed output to Database asynchronously. Fri, 22 May, 20:39
Evo Eftimov RE: Storing spark processed output to Database asynchronously. Fri, 22 May, 20:47
Mike Trienis Re: Spark Streaming: all tasks running on one executor (Kinesis + Mongodb) Fri, 22 May, 20:51
Wang, Ningjun (LNG-NPV) spark on Windows 2008 failed to save RDD to windows shared folder Fri, 22 May, 20:55
Ted Yu Re: spark on Windows 2008 failed to save RDD to windows shared folder Fri, 22 May, 21:01
Michael Armbrust Re: DataFrame groupBy vs RDD groupBy Fri, 22 May, 21:37
Tathagata Das Re: Trying to connect to many topics with several DirectConnect Fri, 22 May, 21:51
Joseph Bradley Re: MLlib: how to get the best model with only the most significant explanatory variables in LogisticRegressionWithLBFGS or LogisticRegressionWithSGD ? Fri, 22 May, 22:04
Todd Nist spark.executor.extraClassPath - Values not picked up by executors Fri, 22 May, 22:15
Brant Seibert Migrate Relational to Distributed Fri, 22 May, 22:22
Edward Sargisson Application on standalone cluster never changes state to be stopped Fri, 22 May, 22:22
Evo Eftimov Re: Spark Streaming: all tasks running on one executor (Kinesis + Mongodb) Fri, 22 May, 22:47
Tathagata Das Re: [Streaming] Non-blocking recommendation in custom receiver documentation and KinesisReceiver's worker.run blocking calll Sat, 23 May, 00:46
ogoh SparkSQL failing while writing into S3 for 'insert into table' Sat, 23 May, 00:50
Yana Kadiyska Re: spark.executor.extraClassPath - Values not picked up by executors Sat, 23 May, 01:39
tyronecai Re: Performance degradation between spark 0.9.3 and 1.3.1 Sat, 23 May, 01:44
Saiph Kappa Dynamic Allocation with Spark Streaming Sat, 23 May, 01:58
Ted Yu Re: Dynamic Allocation with Spark Streaming Sat, 23 May, 02:31
Pramod Biligiri SparkSQL query plan to Stage wise breakdown Sat, 23 May, 02:50
Saiph Kappa Re: Dynamic Allocation with Spark Streaming Sat, 23 May, 03:00
Davies Liu Re: Bigints in pyspark Sat, 23 May, 03:09
Saiph Kappa Re: Dynamic Allocation with Spark Streaming Sat, 23 May, 03:28
Jiang, Zhipeng RE: Question about Serialization in Storage Level Sat, 23 May, 03:40
Ted Yu Re: Dynamic Allocation with Spark Streaming Sat, 23 May, 03:53
donhoff_h 回复: 回复: 回复: How to use spark to access HBase with Security enabled Sat, 23 May, 10:53
Aniket Bhatnagar Re: [Streaming] Non-blocking recommendation in custom receiver documentation and KinesisReceiver's worker.run blocking calll Sat, 23 May, 11:14
Kali.tumm...@gmail.com split function on spark sql created rdd Sat, 23 May, 12:16
Joe Wass Is anyone using Amazon EC2? Sat, 23 May, 14:20
Joe Wass Is anyone using Amazon EC2? (second attempt!) Sat, 23 May, 14:24
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · Next »Thread · Author · Date
Box list
Jun 2021118
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137