spark-user mailing list archives: August 2015

Deepesh Maheshwari Slow Mongo Read from Spark Mon, 31 Aug, 06:56
Deepesh Maheshwari Re: Slow Mongo Read from Spark Mon, 31 Aug, 07:29
Deepesh Maheshwari Re: Slow Mongo Read from Spark Mon, 31 Aug, 07:44
Deepesh Maheshwari Re: Slow Mongo Read from Spark Mon, 31 Aug, 09:43
Deepesh Maheshwari Write Concern used in Mongo-Hadoop Connector Mon, 31 Aug, 11:39
Dhaval Gmail Re: How to list all dataframes and RDDs available in current session? Tue, 25 Aug, 03:02
Dhaval Patel How to add a new column with date duration from 2 date columns in a dataframe Thu, 20 Aug, 12:18
Dhaval Patel Re: How to add a new column with date duration from 2 date columns in a dataframe Thu, 20 Aug, 12:26
Dhaval Patel Re: How to add a new column with date duration from 2 date columns in a dataframe Thu, 20 Aug, 12:57
Dhaval Patel Re: SparkSQL concerning materials Thu, 20 Aug, 16:29
Dhaval Patel How to list all dataframes and RDDs available in current session? Thu, 20 Aug, 16:49
Dhaval Patel Re: How to list all dataframes and RDDs available in current session? Thu, 20 Aug, 17:04
Dhaval Patel DataFrame/JDBC very slow performance Mon, 24 Aug, 15:17
Dhaval Patel Re: DataFrame/JDBC very slow performance Wed, 26 Aug, 15:14
Dhaval Patel Re: How to add a new column with date duration from 2 date columns in a dataframe Wed, 26 Aug, 15:29
Dibyendu Bhattacharya Re: Reliable Streaming Receiver Thu, 06 Aug, 01:23
Dibyendu Bhattacharya Re: spark streaming 1.3 kafka error Sat, 22 Aug, 16:28
Dibyendu Bhattacharya Re: BlockNotFoundException when running spark word count on Tachyon Wed, 26 Aug, 07:02
Dibyendu Bhattacharya Re: BlockNotFoundException when running spark word count on Tachyon Wed, 26 Aug, 07:05
Dibyendu Bhattacharya Just Released V1.0.4 Low Level Receiver Based Kafka-Spark-Consumer in Spark Packages having built-in Back Pressure Controller Wed, 26 Aug, 16:32
Dick Davies is there a 'knack' to docker and mesos? Sun, 23 Aug, 19:20
Dimitris Kouzis - Loukas Streaming and calculated-once semantics Wed, 05 Aug, 17:46
Dimitris Kouzis - Loukas Re: Pause Spark Streaming reading or sampling streaming data Thu, 06 Aug, 00:27
Dimitris Kouzis - Loukas Re: Pause Spark Streaming reading or sampling streaming data Thu, 06 Aug, 09:32
Dimitris Kouzis - Loukas Re: Pause Spark Streaming reading or sampling streaming data Thu, 06 Aug, 09:33
Dino Fancellu Local Spark talking to remote HDFS? Mon, 24 Aug, 18:46
Dino Fancellu Re: Local Spark talking to remote HDFS? Mon, 24 Aug, 20:10
Dino Fancellu Where is Redgate's HDFS explorer? Mon, 24 Aug, 22:13
Dino Fancellu Re: Local Spark talking to remote HDFS? Tue, 25 Aug, 11:49
Dino Fancellu Re: Where is Redgate's HDFS explorer? Sat, 29 Aug, 11:04
Dmitry Goldenberg How to fix OutOfMemoryError: GC overhead limit exceeded when using Spark Streaming checkpointing Mon, 10 Aug, 15:57
Dmitry Goldenberg Re: How to fix OutOfMemoryError: GC overhead limit exceeded when using Spark Streaming checkpointing Mon, 10 Aug, 16:34
Dmitry Goldenberg Re: How to fix OutOfMemoryError: GC overhead limit exceeded when using Spark Streaming checkpointing Mon, 10 Aug, 16:49
Dmitry Goldenberg Re: How to fix OutOfMemoryError: GC overhead limit exceeded when using Spark Streaming checkpointing Mon, 10 Aug, 20:24
Dmitry Goldenberg Re: How to fix OutOfMemoryError: GC overhead limit exceeded when using Spark Streaming checkpointing Mon, 10 Aug, 20:33
Dmitry Goldenberg Re: Checkpointing doesn't appear to be working for direct streaming from Kafka Fri, 14 Aug, 19:31
Dmitry Goldenberg Re: Checkpointing doesn't appear to be working for direct streaming from Kafka Sat, 15 Aug, 01:19
Du Li Re: Finding the number of executors. Fri, 21 Aug, 21:44
Elkhan Dadashov Re: How does the # of tasks affect # of threads? Tue, 04 Aug, 17:47
Emma Boya Peng ClassCastException when saving a DataFrame to parquet file (saveAsParquetFile, Spark 1.3.1) using Scala Fri, 21 Aug, 07:15
Emma Boya Peng ClassCastException when saving a DataFrame to parquet file (saveAsParquetFile, Spark 1.3.1) using Scala Fri, 21 Aug, 07:25
Enno Shioji Re: Twitter live Streaming Tue, 04 Aug, 08:39
Eric Bless Problems getting expected results from hbase_inputformat.py Fri, 07 Aug, 21:03
Eric Bless Re: Problems getting expected results from hbase_inputformat.py Mon, 10 Aug, 18:08
Eric Bless Boosting spark.yarn.executor.memoryOverhead Tue, 11 Aug, 21:40
Eric Friedman Re: build spark 1.4.1 with JDK 1.6 Tue, 25 Aug, 02:45
Eric Friedman Re: build spark 1.4.1 with JDK 1.6 Tue, 25 Aug, 17:31
Eric Walker adding a custom Scala RDD for use in PySpark Tue, 11 Aug, 22:20
Eric Walker registering an empty RDD as a temp table in a PySpark SQL context Mon, 17 Aug, 19:53
Eric Walker bulk upload to Elasticsearch and shuffle behavior Mon, 31 Aug, 23:09
Eugene Morozov Re: Debugging Spark job in Eclipse Wed, 05 Aug, 14:00
Eugene Morozov Re: How to distribute non-serializable object in transform task or broadcast ? Fri, 07 Aug, 15:51
Eugene Morozov Re: How to distribute non-serializable object in transform task or broadcast ? Fri, 07 Aug, 17:44
Eugene Morozov Re: grouping by a partitioned key Tue, 11 Aug, 22:27
Eugene Morozov Re: Possible issue for Spark SQL/DataFrame Wed, 12 Aug, 11:50
Eugene Morozov Does Spark optimization might miss to run transformation? Wed, 12 Aug, 14:06
Eugene Morozov Re: Sorted Multiple Outputs Thu, 13 Aug, 00:06
Eugene Morozov Re: using Spark or pig group by efficient in my use case? Thu, 13 Aug, 13:24
Eugene Morozov Re: DataFrame column structure change Thu, 13 Aug, 13:45
Eugene Morozov Eviction of RDD persisted on disk Thu, 13 Aug, 14:15
Eugene Morozov DataFrame. SparkPlan / Project serialization issue: ArrayIndexOutOfBounds. Fri, 21 Aug, 10:37
Ewan Higgs Re: How to increase the Json parsing speed Fri, 28 Aug, 07:42
Ewan Leith RE: Specifying the role when launching an AWS spark cluster using spark_ec2 Fri, 07 Aug, 10:51
Ewan Leith Parquet file organisation for 100GB+ dataframes Wed, 12 Aug, 10:28
Ewan Leith Create column in nested structure? Thu, 13 Aug, 14:44
Ewan Leith RE: Create column in nested structure? Thu, 13 Aug, 14:54
Ewan Leith Selecting different levels of nested data records during one select? Thu, 27 Aug, 09:08
Ewan Leith RE: Selecting different levels of nested data records during one select? Thu, 27 Aug, 10:52
Ewan Leith RE: Driver running out of memory - caused by many tasks? Thu, 27 Aug, 11:09
Ewan Leith RE: How to increase the Json parsing speed Fri, 28 Aug, 09:04
Ewan Leith RE: correct use of DStream foreachRDD Fri, 28 Aug, 14:43
Eyal Fink spark on mesos with docker from private repository Thu, 06 Aug, 05:45
Fabrice Sznajderman Re: How does the # of tasks affect # of threads? Sat, 01 Aug, 21:33
Fang, Mike control the number of reducers for groupby in data frame Wed, 05 Aug, 05:47
Fang, Mike Re: control the number of reducers for groupby in data frame Wed, 05 Aug, 13:03
Felix Cheung RE: SparkR csv without headers Fri, 21 Aug, 19:57
Felix Cheung RE: SparkR: exported functions Wed, 26 Aug, 07:08
Felix Neutatz Fwd: Issue with building Spark v1.4.1-rc4 with Scala 2.11 Wed, 26 Aug, 14:07
Feynman Liang Re: Label based MLLib MulticlassMetrics is buggy Wed, 05 Aug, 16:57
Feynman Liang Re: Label based MLLib MulticlassMetrics is buggy Wed, 05 Aug, 16:57
Feynman Liang Re: Label based MLLib MulticlassMetrics is buggy Wed, 05 Aug, 20:16
Feynman Liang Re: miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 16:05
Feynman Liang Re: Spark MLib v/s SparkR Fri, 07 Aug, 16:43
Feynman Liang Re: miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 18:24
Feynman Liang Re: miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 20:34
Feynman Liang Re: mllib on (key, Iterable[Vector]) Tue, 11 Aug, 21:07
Feynman Liang Re: MLlib Prefixspan implementation Tue, 25 Aug, 04:15
Feynman Liang Re: CHAID Decision Trees Tue, 25 Aug, 18:12
Feynman Liang Re: Adding/subtracting org.apache.spark.mllib.linalg.Vector in Scala? Tue, 25 Aug, 18:23
Feynman Liang Re: CHAID Decision Trees Wed, 26 Aug, 05:40
Feynman Liang Re: MLlib Prefixspan implementation Wed, 26 Aug, 07:15
Feynman Liang Re: Spark MLLIB multiclass calssification Sun, 30 Aug, 05:32
Feynman Liang Re: How to generate spark assembly (jar file) using Intellij Sun, 30 Aug, 05:33
Feynman Liang Re: Spark MLLIB multiclass calssification Sun, 30 Aug, 05:51
Filli Alem AW: Twitter live Streaming Tue, 04 Aug, 12:29
Florian M [SparkR] How to perform a for loop on a DataFrame object Thu, 20 Aug, 10:10
Ford Farline Re: Problem submiting an script .py against an standalone cluster. Tue, 04 Aug, 21:36
Franc Carter subscribe Thu, 06 Aug, 05:51
Franc Carter SparkR csv without headers Wed, 19 Aug, 05:48
Franc Carter Re: SparkR csv without headers Fri, 21 Aug, 05:30