spark-user mailing list archives: October 2014

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · Next »Thread · Author · Date
Sonal Goyal Re: Dedup Thu, 09 Oct, 05:00
Sonal Goyal Re: Join with large data set Fri, 17 Oct, 06:06
Sonal Goyal Re: key class requirement for PairedRDD ? Fri, 17 Oct, 07:03
Sonal Goyal Re: Optimizing pairwise similarity computation or how to avoid RDD.cartesian operation ? Fri, 17 Oct, 12:02
Sonal Goyal Re: How to disable input split Sat, 18 Oct, 10:16
Sonal Goyal Re: Rdd of Rdds Wed, 22 Oct, 20:52
Sonal Goyal Java api overhead? Mon, 27 Oct, 18:25
Sonal Goyal Re: Java api overhead? Thu, 30 Oct, 04:41
Sonal Goyal Re: Doing RDD."count" in parallel , at at least parallelize it as much as possible? Fri, 31 Oct, 04:47
Sonal Goyal Re: Submiting Spark application through code Fri, 31 Oct, 06:21
Sonal Goyal Re: Using a Database to persist and load data from Fri, 31 Oct, 08:03
Sonal Goyal Re: A Spark Design Problem Fri, 31 Oct, 18:56
Sonal Goyal Re: LinearRegression and model prediction threshold Fri, 31 Oct, 18:57
Soumitra Johri RDD Indexes and how to fetch all edges with a given label Tue, 14 Oct, 18:46
Soumitra Johri Re: RDD Indexes and how to fetch all edges with a given label Tue, 14 Oct, 22:31
Soumitra Kumar Re: Kafka->HDFS to store as Parquet format Tue, 07 Oct, 17:09
Soumitra Kumar Re: Kafka->HDFS to store as Parquet format Tue, 07 Oct, 17:24
Soumitra Kumar Re: How to add HBase dependencies and conf with spark-submit? Wed, 15 Oct, 14:39
Soumitra Kumar Re: How to add HBase dependencies and conf with spark-submit? Thu, 16 Oct, 15:59
Soumitra Kumar How to name a DStream Thu, 16 Oct, 20:00
Soumitra Kumar Print dependency graph as DOT file Thu, 16 Oct, 22:26
Soumitra Siddharth Johri How to construct graph in graphx Mon, 13 Oct, 22:22
Soumya Simanta Creating a feature vector from text before using with MLLib Wed, 01 Oct, 21:18
Soumya Simanta Storing shuffle files on a Tachyon Tue, 07 Oct, 22:46
Soumya Simanta Convert a org.apache.spark.sql.SchemaRDD[Row] to a RDD of Strings Thu, 09 Oct, 20:22
Soumya Simanta Does start-slave.sh use the values in conf/slaves to launch a worker in Spark standalone cluster mode Tue, 21 Oct, 04:55
Soumya Simanta Re: Spark as Relational Database Sun, 26 Oct, 03:34
Soumya Simanta Re: Spark as Relational Database Sun, 26 Oct, 15:13
Soumya Simanta Re: Spark as Relational Database Mon, 27 Oct, 01:14
Soumya Simanta Re: scalac crash when compiling DataTypeConversions.scala Tue, 28 Oct, 01:12
Soumya Simanta Re: install sbt Tue, 28 Oct, 16:19
Soumya Simanta Re: run multiple spark applications in parallel Tue, 28 Oct, 23:15
Soumya Simanta Re: run multiple spark applications in parallel Wed, 29 Oct, 00:31
Soumya Simanta Re: sbt/sbt compile error [FATAL] Wed, 29 Oct, 11:39
Soumya Simanta SparkSQL performance Fri, 31 Oct, 23:04
Sourav Chandra Re: What's wrong with my spark filter? I get "org.apache.spark.SparkException: Task not serializable" Fri, 17 Oct, 10:41
Stephen Boesch Implicit conversion RDD -> SchemaRDD Thu, 02 Oct, 09:00
Stephen Boesch Re: Implicit conversion RDD -> SchemaRDD Thu, 02 Oct, 14:20
Stephen Boesch Setup/Cleanup for RDD closures? Fri, 03 Oct, 04:46
Stephen Boesch Building pyspark with maven? Wed, 08 Oct, 21:01
Stephen Boesch Re: Building pyspark with maven? Wed, 08 Oct, 21:12
Stephen Boesch Re: distributing Scala Map datatypes to RDD Mon, 13 Oct, 21:58
Stephen Boesch NoClassDefFoundError on ThreadFactoryBuilder in Intellij Thu, 23 Oct, 08:43
Stephen Boesch Re: scalac crash when compiling DataTypeConversions.scala Mon, 27 Oct, 03:46
Stephen Boesch Re: scalac crash when compiling DataTypeConversions.scala Mon, 27 Oct, 04:08
Stephen Boesch How to import mllib.rdd.RDDFunctions into the spark-shell Tue, 28 Oct, 09:09
Stephen Boesch Re: How to import mllib.rdd.RDDFunctions into the spark-shell Tue, 28 Oct, 09:29
Stephen Boesch Re: NoClassDefFoundError on ThreadFactoryBuilder in Intellij Tue, 28 Oct, 09:31
Stephen Boesch Re: NoClassDefFoundError on ThreadFactoryBuilder in Intellij Wed, 29 Oct, 00:18
Stephen Boesch Returned type of Broadcast variable is byte array Thu, 30 Oct, 14:42
Stephen Boesch Re: Returned type of Broadcast variable is byte array Thu, 30 Oct, 18:02
Steve Arnold Unable to share Sql between HiveContext and JDBC Thrift Server Fri, 10 Oct, 01:32
Steve Lewis A sample for generating big data - and some design questions Wed, 01 Oct, 00:16
Steve Lewis What can be done if a FlatMapFunctions generated more data that can be held in memory Thu, 02 Oct, 01:01
Steve Lewis Re: Spark and Python using generator of data bigger than RAM as input to sc.parallelize() Mon, 06 Oct, 20:39
Steve Lewis Stupid Spark question Tue, 07 Oct, 18:01
Steve Lewis anyone else seeing something like https://issues.apache.org/jira/browse/SPARK-3637 Tue, 07 Oct, 21:45
Steve Lewis Broadcast Torrent fail - then the job dies Wed, 08 Oct, 18:21
Steve Lewis Re: Broadcast Torrent fail - then the job dies Wed, 08 Oct, 21:59
Steve Lewis How to I get at a SparkContext or better a JavaSparkContext from the middle of a function Tue, 14 Oct, 23:47
Steve Lewis How do you write a JavaRDD into a single file Mon, 20 Oct, 22:13
Steve Lewis Re: How do you write a JavaRDD into a single file Mon, 20 Oct, 22:53
Steve Lewis Re: How do you write a JavaRDD into a single file Tue, 21 Oct, 16:27
Steve Lewis com.esotericsoftware.kryo.KryoException: Encountered unregistered class ID: 13994 Wed, 29 Oct, 02:43
Steve Lewis Re: com.esotericsoftware.kryo.KryoException: Encountered unregistered class ID: 13994 Wed, 29 Oct, 03:36
Steve Lewis Questions about serialization and SparkConf Wed, 29 Oct, 18:57
Steve Lewis A Spark Design Problem Fri, 31 Oct, 16:44
Steve Nunez Re: Breaking the previous large-scale sort record with Spark Fri, 10 Oct, 16:17
Stuart Horsman SparkContext UI Thu, 30 Oct, 23:30
Stuart Horsman Re: SparkContext UI Thu, 30 Oct, 23:50
Sunandan Chakraborty Help with an error Tue, 21 Oct, 02:08
Sung Hwan Chung Is RDD partition index consistent? Mon, 06 Oct, 19:33
Sung Hwan Chung Is there a way to look at RDD's lineage? Or debug a fault-tolerance error? Wed, 08 Oct, 19:01
Sung Hwan Chung Re: Is there a way to look at RDD's lineage? Or debug a fault-tolerance error? Wed, 08 Oct, 19:24
Sung Hwan Chung Re: Is there a way to look at RDD's lineage? Or debug a fault-tolerance error? Wed, 08 Oct, 22:32
Sung Hwan Chung coalesce with shuffle or repartition is not necessarily fault-tolerant Wed, 08 Oct, 22:42
Sung Hwan Chung Re: java.io.IOException Error in task deserialization Thu, 09 Oct, 04:13
Sung Hwan Chung Re: coalesce with shuffle or repartition is not necessarily fault-tolerant Thu, 09 Oct, 06:51
Sung Hwan Chung Re: coalesce with shuffle or repartition is not necessarily fault-tolerant Thu, 09 Oct, 07:11
Sung Hwan Chung Re: java.io.IOException Error in task deserialization Thu, 09 Oct, 23:28
Sung Hwan Chung Intermittent checkpointing failure. Fri, 10 Oct, 05:18
Sung Hwan Chung Re: java.io.IOException Error in task deserialization Fri, 10 Oct, 15:46
Sung Hwan Chung Spark job (not Spark streaming) doesn't delete un-needed checkpoints. Fri, 10 Oct, 16:15
Sung Hwan Chung Re: java.io.IOException Error in task deserialization Fri, 10 Oct, 18:20
Sungwook Yoon Re: Spark And Mapr Wed, 01 Oct, 23:12
Sunny Khatri Re: Fwd: Spark SQL: ArrayIndexOutofBoundsException Thu, 02 Oct, 23:06
Sunny Khatri Re: return probability \ confidence instead of actual class Mon, 06 Oct, 17:35
Sunny Khatri Re: Cannot read from s3 using "sc.textFile" Tue, 07 Oct, 16:51
Sunny Khatri Re: return probability \ confidence instead of actual class Tue, 07 Oct, 18:45
Sunny Khatri Re: Shuffle files Tue, 07 Oct, 23:02
Surendranauth Hiraman Re: Spark And Mapr Thu, 02 Oct, 00:18
Surendranauth Hiraman Re: Play framework Thu, 16 Oct, 16:42
Svend Re: Local Dev Env with Mesos + Spark Streaming on Docker: Can't submit jobs. Fri, 24 Oct, 01:34
TANG Gen The question about mount ephemeral disk in slave-setup.sh Fri, 03 Oct, 10:05
TANG Gen Re: The question about mount ephemeral disk in slave-setup.sh Fri, 03 Oct, 21:18
TANG Gen Re: Spark Monitoring with Ganglia Fri, 03 Oct, 21:34
TANG Gen Re: Spark SQL -- more than two tables for join Tue, 07 Oct, 14:11
TJ Klein Spark-Submit Python along with JAR Tue, 21 Oct, 19:57
TJ Klein Yarn-Client Python Tue, 28 Oct, 19:50
TJ Klein Re: Yarn-Client Python Tue, 28 Oct, 20:35
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · Next »Thread · Author · Date
Box list
Sep 202181
Aug 2021171
Jul 2021158
Jun 2021179
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137