spark-user mailing list archives: October 2015

Conor Fennell Sporadic error after moving from kafka receiver to kafka direct stream Thu, 22 Oct, 00:23
Conor Fennell Sporadic error after moving from kafka receiver to kafka direct stream Thu, 22 Oct, 00:37
Conor Fennell Sporadic error after moving from kafka receiver to kafka direct stream Thu, 22 Oct, 13:43
Conor Fennell Sporadic error after moving from kafka receiver to kafka direct stream Thu, 22 Oct, 14:00
Cukoo Trying PCA on spark but serialization is error thrown Tue, 06 Oct, 16:43
DB Tsai Re: "Too many open files" exception on reduceByKey Thu, 08 Oct, 23:18
DB Tsai Re: What is the difference between ml.classification.LogisticRegression and mllib.classification.LogisticRegressionWithLBFGS Mon, 12 Oct, 20:32
DB Tsai Re: [SPARK MLLIB] could not understand the wrong and inscrutable result of Linear Regression codes Sun, 25 Oct, 21:14
DB Tsai Re: [SPARK MLLIB] could not understand the wrong and inscrutable result of Linear Regression codes Mon, 26 Oct, 02:25
DB Tsai Re: Spark Implementation of XGBoost Mon, 26 Oct, 23:06
DB Tsai Re: Spark Implementation of XGBoost Mon, 26 Oct, 23:07
DB Tsai Re: Spark Implementation of XGBoost Tue, 27 Oct, 08:02
DEVAN M.S. How to get Histogram of all columns in a large CSV / RDD[Array[double]] ? Wed, 21 Oct, 05:08
D..@ Gmail Re: Does feature parity exist between Scala and Python on Spark Wed, 07 Oct, 00:39
Daniel Darabos Re: Is coalesce smart while merging partitions? Thu, 08 Oct, 14:10
Daniel Haviv Insert via HiveContext is slow Thu, 08 Oct, 18:51
Daniel Haviv Re: Insert via HiveContext is slow Thu, 08 Oct, 19:08
Daniel Haviv Re: Insert via HiveContext is slow Fri, 09 Oct, 07:09
Daniel Haviv SQLContext within foreachRDD Mon, 12 Oct, 09:52
Daniel Haviv Re: SQLContext within foreachRDD Mon, 12 Oct, 10:46
Daniel Haviv Generated ORC files cause NPE in Hive Tue, 13 Oct, 18:14
Daniel Haviv HiveContext ignores ("skip.header.line.count"="1") Mon, 26 Oct, 13:32
Daniel Haviv Re: HiveContext ignores ("skip.header.line.count"="1") Tue, 27 Oct, 04:47
Daniel Li Fwd: [Streaming] join events in last 10 minutes Wed, 14 Oct, 01:29
Dave Ariens Kafka Streaming and Filtering > 3000 partitons Wed, 21 Oct, 18:50
Dave Ariens RE: Kafka Streaming and Filtering > 3000 partitons Wed, 21 Oct, 21:07
Dave Moyers Best way to use Spark UDFs via Hive (Spark Thrift Server) Fri, 23 Oct, 00:15
David Bess Install via directions in "Learning Spark". Exception when running bin/pyspark Tue, 13 Oct, 03:44
David Bess Re: Install via directions in "Learning Spark". Exception when running bin/pyspark Tue, 13 Oct, 15:26
David Mitchell Re: How to avoid Spark shuffle spill memory? Wed, 07 Oct, 03:19
David P. Kleinschmidt Spark Streaming (1.5.0) flaky when recovering from checkpoint Fri, 30 Oct, 12:44
Davies Liu Re: Reading JSON in Pyspark throws scala.MatchError Mon, 05 Oct, 18:48
Davies Liu Re: StructType has more rows, than corresponding Row has objects. Mon, 05 Oct, 22:58
Davies Liu Re: weird issue with sqlContext.createDataFrame - pyspark 1.3.1 Fri, 09 Oct, 18:04
Davies Liu Re: Handling expirying state in UDF Mon, 12 Oct, 17:12
Davies Liu Re: pyspark: results differ based on whether persist() has been called Mon, 19 Oct, 17:40
Davies Liu Re: best way to generate per key auto increment numerals after sorting Mon, 19 Oct, 17:45
Davies Liu Re: pyspark groupbykey throwing error: unpack requires a string argument of length 4 Mon, 19 Oct, 17:52
Davies Liu Re: Spark SQL Exception: Conf non-local session path expected to be non-null Tue, 20 Oct, 06:25
Dean Wampler Re: How can I disable logging when running local[*]? Thu, 08 Oct, 01:48
Dean Wampler Re: Spark 1.5.1 ClassNotFoundException in cluster mode. Wed, 14 Oct, 21:51
Dean Wampler Re: Spark Streaming: how to use StreamingContext.queueStream with existing RDD Mon, 26 Oct, 17:32
Dean Wood Re: Packaging a jar for a jdbc connection using sbt assembly and scala. Thu, 29 Oct, 12:15
Debasish Das Re: Running 2 spark application in parallel Fri, 23 Oct, 16:17
Deenar Toraskar Re: Datastore or DB for spark Sat, 10 Oct, 09:45
Deenar Toraskar Re: Running in cluster mode causes native library linking to fail Wed, 14 Oct, 05:50
Deenar Toraskar Re: OutOfMemoryError When Reading Many json Files Wed, 14 Oct, 06:11
Deenar Toraskar Re: Spark DataFrame GroupBy into List Wed, 14 Oct, 06:13
Deenar Toraskar Re: Spark SQL running totals Thu, 15 Oct, 18:35
Deenar Toraskar Re: How to have Single refernce of a class in Spark Streaming? Sat, 17 Oct, 12:13
Deenar Toraskar Re: Spark SQL Thriftserver and Hive UDF in Production Mon, 19 Oct, 15:22
Deenar Toraskar Re: Spark SQL Exception: Conf non-local session path expected to be non-null Tue, 20 Oct, 06:21
Deenar Toraskar Re: can I use Spark as alternative for gem fire cache ? Tue, 20 Oct, 09:42
Deenar Toraskar Re: Ahhhh... Spark creates >30000 partitions... What can I do? Tue, 20 Oct, 16:20
Deenar Toraskar Re: JdbcRDD Constructor Tue, 20 Oct, 16:31
Deenar Toraskar Accessing external Kerberised resources from Spark executors in Yarn client/cluster mode Thu, 22 Oct, 11:59
Deenar Toraskar Spark 1.5 on CDH 5.4.0 Thu, 22 Oct, 16:04
Deenar Toraskar Re: Maven Repository Hosting for Spark SQL 1.5.1 Thu, 22 Oct, 17:36
Deenar Toraskar Re: Best way to use Spark UDFs via Hive (Spark Thrift Server) Fri, 23 Oct, 10:08
Deenar Toraskar Re: Spark 1.5 on CDH 5.4.0 Fri, 23 Oct, 12:34
Deenar Toraskar Re: Spark 1.5 on CDH 5.4.0 Fri, 23 Oct, 16:31
Deenar Toraskar Re: Spark scala REPL - Unable to create sqlContext Mon, 26 Oct, 06:29
Deenar Toraskar Re: get host from rdd map Mon, 26 Oct, 06:42
Deenar Toraskar Re: Dynamic Resource Allocation with Spark Streaming (Standalone Cluster, Spark 1.5.1) Tue, 27 Oct, 07:23
Deenar Toraskar Re: Broadcast table Tue, 27 Oct, 07:28
Deenar Toraskar Re: get directory names that are affected by sc.textFile("path/to/dir/*/*/*.js") Tue, 27 Oct, 15:13
Deenar Toraskar Re: Packaging a jar for a jdbc connection using sbt assembly and scala. Thu, 29 Oct, 11:14
Deenar Toraskar Re: No way to supply hive-site.xml in yarn client mode? Thu, 29 Oct, 11:20
Deenar Toraskar Re: nested select is not working in spark sql Thu, 29 Oct, 11:24
Deenar Toraskar Re: Spark -- Writing to Partitioned Persistent Table Thu, 29 Oct, 12:14
Deenar Toraskar Re: No way to supply hive-site.xml in yarn client mode? Thu, 29 Oct, 14:16
Deenar Toraskar Re: No way to supply hive-site.xml in yarn client mode? Thu, 29 Oct, 14:26
Deenar Toraskar Re: No way to supply hive-site.xml in yarn client mode? Thu, 29 Oct, 15:14
Deenar Toraskar Re: No way to supply hive-site.xml in yarn client mode? Thu, 29 Oct, 15:44
Deenar Toraskar Re: Pulling data from a secured SQL database Sat, 31 Oct, 12:16
Deepak Sharma Best practises Fri, 30 Oct, 10:53
Deng Ching-Mallete Re: hiveContext sql number of tasks Wed, 07 Oct, 14:37
Deng Ching-Mallete Re: Parquet file size Thu, 08 Oct, 01:14
Deng Ching-Mallete Re: [Yarn-Client]Can not access SparkUI Mon, 26 Oct, 07:30
Deng Ching-Mallete Re: There is any way to write from spark to HBase CDH4? Tue, 27 Oct, 10:03
Deng Ching-Mallete Re: spark to hbase Tue, 27 Oct, 10:21
Deng Ching-Mallete Re: There is any way to write from spark to HBase CDH4? Tue, 27 Oct, 10:39
Deng Ching-Mallete Re: Spark Core Transitive Dependencies Wed, 28 Oct, 02:07
Deng Ching-Mallete Re: Spark Core Transitive Dependencies Thu, 29 Oct, 02:29
Deng Ching-Mallete Re: [Spark] java.lang.IllegalArgumentException: Size exceeds Integer.MAX_VALUE Fri, 30 Oct, 02:13
Deng Ching-Mallete Re: Pivot Data in Spark and Scala Fri, 30 Oct, 02:35
Deng Ching-Mallete Re: How do I parallize Spark Jobs at Executor Level. Fri, 30 Oct, 09:26
Deng Ching-Mallete Re: How do I parallize Spark Jobs at Executor Level. Fri, 30 Oct, 10:33
Denny Lee Spark Survey Results 2015 are now available Mon, 05 Oct, 16:54
Devin Huang Different partition number of GroupByKey leads different result Fri, 09 Oct, 09:05
Devin Huang Re: Different partition number of GroupByKey leads different result Fri, 09 Oct, 09:40
Devin Huang Re: Different partition number of GroupByKey leads different result Fri, 09 Oct, 10:04
Dibyendu Bhattacharya Re: Spark Streaming over YARN Fri, 02 Oct, 16:01
Dibyendu Bhattacharya Re: Spark Streaming over YARN Fri, 02 Oct, 16:21
Dibyendu Bhattacharya Re: Spark Streaming over YARN Sun, 04 Oct, 14:51
Dibyendu Bhattacharya Contributing Receiver based Low Level Kafka Consumer from Spark-Packages to Apache Spark Project Sat, 24 Oct, 15:09
Dibyendu Bhattacharya Re: Need more tasks in KafkaDirectStream Thu, 29 Oct, 06:33
Diggs, Asoka RE: from groupBy return a DataFrame without aggregation? Fri, 02 Oct, 16:49
Dilip Biswal Re: SPARK SQL Error Thu, 15 Oct, 12:13
Dino Fancellu GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 15:51