spark-user mailing list archives: October 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next »Thread · Author · Date
swetha Usage of transform for code reuse between Streaming and Batch job affects the performance ? Mon, 05 Oct, 00:59
Koert Kuipers Re: Secondary Sorting in Spark Mon, 05 Oct, 01:51
Alex Rovner Re: How to optimize group by query fired using hiveContext.sql? Mon, 05 Oct, 02:30
YaoPau Spark SQL with Hive error: "Conf non-local session path expected to be non-null;" Mon, 05 Oct, 02:41
Julius Fernandes Spark 1.5.0 Error on startup Mon, 05 Oct, 03:26
jeff saremi How to install a Spark Package? Mon, 05 Oct, 03:55
Ted Yu Re: How to install a Spark Package? Mon, 05 Oct, 04:05
satish chandra j Re: Scala Limitation - Case Class definition with more than 22 arguments Mon, 05 Oct, 05:19
Hemminger Jeff String operation in filter with a special character Mon, 05 Oct, 05:59
Justin Pihony K-Means seems biased to one center Mon, 05 Oct, 06:00
Jeff Thompson Re: performance difference between Thrift server and SparkSQL? Mon, 05 Oct, 06:21
Ramkumar V OutOfMemoryError Mon, 05 Oct, 06:56
Tamas Szuromi looking for HDP users Mon, 05 Oct, 07:23
Jean-Baptiste Onofré Re: OutOfMemoryError Mon, 05 Oct, 08:06
William Saar Graphx hangs and crashes on EdgeRDD creation Mon, 05 Oct, 08:14
Krzysztof Zarzycki Re: Store DStreams into Hive using Hive Streaming Mon, 05 Oct, 09:21
Steve Loughran Re: Spark 1.5.0 Error on startup Mon, 05 Oct, 10:00
Umesh Kacha Re: Store DStreams into Hive using Hive Streaming Mon, 05 Oct, 10:07
Ramkumar V Re: OutOfMemoryError Mon, 05 Oct, 10:18
Julius Fernandes Re: Spark 1.5.0 Error on startup Mon, 05 Oct, 10:24
Cesar Berezowski Job on Yarn not using all given capacity ends up failing Mon, 05 Oct, 10:56
Eugene Morozov StructType has more rows, than corresponding Row has objects. Mon, 05 Oct, 11:28
tarek.abouzei...@yahoo.com.INVALID Spark handling parallel requests Mon, 05 Oct, 13:16
Andreas Fritzler [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 13:22
jeff saremi RE: How to install a Spark Package? Mon, 05 Oct, 13:33
Younes Naguib Spark context on thrift server Mon, 05 Oct, 13:38
jayendra.par...@yahoo.in Error: could not find function "includePackage" Mon, 05 Oct, 13:46
Alex Rovner Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 13:59
Koen Vantomme RE: Error: could not find function "includePackage" Mon, 05 Oct, 14:08
Prateek . DStream Transformation to save JSON in Cassandra 2.1 Mon, 05 Oct, 14:14
Steve Loughran Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 14:18
Ashish Soni Re: DStream Transformation to save JSON in Cassandra 2.1 Mon, 05 Oct, 14:27
Jean-Baptiste Onofré Re: DStream Transformation to save JSON in Cassandra 2.1 Mon, 05 Oct, 14:27
Ted Yu Re: Error: could not find function "includePackage" Mon, 05 Oct, 14:33
Alex Rovner Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 14:48
Steve Loughran Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 14:54
Justin Permar save checkpoint during dataframe row iteration Mon, 05 Oct, 14:59
Andreas Fritzler Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 15:06
Lan Jiang Re: "java.io.IOException: Filesystem closed" on executors Mon, 05 Oct, 15:25
Don Drake Utility for PySpark DataFrames - smartframes Mon, 05 Oct, 15:35
mvle Spark on YARN using Java 1.8 fails Mon, 05 Oct, 15:41
Dino Fancellu GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 15:51
Saif.A.Ell...@wellsfargo.com How to change verbosity level and redirect verbosity to file? Mon, 05 Oct, 16:12
VJ Anand Custom RDD for Proprietary MPP database Mon, 05 Oct, 16:15
Ted Yu Re: Spark on YARN using Java 1.8 fails Mon, 05 Oct, 16:18
dpristin Broadcast var is null Mon, 05 Oct, 16:23
cherah30 Exception: "You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly" Mon, 05 Oct, 16:25
Ted Yu Re: Exception: "You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly" Mon, 05 Oct, 16:29
Dino Fancellu Re: GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 16:40
Denny Lee Spark Survey Results 2015 are now available Mon, 05 Oct, 16:54
Robineast Re: GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 17:01
Kristina Rogale Plazonic Where to put import sqlContext.implicits._ to be able to work on DataFrames in another file? Mon, 05 Oct, 17:05
gtanguy Spark metrics cpu/memory Mon, 05 Oct, 17:19
evg952 Pyspark 1.5.1: Error when using findSynonyms after loading Word2VecModel Mon, 05 Oct, 17:51
VJ Building RDD for a Custom MPP Database Mon, 05 Oct, 17:53
Jakub Dubovsky RDD of ImmutableList Mon, 05 Oct, 18:04
Igor Berman Re: RDD of ImmutableList Mon, 05 Oct, 18:10
java8964 RE: Building RDD for a Custom MPP Database Mon, 05 Oct, 18:14
Fernando Paladini Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 18:23
Saif.A.Ell...@wellsfargo.com Please help: Processes with HiveContext slower in cluster Mon, 05 Oct, 18:24
Umesh Kacha Re: How to optimize group by query fired using hiveContext.sql? Mon, 05 Oct, 18:28
Jakub Dubovsky Re: RDD of ImmutableList Mon, 05 Oct, 18:42
Dino Fancellu Re: GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 18:44
Davies Liu Re: Reading JSON in Pyspark throws scala.MatchError Mon, 05 Oct, 18:48
Ted Yu Re: Exception: "You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly" Mon, 05 Oct, 18:52
Anwar Rizal Re: GraphX: How can I tell if 2 nodes are connected? Mon, 05 Oct, 19:03
Fernando Paladini Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 19:15
Michael Armbrust Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 19:44
Michael Armbrust Re: String operation in filter with a special character Mon, 05 Oct, 19:50
Michael Armbrust Re: Spark context on thrift server Mon, 05 Oct, 19:52
Tathagata Das Re: Broadcast var is null Mon, 05 Oct, 19:52
Adrian Tanase Re: Usage of transform for code reuse between Streaming and Batch job affects the performance ? Mon, 05 Oct, 20:00
Fernando Paladini Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 20:04
Adrian Tanase Re: Secondary Sorting in Spark Mon, 05 Oct, 20:06
Adrian Tanase Re: RDD of ImmutableList Mon, 05 Oct, 20:11
Adrian Tanase Re: Broadcast var is null Mon, 05 Oct, 20:14
Michael Armbrust Re: "Method json([class java.util.HashMap]) does not exist" when reading JSON on PySpark Mon, 05 Oct, 20:25
Olivier Girardot Re: Lookup / Access of master data in spark streaming Mon, 05 Oct, 20:40
YaoPau Spark SQL "SELECT ... LIMIT" scans the entire Hive table? Mon, 05 Oct, 20:53
Dmitry Pristin Re: Broadcast var is null Mon, 05 Oct, 21:12
tridib Writing UDF with variable number of arguments Mon, 05 Oct, 21:26
Jeff Nadler Streaming Performance w/ UpdateStateByKey Mon, 05 Oct, 21:28
Michael Armbrust Re: Spark SQL "SELECT ... LIMIT" scans the entire Hive table? Mon, 05 Oct, 21:35
Ruslan Dautkhanov save DF to JDBC Mon, 05 Oct, 21:44
Tathagata Das Re: Streaming Performance w/ UpdateStateByKey Mon, 05 Oct, 21:46
Tathagata Das Re: Lookup / Access of master data in spark streaming Mon, 05 Oct, 21:49
Jagat Singh Re: Spark thrift service and Hive impersonation. Mon, 05 Oct, 21:51
Young, Matthew T RE: save DF to JDBC Mon, 05 Oct, 21:56
Richard Hillegas Re: save DF to JDBC Mon, 05 Oct, 22:00
Hemminger Jeff Re: spark-ec2 config files. Mon, 05 Oct, 22:06
Muhammad Ahsan ERROR: "Size exceeds Integer.MAX_VALUE" Spark 1.5 Mon, 05 Oct, 22:12
Tathagata Das Re: Store DStreams into Hive using Hive Streaming Mon, 05 Oct, 22:14
Jack Yang RE: No space left on device when running graphx job Mon, 05 Oct, 22:43
Davies Liu Re: StructType has more rows, than corresponding Row has objects. Mon, 05 Oct, 22:58
Renato Perini Re: spark-ec2 config files. Mon, 05 Oct, 23:00
Alex Rovner Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Mon, 05 Oct, 23:37
Andrew Or Re: [Spark on YARN] Multiple Auxiliary Shuffle Service Versions Tue, 06 Oct, 00:23
Chen Song question on make multiple external calls within each partition Tue, 06 Oct, 00:35
Tathagata Das Re: spark.streaming.kafka.maxRatePerPartition for direct stream Tue, 06 Oct, 01:05
Tathagata Das Re: Spark streaming job filling a lot of data in local spark nodes Tue, 06 Oct, 01:07
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next »Thread · Author · Date
Box list
Jan 2020122
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137