spark-user mailing list archives: August 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · Next »Thread · Author · Date
Masf SQLContext load. Filtering files Wed, 19 Aug, 17:16
Masf Spark 1.3. Insert into hive parquet partitioned table from DataFrame Thu, 20 Aug, 10:25
Masf Re: SQLContext load. Filtering files Thu, 27 Aug, 10:51
MasterSergius Run scala code with spark submit Thu, 20 Aug, 17:07
Matt Forbes Input size increasing every iteration of gradient boosted trees [1.4] Thu, 13 Aug, 21:04
Matt Forbes Re: Input size increasing every iteration of gradient boosted trees [1.4] Fri, 14 Aug, 00:29
Matt Narrell Re: Spark RDD join with CassandraRDD Tue, 25 Aug, 15:35
Meihua Wu Does RDD.cartesian involve shuffling? Mon, 03 Aug, 16:56
Meihua Wu Re: Does RDD.cartesian involve shuffling? Tue, 04 Aug, 16:25
Meihua Wu Re: miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 18:16
Meihua Wu Re: miniBatchFraction for LinearRegressionWithSGD Fri, 07 Aug, 19:44
Michael Albert Re: How to avoid executor time out on yarn spark while dealing with large shuffle skewed data? Fri, 21 Aug, 10:54
Michael Armbrust Re: how to ignore MatchError then processing a large json file in spark-sql Mon, 03 Aug, 20:43
Michael Armbrust Re: how to convert a sequence of TimeStamp to a dataframe Mon, 03 Aug, 20:53
Michael Armbrust Re: shutdown local hivecontext? Mon, 03 Aug, 22:56
Michael Armbrust Re: Spark SQL support for Hive 0.14 Tue, 04 Aug, 18:23
Michael Armbrust Re: Spark SQL Hive - merge small files Wed, 05 Aug, 17:02
Michael Armbrust Re: Spark SQL query AVRO file Fri, 07 Aug, 18:32
Michael Armbrust Re: Spark SQL query AVRO file Fri, 07 Aug, 18:44
Michael Armbrust Re: Pagination on big table, splitting joins Mon, 10 Aug, 18:31
Michael Armbrust Re: How to use custom Hadoop InputFormat in DataFrame? Mon, 10 Aug, 18:34
Michael Armbrust Re: Spark inserting into parquet files with different schema Mon, 10 Aug, 18:36
Michael Armbrust Re: Spark inserting into parquet files with different schema Mon, 10 Aug, 19:44
Michael Armbrust Re: Is there any external dependencies for lag() and lead() when using data frames? Mon, 10 Aug, 21:03
Michael Armbrust Re: Does Spark optimization might miss to run transformation? Wed, 12 Aug, 18:29
Michael Armbrust Re: About Databricks's spark-sql-perf Thu, 13 Aug, 18:13
Michael Armbrust Re: Json Serde used by Spark Sql Tue, 18 Aug, 20:54
Michael Armbrust Re: Does spark sql support column indexing Wed, 19 Aug, 18:49
Michael Armbrust Re: DataFrameWriter.jdbc is very slow Thu, 20 Aug, 19:03
Michael Armbrust Re: Data frame created from hive table and its partition Thu, 20 Aug, 19:05
Michael Armbrust Re: Spark Sql behaves strangely with tables with a lot of partitions Mon, 24 Aug, 02:16
Michael Armbrust Re: SparkSQL concerning materials Mon, 24 Aug, 02:18
Michael Armbrust Re: Drop table and Hive warehouse Mon, 24 Aug, 15:43
Michael Armbrust Re: Spark Sql behaves strangely with tables with a lot of partitions Mon, 24 Aug, 18:12
Michael Armbrust Re: Spark Sql behaves strangely with tables with a lot of partitions Mon, 24 Aug, 19:18
Michael Armbrust Re: Spark Sql behaves strangely with tables with a lot of partitions Mon, 24 Aug, 19:22
Michael Armbrust Re: Array Out OF Bound Exception Mon, 24 Aug, 21:23
Michael Armbrust Re: DataFrame/JDBC very slow performance Mon, 24 Aug, 22:38
Michael Armbrust Re: What does Attribute and AttributeReference mean in Spark SQL Tue, 25 Aug, 07:37
Michael Armbrust Re: query avro hive table in spark sql Thu, 27 Aug, 00:48
Michael Armbrust Re: How to unit test HiveContext without OutOfMemoryError (using sbt) Thu, 27 Aug, 00:53
Michael Armbrust Re: Differing performance in self joins Thu, 27 Aug, 01:27
Michael Armbrust Re: query avro hive table in spark sql Thu, 27 Aug, 19:02
Michael Armbrust Re: Data Frame support CSV or excel format ? Thu, 27 Aug, 19:35
Michael Knapp EOFException when transmitting a class that extends Externalizable Mon, 03 Aug, 17:10
Michael Malak Re: Build k-NN graph for large dataset Wed, 26 Aug, 14:57
Michael Segel Re: TCP/IP speedup Sun, 02 Aug, 16:12
Michal Monselise Fwd: Join with multiple conditions (In reference to SPARK-7197) Tue, 25 Aug, 18:21
Michal Monselise Re: Join with multiple conditions (In reference to SPARK-7197) Wed, 26 Aug, 23:59
Michel Robert unsubscribe Tue, 11 Aug, 16:47
Mike Sukmanowsky PySpark concurrent jobs using single SparkContext Thu, 20 Aug, 13:34
Mike Trienis Optimal way to implement a small lookup table for identifiers in an RDD Mon, 10 Aug, 21:13
Mike Trienis Spark SQL window functions (RowsBetween) Thu, 20 Aug, 22:32
Mike Trienis How to unit test HiveContext without OutOfMemoryError (using sbt) Tue, 25 Aug, 18:10
Mike Trienis Re: How to unit test HiveContext without OutOfMemoryError (using sbt) Wed, 26 Aug, 17:51
Mohammed Guller Spark SQL unable to recognize schema name Tue, 04 Aug, 18:45
Mohammed Guller RE: Combining Spark Files with saveAsTextFile Wed, 05 Aug, 04:38
Mohammed Guller RE: Combining Spark Files with saveAsTextFile Wed, 05 Aug, 04:43
Mohit Anchlia Checkpoint Dir Error in Yarn Sat, 08 Aug, 00:48
Mohit Anchlia Streaming of WordCount example Mon, 10 Aug, 17:29
Mohit Anchlia Re: Streaming of WordCount example Mon, 10 Aug, 18:43
Mohit Anchlia Re: Streaming of WordCount example Mon, 10 Aug, 19:43
Mohit Anchlia Re: Streaming of WordCount example Mon, 10 Aug, 23:15
Mohit Anchlia Re: Streaming of WordCount example Mon, 10 Aug, 23:21
Mohit Anchlia ClassNotFound spark streaming Tue, 11 Aug, 20:52
Mohit Anchlia Re: ClassNotFound spark streaming Tue, 11 Aug, 22:02
Mohit Anchlia Partitioning in spark streaming Tue, 11 Aug, 23:06
Mohit Anchlia Re: ClassNotFound spark streaming Tue, 11 Aug, 23:08
Mohit Anchlia Re: Partitioning in spark streaming Wed, 12 Aug, 00:35
Mohit Anchlia Re: Partitioning in spark streaming Wed, 12 Aug, 04:53
Mohit Anchlia Unit Testing Wed, 12 Aug, 23:31
Mohit Anchlia Spark RuntimeException hadoop output format Thu, 13 Aug, 17:49
Mohit Anchlia Re: Spark RuntimeException hadoop output format Fri, 14 Aug, 05:21
Mohit Anchlia Re: Spark RuntimeException hadoop output format Fri, 14 Aug, 20:36
Mohit Anchlia Re: Spark RuntimeException hadoop output format Fri, 14 Aug, 23:38
Mohit Anchlia Executors on multiple nodes Fri, 14 Aug, 23:40
Mohit Anchlia Too many files/dirs in hdfs Fri, 14 Aug, 23:50
Mohit Anchlia Re: Too many files/dirs in hdfs Tue, 18 Aug, 17:35
Mohit Anchlia Re: Too many files/dirs in hdfs Wed, 19 Aug, 16:38
Mohit Anchlia Re: Too many files/dirs in hdfs Mon, 24 Aug, 21:51
Mohit Anchlia Re: Too many files/dirs in hdfs Tue, 25 Aug, 20:20
Mohit Durgapal spark-kafka directAPI vs receivers based API Mon, 10 Aug, 12:51
Mohit Durgapal how do you convert directstream into data frames Fri, 14 Aug, 04:41
Mohit Durgapal Re: how do you convert directstream into data frames Fri, 14 Aug, 05:15
MooseSpark Re: issue Running Spark Job on Yarn Cluster Tue, 18 Aug, 07:20
MooseSpark Changed Column order in DataFrame.Columns call and insertIntoJDBC Tue, 18 Aug, 07:40
Moshe Eshel Spark UI returning error 500 in yarn-client mode Wed, 19 Aug, 09:51
MrJew Standalone Cluster Local Authentication Mon, 03 Aug, 17:05
Mridul Muralidharan Re: Spark runs into an Infinite loop even if the tasks are completed successfully Fri, 14 Aug, 07:34
Muhammad Atif Re: SparkSQL concerning materials Thu, 20 Aug, 14:50
Muhammad Haseeb Javed Difference between Sort based and Hash based shuffle Sat, 15 Aug, 20:42
Muhammad Haseeb Javed Re: Difference between Sort based and Hash based shuffle Sun, 16 Aug, 18:08
Muhammad Haseeb Javed Re: Difference between Sort based and Hash based shuffle Wed, 19 Aug, 12:52
Muhammad Haseeb Javed Building spark-examples takes too much time using Maven Wed, 26 Aug, 14:56
Muler Newbie question: can shuffle avoid writing and reading from disk? Wed, 05 Aug, 23:10
Muler Re: Newbie question: can shuffle avoid writing and reading from disk? Wed, 05 Aug, 23:50
Muler Re: Newbie question: can shuffle avoid writing and reading from disk? Thu, 06 Aug, 01:21
Muler Newbie question: what makes Spark run faster than MapReduce Fri, 07 Aug, 16:13
Muler Spark is in-memory processing, how then can Tachyon make Spark faster? Fri, 07 Aug, 16:42
Muler Error:(46, 66) not found: type SparkFlumeProtocol Tue, 25 Aug, 16:50
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · Next »Thread · Author · Date
Box list
Oct 202165
Sep 2021126
Aug 2021171
Jul 2021158
Jun 2021179
May 2021187
Apr 2021267
Mar 2021346
Feb 2021166
Jan 2021242
Dec 2020203
Nov 2020147
Oct 2020236
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137