spark-user mailing list archives: August 2016

Site index · List index
Message list1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · Next »Thread · Author · Date
$iddhe$h Divekar PermGen space Error Thu, 04 Aug, 03:14
$iddhe$h Divekar Re: PermGen space Error Thu, 04 Aug, 04:41
$iddhe$h Divekar Spark jobs failing due to java.lang.OutOfMemoryError: PermGen space Thu, 04 Aug, 14:34
$iddhe$h Divekar Re: Spark jobs failing due to java.lang.OutOfMemoryError: PermGen space Thu, 04 Aug, 14:47
何琪 Unsubscribe Tue, 16 Aug, 05:50
zhangjp unsubscribe Tue, 02 Aug, 03:00
Jean-Baptiste Onofré Re: Aggregations with scala pairs Thu, 18 Aug, 06:35
Jean-Baptiste Onofré Re: error when running spark from oozie launcher Thu, 18 Aug, 06:37
شجاع الرحمن بیگ org.apache.spark.shuffle.MetadataFetchFailedException: Missing an output location for shuffle 0 Wed, 10 Aug, 19:34
张梓轩 Bug: Spark Streaming Application Failure Recovery Failed on Windows Fri, 05 Aug, 07:20
李剑 mapWithState handle timeout Sat, 06 Aug, 08:00
林家銘 pyspark pickle error when using itertools.groupby Fri, 05 Aug, 05:31
梅西0247 回复:ApplicationMaster + Fair Scheduler + Dynamic resource allocation Tue, 30 Aug, 13:21
金国栋 Multiple Sources Found for Parquet Mon, 08 Aug, 09:34
陈哲 Questions about ml.random forest (only one decision tree?) Thu, 04 Aug, 08:48
陈哲 MulticlassClassificationEvaluator use Thu, 11 Aug, 04:00
陈哲 How to Improve Random Forest classifier accuracy Thu, 18 Aug, 08:25
陈哲 How to Improve Random Forest classifier accuracy Thu, 18 Aug, 08:31
陈哲 How to continuous update or refresh RandomForestClassificationModel Fri, 19 Aug, 08:21
颜发才(Yan Facai) [Spark 2.0] spark.sql.hive.metastore.jars doesn't work Fri, 12 Aug, 10:28
颜发才(Yan Facai) [Spark 2.0] ClassNotFoundException is thrown when using Hive Thu, 18 Aug, 09:47
Michał Zieliński Re: Spark ML : One hot Encoding for multiple columns Wed, 17 Aug, 17:54
Michał Zieliński Re: VectorUDT with spark.ml.linalg.Vector Wed, 17 Aug, 18:46
Andrés Ivaldi Spark 2 and Solr Mon, 01 Aug, 14:56
Andrés Ivaldi Spark 1.6.1 and regexp_replace Tue, 09 Aug, 16:18
Andrés Ivaldi Aggregations with scala pairs Wed, 17 Aug, 14:01
Andrés Ivaldi Re: Aggregations with scala pairs Thu, 18 Aug, 15:35
Maciej Bryński Re: GraphFrames 0.2.0 released Wed, 24 Aug, 17:11
Maximiliano Patricio Méndez Re: Problems initializing SparkUI Mon, 01 Aug, 16:44
Maximiliano Patricio Méndez Re: Problems initializing SparkUI Mon, 01 Aug, 18:08
Maximiliano Patricio Méndez Re: Problems initializing SparkUI Mon, 01 Aug, 20:23
Maximiliano Patricio Méndez Re: Problems initializing SparkUI Mon, 01 Aug, 21:03
Maximiliano Patricio Méndez Re: Problems initializing SparkUI Mon, 01 Aug, 21:23
Otis Gospodnetić Re: Spark Executor Metrics Tue, 16 Aug, 18:31
Otis Gospodnetić Spark metrics when running with YARN? Tue, 30 Aug, 05:53
Otis Gospodnetić Re: Spark metrics when running with YARN? Tue, 30 Aug, 12:43
Renato Marroquín Mogrovejo Re: mutable.LinkedHashMap kryo serialization issues Fri, 26 Aug, 06:13
Renato Marroquín Mogrovejo Reading parquet files into Spark Streaming Fri, 26 Aug, 15:42
Renato Marroquín Mogrovejo Re: Reading parquet files into Spark Streaming Fri, 26 Aug, 21:56
Renato Marroquín Mogrovejo Re: Reading parquet files into Spark Streaming Sat, 27 Aug, 13:24
Wojciech Pituła Re: submitting spark job with kerberized Hadoop issue Sat, 06 Aug, 13:29
Yann-Aël Le Borgne Re: Avoid Cartesian product in calculating a distance matrix? Sat, 06 Aug, 08:54
Yann-Aël Le Borgne Re: UDF in SparkR Wed, 17 Aug, 16:26
陈宇航 issue with coalesce in Spark 2.0.0 Tue, 02 Aug, 09:57
莫涛 答复: how to generate a column using mapParition and then add it back to the df? Mon, 08 Aug, 09:44
莫涛 答复: 答复: how to generate a column using mapParition and then add it back to the df? Tue, 09 Aug, 02:14
Cleosson José Pirani de Souza ApplicationMaster + Fair Scheduler + Dynamic resource allocation Tue, 30 Aug, 11:30
Dávid Szakállas updateStateByKey for window batching Mon, 22 Aug, 10:49
Jörn Franke Re: Extracting key word from a textual column Tue, 02 Aug, 21:29
Jörn Franke Re: Extracting key word from a textual column Wed, 03 Aug, 05:32
Jörn Franke Re: Extracting key word from a textual column Wed, 03 Aug, 05:32
Jörn Franke Re: Does Spark SQL support indexes? Sun, 14 Aug, 06:13
Jörn Franke Re: how to do nested loops over 2 arrays but use Two RDDs instead ? Mon, 15 Aug, 19:11
Jörn Franke Re: Spark Yarn executor container memory Tue, 16 Aug, 05:34
Jörn Franke Re: How to Improve Random Forest classifier accuracy Thu, 18 Aug, 08:46
Jörn Franke Re: Best way to read XML data from RDD Sat, 20 Aug, 06:10
@Sanjiv Singh Re: Spark SQL Parallelism - While reading from Oracle Wed, 10 Aug, 15:28
Aakash Basu Unsubscribe Tue, 09 Aug, 18:24
Aasish Kumar Re: Spark Streaming Job Keeps growing memory over time Tue, 09 Aug, 12:19
Abhishek Ranjan Relative path in absolute URI Wed, 03 Aug, 06:54
Adam Roberts Re: Spark build 1.6.2 error Wed, 31 Aug, 13:19
Adamantios Corais Grid Search using Spark MLLib Pipelines Fri, 12 Aug, 16:17
Adamantios Corais Re: Grid Search using Spark MLLib Pipelines Fri, 12 Aug, 18:24
Adamantios Corais Best range of parameters for grid search? Wed, 24 Aug, 09:26
Aditya SparkStreaming source code Thu, 18 Aug, 12:30
Aditya Re: [Spark 2.0] ClassNotFoundException is thrown when using Hive Thu, 18 Aug, 12:35
Aditya Re: JavaRDD to DataFrame fails with null pointer exception in 1.6.0 Thu, 18 Aug, 12:43
Adonis Settouf Re: SparkStreaming source code Thu, 18 Aug, 12:39
Adrian Bridgett coalesce serialising earlier work Tue, 09 Aug, 07:11
Adrian Bridgett 2.0.1/2.1.x release dates Thu, 18 Aug, 09:35
Ahmed El-Gamal Sorting a DStream and taking topN Sun, 07 Aug, 14:43
Ahmed El-Gamal Sorting a DStream and taking topN Sun, 07 Aug, 14:46
Ahmed Sadek Why training data in Kmeans Spark streaming clustering Thu, 11 Aug, 16:14
Ai Deng reuse the Spark SQL internal metrics Tue, 30 Aug, 21:17
Akhilesh Pathodia Re: Reading parquet files into Spark Streaming Sat, 27 Aug, 05:01
Akhilesh Pathodia Re: Reading parquet files into Spark Streaming Sat, 27 Aug, 07:01
AlexModestov work with russian letters Wed, 24 Aug, 09:37
Alexander Peletz pyspark.sql.functions.last not working as expected Wed, 17 Aug, 15:56
Alexander Peletz RE: pyspark.sql.functions.last not working as expected Thu, 18 Aug, 00:47
Alexander Peletz RE: pyspark.sql.functions.last not working as expected Thu, 18 Aug, 14:12
Alexey Svyatkovskiy Re: VectorUDT with spark.ml.linalg.Vector Wed, 17 Aug, 04:31
Alonso Isidoro Roman Re: Design patterns involving Spark Tue, 30 Aug, 07:33
Alonso Isidoro Roman Re: Design patterns involving Spark Tue, 30 Aug, 08:10
Amit Sela Re: spark 2.0 readStream from a REST API Mon, 01 Aug, 16:44
Amit Sela Dropping late date in Structured Streaming Sat, 06 Aug, 13:40
Andrei Ivanov Spark 2.0 History Server Storage Mon, 01 Aug, 21:10
Andrei Ivanov Re: Spark 2.0 History Server Storage Tue, 02 Aug, 18:07
Andrei Ivanov Re: Spark 2.0 History Server Storage Tue, 02 Aug, 18:27
Andrew Ehrlich Re: How to write contents of RDD to HDFS as separate file for each item in RDD (PySpark) Mon, 01 Aug, 00:18
Andrew Ehrlich Re: Tuning level of Parallelism: Increase or decrease? Mon, 01 Aug, 00:27
Andrew Ehrlich Re: Changing Spark configuration midway through application. Wed, 10 Aug, 16:59
Andrew Vykhodtsev Can't connect to remote spark standalone cluster: getting WARN TaskSchedulerImpl: Initial job has not accepted any resources Tue, 16 Aug, 23:24
Andy Davidson python 'Jupyter' data frame problem with autocompletion Mon, 01 Aug, 17:08
Andy Davidson Re: spark 1.6.0 read s3 files error. Tue, 02 Aug, 16:54
Andy Davidson FW: [jupyter] newbie. apache spark python3 'Jupyter' data frame problem with auto completion and accessing documentation Tue, 02 Aug, 16:57
Andy Davidson py4j.Py4JException: Method lower([class java.lang.String]) does not exist Thu, 18 Aug, 17:07
Andy Davidson Py4JJavaError: An error occurred while calling None.org.apache.spark.sql.hive.HiveContext. Thu, 18 Aug, 18:34
Andy Davidson pyspark unable to create UDF: java.lang.RuntimeException: org.apache.hadoop.fs.FileAlreadyExistsException: Parent path is not a directory: /tmp tmp Thu, 18 Aug, 21:56
Andy Davidson Re: pyspark unable to create UDF: java.lang.RuntimeException: org.apache.hadoop.fs.FileAlreadyExistsException: Parent path is not a directory: /tmp tmp Fri, 19 Aug, 00:11
Andy Grove Regression in Java RDD sortBy() in Spark 2.0 Fri, 05 Aug, 04:25
Message list1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · Next »Thread · Author · Date
Box list
May 2019274
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137