spark-user mailing list archives: March 2016

Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next »Thread · Author · Date
James Hammerton Re: Find all invoices more than 6 months from csv file Tue, 22 Mar, 10:32
James Hammerton Re: Work out date column in CSV more than 6 months old (datediff or something) Tue, 22 Mar, 11:36
James Jia Spark reduce serialization question Fri, 04 Mar, 21:11
Jatin Kumar Spark streaming from Kafka best fit Tue, 01 Mar, 08:36
Jatin Kumar Re: Spark streaming from Kafka best fit Tue, 01 Mar, 17:59
Jatin Kumar Streaming UI tab misleading for window operations Sun, 06 Mar, 12:06
Jatin Kumar Re: Streaming UI tab misleading for window operations Sun, 06 Mar, 17:56
Jatin Kumar Re: Streaming UI tab misleading for window operations Sun, 06 Mar, 18:19
Jatin Kumar Performance tuning of spark pipeline Mon, 14 Mar, 12:19
Jatin Kumar Re: [Streaming] Difference between windowed stream and stream with large batch size? Tue, 22 Mar, 09:49
Jatin Kumar Spark Streaming UI duration numbers mismatch Tue, 22 Mar, 10:00
Jatin Kumar Re: sliding Top N window Tue, 22 Mar, 12:23
Jatin Kumar Re: sliding Top N window Tue, 22 Mar, 12:49
Jatin Kumar Re: sliding Top N window Wed, 23 Mar, 02:41
Jatin Kumar Re: Spark Streaming UI duration numbers mismatch Wed, 23 Mar, 19:20
Jatin Kumar Re: Spark Streaming UI duration numbers mismatch Wed, 30 Mar, 12:06
Jeff Zhang Support virtualenv in PySpark Tue, 01 Mar, 05:07
Jeff Zhang Re: Save DataFrame to Hive Table Tue, 01 Mar, 06:33
Jeff Zhang Re: Support virtualenv in PySpark Tue, 01 Mar, 08:29
Jeff Zhang Re: Converting array to DF Tue, 01 Mar, 08:51
Jeff Zhang Re: Is spark.driver.maxResultSize used correctly ? Tue, 01 Mar, 09:24
Jeff Zhang Re: Spark executor killed without apparent reason Wed, 02 Mar, 02:07
Jeff Zhang Re: Spark on Yarn with Dynamic Resource Allocation. Container always marked as failed Thu, 03 Mar, 00:30
Jeff Zhang Re: Renaming sc variable in sparkcontext throws task not serializable Thu, 03 Mar, 03:18
Jeff Zhang Re: OOM exception during Broadcast Mon, 07 Mar, 23:34
Jeff Zhang Re: Setting PYSPARK_PYTHON in spark-env.sh vs from driver program Tue, 08 Mar, 01:05
Jeff Zhang Re: Saving multiple outputs in the same job Wed, 09 Mar, 08:07
Jeff Zhang Re: what is the pyspark inverse of registerTempTable()? Tue, 15 Mar, 23:44
Jeff Zhang Re: what is the pyspark inverse of registerTempTable()? Wed, 16 Mar, 00:42
Jeff Zhang Re: Spark Thriftserver Wed, 16 Mar, 00:44
Jeff Zhang Re: Spark UI Completed Jobs Wed, 16 Mar, 00:50
Jeff Zhang Re: Spark Thriftserver Wed, 16 Mar, 02:44
Jeff Zhang Re: Spark Thriftserver Wed, 16 Mar, 02:49
Jeff Zhang Re: Job failed while submitting python to yarn programatically Wed, 16 Mar, 03:05
Jeff Zhang Re: exception while running job as pyspark Wed, 16 Mar, 07:06
Jeff Zhang Re: DataFrame vs RDD Wed, 23 Mar, 02:26
Jeff Zhang Re: [Critical] Issue with cached RDDs created from hadoop sequence files Wed, 23 Mar, 03:37
Jeff Zhang Re: [Critical] Issue with cached RDDs created from hadoop sequence files Wed, 23 Mar, 03:58
Jeff Zhang Re: [Critical] Issue with cached RDDs created from hadoop sequence files Wed, 23 Mar, 04:00
Jeff Zhang Re: run spark job Tue, 29 Mar, 07:47
Jeff Zhang Re: pyspark unable to convert dataframe column to a vector: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient Wed, 30 Mar, 05:34
Jeff Zhang Re: sqlContext.cacheTable + yarn client mode Thu, 31 Mar, 05:26
Jelez Raditchkov S3 DirectParquetOutputCommitter + PartitionBy + SaveMode.Append Fri, 04 Mar, 20:59
Jelez Raditchkov How to get the singleton instance of SQLContext/HiveContext: val sqlContext = SQLContext.getOrCreate(rdd.sparkContext) Fri, 04 Mar, 21:05
Jelez Raditchkov Best way to merge files from streaming jobs Fri, 04 Mar, 21:09
Jelez Raditchkov FW: How to get the singleton instance of SQLContext/HiveContext: val sqlContext = SQLContext.getOrCreate(rdd.sparkContext) Fri, 04 Mar, 22:09
Jelez Raditchkov RE: Error building a self contained Spark app Fri, 04 Mar, 23:00
Jerry Lam [Spark SQL] Unexpected Behaviour Mon, 28 Mar, 21:34
Jerry Lam Re: [Spark SQL] Unexpected Behaviour Tue, 29 Mar, 06:01
Jerry Lam Re: [Spark SQL] Unexpected Behaviour Tue, 29 Mar, 06:33
Jerry Lam Re: [Spark SQL] Unexpected Behaviour Tue, 29 Mar, 11:38
Jerry Lam Re: [Spark SQL] Unexpected Behaviour Tue, 29 Mar, 11:43
Jesse F Chen select count(*) return wrong row counts Thu, 03 Mar, 02:38
Jesse F Chen Re: Configuring/Optimizing Spark Fri, 04 Mar, 01:38
Jesse F Chen Re: OOM Exception in my spark streaming application Wed, 16 Mar, 17:43
Jialin Liu Re: spark launching range is 10 mins Sun, 20 Mar, 06:34
Jim Carroll Non-classification neural networks Sun, 27 Mar, 13:24
John Lilley RE: Graphx Fri, 11 Mar, 12:46
John Lilley RE: Graphx Fri, 11 Mar, 15:02
John Lilley RE: Graphx Fri, 11 Mar, 15:46
John Lilley RE: Graphx Fri, 11 Mar, 16:03
John Lilley RE: Graphx Fri, 11 Mar, 16:07
John Lilley RE: Graphx Fri, 11 Mar, 18:22
John Lilley RE: Graphx Fri, 11 Mar, 18:25
John Radin Apache Spark-Get All Field Names From Nested Arbitrary JSON Files Thu, 31 Mar, 22:08
Jonathan Kelly Re: scikit learn on EMR PySpark Wed, 02 Mar, 00:21
Jonathan Kelly Re: Configure Spark Resource on AWS CLI Not Working Wed, 02 Mar, 01:01
JoneZhang Does parallelize and collect preserve the original order of list? Wed, 16 Mar, 02:16
Jong Wook Kim Re: AVRO vs Parquet Fri, 04 Mar, 04:48
Jorge Machado Re: Does SparkSql has official jdbc/odbc driver? Tue, 29 Mar, 08:23
Jorge Machado Re: Does SparkSql has official jdbc/odbc driver? Tue, 29 Mar, 08:33
Jorge Machado Re: Does SparkSql has official jdbc/odbc driver? Tue, 29 Mar, 08:35
Joseph The build-in indexes in ORC file does not work. Wed, 16 Mar, 10:23
Joseph Re: Re: The build-in indexes in ORC file does not work. Wed, 16 Mar, 13:46
Joseph Spark: The build-in indexes in ORC file do not work. Mon, 21 Mar, 03:33
Joseph Bradley Merging ML Estimator and Model Mon, 21 Mar, 18:53
Joseph Bradley Re: SparkML algos limitations question. Mon, 21 Mar, 20:24
Joseph Bradley Re: Handling Missing Values in MLLIB Decision Tree Tue, 22 Mar, 17:14
Joseph Bradley Re: SparkML RandomForest java.lang.StackOverflowError Tue, 29 Mar, 19:09
Josh Rosen Does anyone implement org.apache.spark.serializer.Serializer in their own code? Tue, 08 Mar, 02:57
Josh Rosen Re: Python unit tests - Unable to ru it with Python 2.6 or 2.7 Fri, 11 Mar, 18:35
Josh Rosen Re: Apache Spark Exception in thread “main” java.lang.NoClassDefFoundError: scala/collection/GenTraversableOnce$class Thu, 17 Mar, 02:07
Josh Rosen Re: Apache Spark Exception in thread “main” java.lang.NoClassDefFoundError: scala/collection/GenTraversableOnce$class Thu, 17 Mar, 02:08
Josh Rosen Re: Spark master keeps running out of RAM Thu, 31 Mar, 18:22
Joshua Cason Fwd: NoSuchElementException in ChiSqSelector fit method (version 1.6.0) Mon, 28 Mar, 01:50
Joshua Dickerson RE: Steps to Run Spark Scala job from Oozie on EC2 Hadoop clsuter Wed, 23 Mar, 12:56
Joshua Sorrell Does pyspark still lag far behind the Scala API in terms of features Tue, 01 Mar, 13:03
Joshua Sorrell Re: Does pyspark still lag far behind the Scala API in terms of features Thu, 03 Mar, 13:46
Juan Leaniz Re: Streaming job delays Wed, 09 Mar, 16:45
Jules Damji Re: Does pyspark still lag far behind the Scala API in terms of features Tue, 01 Mar, 17:07
Jules Damji Re: shuffle in spark Mon, 14 Mar, 22:19
Jy Chen Fwd: Dynamic allocation doesn't work on YARN Wed, 09 Mar, 08:29
Jy Chen Re: Dynamic allocation doesn't work on YARN Thu, 10 Mar, 02:09
Jy Chen Fwd: Dynamic allocation doesn't work on YARN Thu, 10 Mar, 02:18
Jy Chen Re: Dynamic allocation doesn't work on YARN Thu, 10 Mar, 08:03
Jyothi Mandava getting null values from hive partitioned table after upgrading Spark to 1.5.0 Thu, 03 Mar, 22:51
Kabeer Ahmed Re: Adding hive context gives error Tue, 08 Mar, 00:34
Kalpit Shah Re: Changing number of workers for benchmarking purposes Mon, 14 Mar, 22:47
Karan Kumar Re: [Proposal] Enabling time series analysis on spark metrics Tue, 01 Mar, 16:17
Karan Kumar Re: [Proposal] Enabling time series analysis on spark metrics Thu, 03 Mar, 17:52