spark-user mailing list archives: July 2016

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · Next »Thread · Author · Date
Omid Alipourfard Question regarding structured data and partitions Thu, 07 Jul, 03:55
Koert Kuipers   Re: Question regarding structured data and partitions Thu, 07 Jul, 04:30
tan shai     Re: Question regarding structured data and partitions Thu, 07 Jul, 08:24
Koert Kuipers       Re: Question regarding structured data and partitions Thu, 07 Jul, 14:59
tan shai         Re: Question regarding structured data and partitions Thu, 07 Jul, 15:07
Lan Jiang Processing json document Thu, 07 Jul, 05:48
Jean Georges Perrin   Re: Processing json document Thu, 07 Jul, 06:09
Hyukjin Kwon   Re: Processing json document Thu, 07 Jul, 06:12
Jörn Franke     Re: Processing json document Thu, 07 Jul, 06:42
Hyukjin Kwon   Re: Processing json document Thu, 07 Jul, 06:47
Lan Jiang     Re: Processing json document Thu, 07 Jul, 16:57
Yong Zhang       RE: Processing json document Thu, 07 Jul, 17:06
Hyukjin Kwon         RE: Processing json document Fri, 08 Jul, 04:09
Jörn Franke         Re: Processing json document Fri, 08 Jul, 07:54
Re: MLLib SVMWithSGD is failing for large dataset
Chitturi Padma   Re: MLLib SVMWithSGD is failing for large dataset Thu, 07 Jul, 06:17
linxi zeng SparkSQL Added file get Exception: is a directory and recursive is not turned on Thu, 07 Jul, 06:18
SPARK-8813 - combining small files in spark sql
Ajay Srivastava   SPARK-8813 - combining small files in spark sql Thu, 07 Jul, 06:53
Puneet Tripathi Spark with HBase Error - Py4JJavaError Thu, 07 Jul, 07:11
Puneet Tripathi   RE: Spark with HBase Error - Py4JJavaError Thu, 07 Jul, 11:49
ram kumar     Re: Spark with HBase Error - Py4JJavaError Thu, 07 Jul, 13:21
Puneet Tripathi       RE: Spark with HBase Error - Py4JJavaError Fri, 08 Jul, 13:39
Mungeol Heo stddev_samp() gives NaN Thu, 07 Jul, 08:23
Sean Owen   Re: stddev_samp() gives NaN Thu, 07 Jul, 08:29
Mich Talebzadeh     Re: stddev_samp() gives NaN Thu, 07 Jul, 08:37
Sean Owen       Re: stddev_samp() gives NaN Thu, 07 Jul, 08:39
Mich Talebzadeh         Re: stddev_samp() gives NaN Thu, 07 Jul, 08:41
Sean Owen           Re: stddev_samp() gives NaN Thu, 07 Jul, 08:55
Mungeol Heo         Re: stddev_samp() gives NaN Thu, 07 Jul, 08:51
Sean Owen           Re: stddev_samp() gives NaN Thu, 07 Jul, 08:57
Mungeol Heo             Re: stddev_samp() gives NaN Fri, 08 Jul, 02:05
tan shai Optimize filter operations with sorted data Thu, 07 Jul, 09:25
Ted Yu   Re: Optimize filter operations with sorted data Thu, 07 Jul, 09:43
Chanh Le     Re: Optimize filter operations with sorted data Thu, 07 Jul, 09:58
tan shai       Re: Optimize filter operations with sorted data Thu, 07 Jul, 11:40
Chanh Le         Re: Optimize filter operations with sorted data Thu, 21 Jul, 07:30
tan shai     Re: Optimize filter operations with sorted data Thu, 07 Jul, 11:39
SamyaMaiti Spark streaming Kafka Direct API + Multiple consumers Thu, 07 Jul, 09:34
Rabin Banerjee   Re: Spark streaming Kafka Direct API + Multiple consumers Thu, 07 Jul, 10:53
Arnaud Bailly Multiple aggregations over streaming dataframes Thu, 07 Jul, 10:18
Sivakumaran S   Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 10:55
Arnaud Bailly     Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 12:06
Sivakumaran S       Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 12:17
Arnaud Bailly         Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 15:59
Michael Armbrust           Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 21:31
Andy Davidson             Re: Multiple aggregations over streaming dataframes Thu, 07 Jul, 22:00
Arnaud Bailly               Re: Multiple aggregations over streaming dataframes Fri, 08 Jul, 08:00
kevin ClassNotFoundException: org.apache.parquet.hadoop.ParquetOutputCommitter Thu, 07 Jul, 10:47
Bryan Cutler   Re: ClassNotFoundException: org.apache.parquet.hadoop.ParquetOutputCommitter Thu, 07 Jul, 15:50
brccosta RDD and Dataframes Thu, 07 Jul, 11:20
Rishi Mishra   Re: RDD and Dataframes Thu, 07 Jul, 12:10
Bruno Costa     Re: RDD and Dataframes Thu, 07 Jul, 12:25
RK Aduri   Re: RDD and Dataframes Fri, 15 Jul, 23:53
Taotao.Li     Re: RDD and Dataframes Sat, 16 Jul, 01:45
luohui20...@sina.com 回复:Re: how to select first 50 value of each group after group by? Thu, 07 Jul, 11:26
Anton Okolnychyi   Re: Re: how to select first 50 value of each group after group by? Thu, 07 Jul, 12:38
Mich Talebzadeh   Re: Re: how to select first 50 value of each group after group by? Thu, 07 Jul, 13:06
Michal Vince problem extracting map from json Thu, 07 Jul, 12:18
Sivakumaran S   Re: problem extracting map from json Thu, 07 Jul, 12:23
pseudo oduesp categoricalFeaturesInfo Thu, 07 Jul, 13:12
tan shai Extend Dataframe API Thu, 07 Jul, 13:31
Koert Kuipers   Re: Extend Dataframe API Thu, 07 Jul, 15:07
tan shai     Re: Extend Dataframe API Thu, 07 Jul, 15:09
Rishi Mishra       Re: Extend Dataframe API Fri, 08 Jul, 08:28
Patrick Woody Spark 1.6.2 short circuit AND filter broken Thu, 07 Jul, 15:10
Andy Davidson spark streaming: how come I have scheduling delay when processing time is less then batch windowing size Thu, 07 Jul, 17:47
Andy Davidson   Re: spark streaming: how come I have scheduling delay when processing time is less then batch windowing size Thu, 07 Jul, 20:09
Spark as sql engine on S3
Ashok Kumar   Spark as sql engine on S3 Thu, 07 Jul, 17:50
ayan guha     Re: Spark as sql engine on S3 Fri, 08 Jul, 04:27
Ashok Kumar       Re: Spark as sql engine on S3 Fri, 08 Jul, 05:03
ayan guha         Re: Spark as sql engine on S3 Fri, 08 Jul, 05:30
Ashok Kumar           Re: Spark as sql engine on S3 Fri, 08 Jul, 09:49
Mich Talebzadeh             Re: Spark as sql engine on S3 Fri, 08 Jul, 14:36
Robert Towne spark read from http endpoint? Thu, 07 Jul, 20:39
Andy Davidson is dataframe.write() async? Streaming performance problem Thu, 07 Jul, 20:59
Cody Koeninger   Re: is dataframe.write() async? Streaming performance problem Fri, 08 Jul, 14:31
Ewan Leith     RE: is dataframe.write() async? Streaming performance problem Fri, 08 Jul, 15:52
Re: Compute pairwise distance
Manoj Awasthi   Re: Compute pairwise distance Fri, 08 Jul, 03:13
Debasish Das     Re: Compute pairwise distance Fri, 08 Jul, 04:43
Chanh Le Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 03:19
Mich Talebzadeh   Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 03:49
Chanh Le     Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 03:58
Mich Talebzadeh       Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 04:18
Chanh Le         Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 04:29
ayan guha       Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 04:21
Chanh Le         Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 04:34
ayan guha           Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 05:00
Mich Talebzadeh         Re: Any ways to connect BI tool to Spark without Hive Fri, 08 Jul, 04:55
Re: Custom Spark Error on Hadoop Cluster
Xiangrui Meng   Re: Custom Spark Error on Hadoop Cluster Fri, 08 Jul, 05:32
Xiangrui Meng     Re: Custom Spark Error on Hadoop Cluster Mon, 11 Jul, 19:23
Xiangrui Meng       Re: Custom Spark Error on Hadoop Cluster Mon, 18 Jul, 13:41
luohui20...@sina.com 回复:Re: Re: how to select first 50 value of each group after group by? Fri, 08 Jul, 06:20
aasish.kumar Memory grows exponentially Fri, 08 Jul, 06:56
Jörn Franke   Re: Memory grows exponentially Fri, 08 Jul, 07:44
Cody Koeninger     Re: Memory grows exponentially Fri, 08 Jul, 14:25
Sea Bug about reading parquet files Fri, 08 Jul, 08:33
Cheng Lian   Re: Bug about reading parquet files Fri, 08 Jul, 08:47
Sea     回复: Bug about reading parquet files Fri, 08 Jul, 12:44
Cheng Lian       Re: 回复: Bug about reading parquet files Sat, 09 Jul, 07:57
Mungeol Heo How to improve the performance for writing a data frame to a JDBC database? Fri, 08 Jul, 09:23
Mikael Ståldal Why is KafkaUtils.createRDD offsetRanges an Array rather than a Seq? Fri, 08 Jul, 09:42
Sean Owen   Re: Why is KafkaUtils.createRDD offsetRanges an Array rather than a Seq? Fri, 08 Jul, 10:55
Cody Koeninger     Re: Why is KafkaUtils.createRDD offsetRanges an Array rather than a Seq? Fri, 08 Jul, 14:22
Mikael Ståldal Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 09:43
Sean Owen   Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 10:56
Mikael Ståldal     Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 12:56
Sean Owen       Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 13:11
Mikael Ståldal         Re: Is the operation inside foreachRDD supposed to be blocking? Fri, 08 Jul, 13:26
tan shai [No Subject] Fri, 08 Jul, 10:58
tan shai RangePartitioning Fri, 08 Jul, 11:39
Mazen Simultaneous spark Jobs execution. Fri, 08 Jul, 12:03
Jacek Laskowski   Re: Simultaneous spark Jobs execution. Fri, 08 Jul, 13:09
Mich Talebzadeh     Re: Simultaneous spark Jobs execution. Fri, 08 Jul, 14:15
vimal dinakaran spark logging best practices Fri, 08 Jul, 12:56
Punit Naik Spark Terasort Help Fri, 08 Jul, 15:57
Pasquinell Urbani Iterate over columns in sql.dataframe Fri, 08 Jul, 16:08
Andy Davidson can I use ExectorService in my driver? was: is dataframe.write() async? Streaming performance problem Fri, 08 Jul, 17:29
Ewan Leith   Re: can I use ExectorService in my driver? was: is dataframe.write() async? Streaming performance problem Fri, 08 Jul, 17:38
Ellis, Tom (Financial Markets IT) Unresponsive Spark Streaming UI in YARN cluster mode - 1.5.2 Fri, 08 Jul, 17:30
Shixiong(Ryan) Zhu   Re: Unresponsive Spark Streaming UI in YARN cluster mode - 1.5.2 Fri, 08 Jul, 18:10
Ellis, Tom (Financial Markets IT)     RE: Unresponsive Spark Streaming UI in YARN cluster mode - 1.5.2 Fri, 08 Jul, 18:14
Shixiong(Ryan) Zhu       Re: Unresponsive Spark Streaming UI in YARN cluster mode - 1.5.2 Fri, 08 Jul, 18:21
Ellis, Tom (Financial Markets IT)         RE: Unresponsive Spark Streaming UI in YARN cluster mode - 1.5.2 Fri, 08 Jul, 20:47
Pedro Rodriguez DataFrame Min By Column Fri, 08 Jul, 19:57
Xinh Huynh   Re: DataFrame Min By Column Sat, 09 Jul, 00:06
Pedro Rodriguez     Re: DataFrame Min By Column Sat, 09 Jul, 07:32
Pedro Rodriguez       Re: DataFrame Min By Column Sat, 09 Jul, 13:10
Michael Armbrust         Re: DataFrame Min By Column Sat, 09 Jul, 20:18
Pedro Rodriguez           Re: DataFrame Min By Column Sat, 09 Jul, 21:20
Michael Armbrust             Re: DataFrame Min By Column Sun, 10 Jul, 05:46
dsp Isotonic Regression, run method overloaded Error Fri, 08 Jul, 20:38
Yanbo Liang   Re: Isotonic Regression, run method overloaded Error Mon, 11 Jul, 05:20
Fridtjof Sander   Re: Isotonic Regression, run method overloaded Error Mon, 11 Jul, 13:14
Yanbo Liang     Re: Isotonic Regression, run method overloaded Error Mon, 11 Jul, 15:06
Fridtjof Sander       Re: Isotonic Regression, run method overloaded Error Mon, 11 Jul, 15:19
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · Next »Thread · Author · Date
Box list
Oct 2020218
Sep 2020136
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137