spark-user mailing list archives: June 2017

张明磊 Why my project has this kind of error ? Tue, 20 Jun, 05:54
Александр Крашенинников Saving RDD as Kryo (broken in 2.1) Mon, 26 Jun, 08:17
Даша Ковальчук [Spark Core] Does spark support read from remote Hive server via JDBC Thu, 08 Jun, 08:16
Даша Ковальчук Re: [Spark Core] Does spark support read from remote Hive server via JDBC Thu, 08 Jun, 15:41
Даша Ковальчук Re: [Spark Core] Does spark support read from remote Hive server via JDBC Thu, 08 Jun, 17:30
Даша Ковальчук Re: [Spark Core] Does spark support read from remote Hive server via JDBC Fri, 09 Jun, 07:12
Даша Ковальчук Re: [Spark Core] Does spark support read from remote Hive server via JDBC Fri, 09 Jun, 10:01
李斌松 Does spark support hive table(parquet) column renaming? Mon, 19 Jun, 13:19
颜发才(Yan Facai) Re: Adding header to an rdd before saving to text file Tue, 06 Jun, 05:38
颜发才(Yan Facai) Re: Convert the feature vector to raw data Wed, 07 Jun, 09:23
颜发才(Yan Facai) Re: LibSVM should have just one input file Mon, 12 Jun, 05:44
颜发才(Yan Facai) Re: [How-To] Migrating from mllib.tree.DecisionTree to ml.regression.DecisionTreeRegressor Fri, 16 Jun, 05:34
颜发才(Yan Facai) Re: Best alternative for Category Type in Spark Dataframe Fri, 16 Jun, 05:42
颜发才(Yan Facai) Re: Best alternative for Category Type in Spark Dataframe Sat, 17 Jun, 22:53
颜发才(Yan Facai) Re: Best alternative for Category Type in Spark Dataframe Sun, 18 Jun, 03:08
颜发才(Yan Facai) Re: [ML] Stop conditions for RandomForest Wed, 28 Jun, 09:17
Sahib Aulakh [Search] Re: Question about mllib.recommendation.ALS Thu, 08 Jun, 15:17
Sahib Aulakh [Search] Re: Question about mllib.recommendation.ALS Thu, 08 Jun, 15:42
Andrés Ivaldi UDF percentile_approx Tue, 13 Jun, 18:52
Andrés Ivaldi Re: UDF percentile_approx Wed, 14 Jun, 11:47
Herman van Hövell tot Westerflier Re: Question on Spark code Sun, 25 Jun, 09:37
萝卜丝炒饭 a stage can belong to more than one job please? Tue, 06 Jun, 12:04
萝卜丝炒饭 Re: a stage can belong to more than one job please? Wed, 07 Jun, 01:06
萝卜丝炒饭 how to debug app with cluster mode please? Tue, 13 Jun, 12:49
萝卜丝炒饭 the dependence length of RDD, can its size be greater than 1 pleaae? Thu, 15 Jun, 08:11
萝卜丝炒饭 Re: the dependence length of RDD, can its size be greater than 1 pleaae? Fri, 16 Jun, 02:11
萝卜丝炒饭 the scheme in stream reader Sun, 18 Jun, 07:27
萝卜丝炒饭 Re: the scheme in stream reader Tue, 20 Jun, 01:46
萝卜丝炒饭 the meaning of partition column and bucket column please? Tue, 20 Jun, 02:00
萝卜丝炒饭 issue about the windows slice of stream Sat, 24 Jun, 06:51
萝卜丝炒饭 the compile of spark stoped without any hints, would you like help me please? Sun, 25 Jun, 12:29
萝卜丝炒饭 Re: issue about the windows slice of stream Sun, 25 Jun, 13:44
萝卜丝炒饭 Re: Spark Streaming reduceByKeyAndWindow with inverse function seems to iterate over all the keys in the window even though they are not present in the current batch Tue, 27 Jun, 00:52
萝卜丝炒饭 Re: issue about the windows slice of stream Tue, 27 Jun, 00:58
萝卜丝炒饭 Re: What is the real difference between Kafka streaming and Spark Streaming? Tue, 27 Jun, 01:35
萝卜丝炒饭 how to mention others in JIRA comment please? Tue, 27 Jun, 01:56
萝卜丝炒饭 Re: how to mention others in JIRA comment please? Tue, 27 Jun, 02:54
萝卜丝炒饭 Re: Question about Parallel Stages in Spark Tue, 27 Jun, 03:33
萝卜丝炒饭 Re: Question about Parallel Stages in Spark Tue, 27 Jun, 03:47
萝卜丝炒饭 the function of countByValueAndWindow and foreachRDD in DStream, would you like help me understand it please? Tue, 27 Jun, 15:08
萝卜丝炒饭 Re: How do I find the time taken by each step in a stage in a Spark Job Wed, 28 Jun, 07:51
Jörn Franke Re: An Architecture question on the use of virtualised clusters Thu, 01 Jun, 07:21
Jörn Franke Re: Java SPI jar reload in Spark Tue, 06 Jun, 09:55
Jörn Franke Re: Performance issue when running Spark-1.6.1 in yarn-client mode with Hadoop 2.6.0 Wed, 07 Jun, 05:46
Jörn Franke Re: [CSV] If number of columns of one row bigger than maxcolumns it stop the whole parsing process. Wed, 07 Jun, 16:45
Jörn Franke Re: Scala, Python or Java for Spark programming Wed, 07 Jun, 16:51
Jörn Franke Re: [CSV] If number of columns of one row bigger than maxcolumns it stop the whole parsing process. Thu, 08 Jun, 05:10
Jörn Franke Re: Scala, Python or Java for Spark programming Thu, 08 Jun, 06:44
Jörn Franke Re: [CSV] If number of columns of one row bigger than maxcolumns it stop the whole parsing process. Thu, 08 Jun, 07:47
Jörn Franke Re: [Spark JDBC] Does spark support read from remote Hive server via JDBC Sun, 11 Jun, 11:24
Jörn Franke Re: Spark Streaming Design Suggestion Tue, 13 Jun, 20:47
Jörn Franke Re: fetching and joining data from two different clusters Thu, 15 Jun, 16:05
Jörn Franke Re: fetching and joining data from two different clusters Thu, 15 Jun, 19:42
Jörn Franke Re: fetching and joining data from two different clusters Thu, 15 Jun, 20:27
Jörn Franke Re: fetching and joining data from two different clusters Sun, 18 Jun, 20:11
Jörn Franke Re: Using Spark as a simulator Tue, 20 Jun, 14:12
Jörn Franke Re: "Sharing" dataframes... Tue, 20 Jun, 18:09
Jörn Franke Re: PySpark working with Generators Fri, 30 Jun, 14:16
ALunar Beach Spark Streaming Checkpoint and Exactly Once Guarantee on Kafka Direct Stream Mon, 05 Jun, 18:14
ALunar Beach Re: Spark Streaming Checkpoint and Exactly Once Guarantee on Kafka Direct Stream Tue, 06 Jun, 12:55
Aakash Basu Re: Use SQL Script to Write Spark SQL Jobs Mon, 12 Jun, 19:14
Aakash Basu Repartition vs PartitionBy Help/Understanding needed Thu, 15 Jun, 09:27
Aakash Basu Fwd: Repartition vs PartitionBy Help/Understanding needed Fri, 16 Jun, 19:04
Aaron Perrin Re: What is the equivalent of mapPartitions in SpqrkSQL? Tue, 27 Jun, 23:50
Aashish Chaudhary ZeroMQ Streaming in Spark2.x Mon, 26 Jun, 18:58
Aashish Chaudhary Re: ZeroMQ Streaming in Spark2.x Mon, 26 Jun, 21:13
Aashish Chaudhary Re: ZeroMQ Streaming in Spark2.x Tue, 27 Jun, 18:10
Abdulfattah Safa Spark Job is stuck at SUBMITTED when set Driver Memory > Executor Memory Sun, 04 Jun, 11:46
Abdulfattah Safa Spark Job is stuck at SUBMITTED when set Driver Memory > Executor Memory Sun, 04 Jun, 11:51
Abhinay Mehta Re: IDE for python Wed, 28 Jun, 09:06
Alaa Zubaidi (PDF) Using YARN w/o HDFS Wed, 21 Jun, 23:50
Alexander Krasheninnikov Saving RDD as Kryo (broken in 2.1) Wed, 21 Jun, 08:39
Alexander Krasheninnikov Fwd: Saving RDD as Kryo (broken in 2.1) Mon, 26 Jun, 08:28
Alonso Isidoro Roman Re: Spark 2.1 - Infering schema of dataframe after reading json files not during Fri, 02 Jun, 17:25
Alonso Isidoro Roman Re: Java SPI jar reload in Spark Tue, 06 Jun, 10:21
Alonso Isidoro Roman Re: Need Spark(Scala) Performance Tuning tips Fri, 09 Jun, 15:38
Anastasios Zouzias Re: Can we access files on Cluster mode Sun, 25 Jun, 08:39
Anastasios Zouzias Re: Can we access files on Cluster mode Sun, 25 Jun, 09:37
Angel Francisco Orta Re: Parquet file generated by Spark, but not compatible read by Hive Tue, 13 Jun, 05:34
AnilKumar B Configurable Task level time outs and task failures Wed, 14 Jun, 18:01
Anita Tailor Unsubscribe Tue, 20 Jun, 17:22
Anita Tailor Unsubscribe Thu, 22 Jun, 04:17
Anita Tailor Unsubscribe Sun, 25 Jun, 04:24
Anthony Thomas [MLLib]: Executor OutOfMemory in BlockMatrix Multiplication Wed, 14 Jun, 23:07
Anthony Thomas Re: [MLLib]: Executor OutOfMemory in BlockMatrix Multiplication Thu, 15 Jun, 00:28
Anton Kravchenko Java access to internal representation of DataTypes.DateType Tue, 13 Jun, 16:16
Anton Kravchenko Re: Java access to internal representation of DataTypes.DateType Wed, 14 Jun, 14:29
Anton Kravchenko access a broadcasted variable from within ForeachPartitionFunction Java API Thu, 15 Jun, 18:29
Anton Kravchenko Re: access a broadcasted variable from within ForeachPartitionFunction Java API Fri, 23 Jun, 20:46
Anton Okolnychyi Re: Incorrect CAST to TIMESTAMP in Hive compatibility Mon, 05 Jun, 19:59
Anubhav Agarwal Re: Creating Dataframe by querying Impala Thu, 01 Jun, 16:34
Arun RowMatrix: tallSkinnyQR Fri, 09 Jun, 14:33
Aseem Bansal Spark 2.1 - Infering schema of dataframe after reading json files not during Fri, 02 Jun, 14:11
Ashok Kumar Edge Node in Spark Mon, 05 Jun, 20:45
Ashok Kumar Re: Edge Node in Spark Tue, 06 Jun, 16:48
Ashok Kumar how many topics spark streaming can handle Mon, 19 Jun, 19:00
Ashok Kumar Re: how many topics spark streaming can handle Mon, 19 Jun, 19:24
Ashok Kumar RDD and DataFrame persistent memory usage Sun, 25 Jun, 07:13
AssafMendelson having trouble using structured streaming with file sink (parquet) Wed, 14 Jun, 06:21
AssafMendelson spark higher order functions Tue, 20 Jun, 14:02