spark-user mailing list archives: March 2016

Akhil Das Re: Issue with wholeTextFiles Tue, 22 Mar, 06:58
Akhil Das Re: Issue with wholeTextFiles Tue, 22 Mar, 09:02
Akhil Das Re: Null pointer exception when using com.databricks.spark.csv Wed, 30 Mar, 06:57
Akhil Das Re: aggregateByKey on PairRDD Wed, 30 Mar, 07:01
Akhil Das Re: Master options Cluster/Client descrepencies. Wed, 30 Mar, 07:11
Akhil Das Re: Unable to Limit UI to localhost interface Wed, 30 Mar, 07:25
Akhil Das Re: Spark streaming spilling all the data to disk even if memory available Wed, 30 Mar, 07:28
Akhil Das Re: Spark streaming spilling all the data to disk even if memory available Thu, 31 Mar, 12:47
Alan Braithwaite Re: Use cases for kafka direct stream messageHandler Wed, 09 Mar, 19:19
Alex Re: Hive Context: Hive Metastore Client Tue, 08 Mar, 23:28
Alex Re: Hive Context: Hive Metastore Client Wed, 09 Mar, 00:24
Alex Dzhagriev Re: Sorting the RDD Thu, 03 Mar, 09:02
Alex Dzhagriev Re: Enabling spark_shuffle service without restarting YARN Node Manager Wed, 16 Mar, 10:30
Alex F Hive Context: Hive Metastore Client Tue, 08 Mar, 22:23
Alex Kozlov Re: Spark on RAID Tue, 08 Mar, 16:45
Alex Kozlov Re: sparkR issues ? Tue, 15 Mar, 06:58
Alex Kozlov Re: sparkR issues ? Tue, 15 Mar, 16:51
AlexModestov sql functions: row_number, percent_rank, rank,rowNumber Thu, 10 Mar, 09:24
Alexander Krasnukhin Re: Converting String to Datetime using map Thu, 24 Mar, 22:00
Alexander Krasnukhin Re: Custom RDD in spark, cannot find custom method Sun, 27 Mar, 17:09
Alexander Krasnukhin Re: Custom RDD in spark, cannot find custom method Sun, 27 Mar, 18:16
Alexander Krasnukhin Re: Aggregate subsequenty x row values together. Mon, 28 Mar, 16:44
Alexander Krasnukhin Re: [Spark SQL] Unexpected Behaviour Mon, 28 Mar, 22:00
Alexander Krasnukhin Re: looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 00:55
Alexander Krasnukhin Re: looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 17:14
Alexander Krasnukhin Re: looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 17:42
Alexander Pivovarov Re: EMR 4.3.0 spark 1.6 shell problem Tue, 01 Mar, 19:26
Alexander Pivovarov Is there Graph Partitioning impl for Scala/Spark? Thu, 10 Mar, 06:40
Alexander Pivovarov Re: Graphx Fri, 11 Mar, 18:12
Alexander Pivovarov Re: Is there Graph Partitioning impl for Scala/Spark? Fri, 11 Mar, 18:17
Alexander Pivovarov Re: YARN process with Spark Fri, 11 Mar, 22:56
Alexander Pivovarov Re: YARN process with Spark Fri, 11 Mar, 23:01
Alexander Pivovarov Re: YARN process with Spark Fri, 11 Mar, 23:29
Alexander Pivovarov Re: Spark with Yarn Client Sat, 12 Mar, 04:18
Alexander Pivovarov Re: YARN process with Spark Mon, 14 Mar, 16:59
Alexander Pivovarov Re: Testing spark with AWS spot instances Sun, 27 Mar, 06:50
Alexander Pivovarov Re: Running Spark on Yarn Tue, 29 Mar, 21:01
Alexander Pivovarov Re: Running Spark on Yarn Tue, 29 Mar, 21:22
Alexander Pivovarov Re: Running Spark on Yarn Tue, 29 Mar, 21:45
Alexander Pivovarov Re: Running Spark on Yarn Tue, 29 Mar, 21:57
Alexander Pivovarov Re: Spark and N-tier architecture Tue, 29 Mar, 22:50
Alexander Pivovarov Re: Spark and N-tier architecture Tue, 29 Mar, 23:54
Alexander Pivovarov Re: Spark and N-tier architecture Tue, 29 Mar, 23:56
Alexis Roos Re: Graphx Fri, 11 Mar, 18:23
Allen George Restarting an executor during execution causes it to lose AWS credentials (anyone seen this?) Thu, 17 Mar, 16:01
Alonso Isidoro Roman What version of twitter4j should I use with Spark Streaming?UPDATING thread Tue, 01 Mar, 17:08
Alonso Isidoro Roman Re: Problem with jackson lib running on spark Thu, 31 Mar, 17:52
Alonso Isidoro Roman Re: Problem with jackson lib running on spark Thu, 31 Mar, 18:08
Andres.Fernan...@wellsfargo.com RE: Save DataFrame to Hive Table Tue, 01 Mar, 17:00
Andres.Fernan...@wellsfargo.com Union Parquet, DataFrame Tue, 01 Mar, 17:01
Andres.Fernan...@wellsfargo.com RE: Union Parquet, DataFrame Tue, 01 Mar, 17:18
Andres.Fernan...@wellsfargo.com Rename Several Aggregated Columns Fri, 18 Mar, 16:10
Andres.Fernan...@wellsfargo.com RE: Rename Several Aggregated Columns Tue, 22 Mar, 17:18
Andrew A Graphx Thu, 10 Mar, 21:44
Andrew Heinrichs Unsubscribe Tue, 29 Mar, 01:36
Andrew Or Re: Using dynamic allocation and shuffle service in Standalone Mode Tue, 08 Mar, 19:39
Andrew Or Re: No event log in /tmp/spark-events Tue, 08 Mar, 19:46
Andy Dang Re: an OOM while persist as DISK_ONLY Thu, 03 Mar, 23:59
Andy Davidson understanding performance what does it mean if a stage is skipped? Mon, 07 Mar, 20:42
Andy Davidson streaming app performance when would increasing execution size or adding more cores Mon, 07 Mar, 21:53
Andy Davidson how to implement and deploy robust streaming apps Mon, 07 Mar, 22:10
Andy Davidson streaming will I loose data if spark.streaming.backpressure.enabled=true Mon, 07 Mar, 22:27
Andy Davidson Re: Spark Streaming, very slow processing and increasing scheduling delay of kafka input stream Tue, 08 Mar, 01:13
Andy Davidson pyspark spark-cassandra-connector java.io.IOException: Failed to open native connection to Cassandra at {192.168.1.126}:9042 Wed, 09 Mar, 02:02
Andy Davidson Re: pyspark spark-cassandra-connector java.io.IOException: Failed to open native connection to Cassandra at {192.168.1.126}:9042 Wed, 09 Mar, 02:25
Andy Davidson Re: pyspark spark-cassandra-connector java.io.IOException: Failed to open native connection to Cassandra at {192.168.1.126}:9042 Wed, 09 Mar, 19:11
Andy Davidson trouble with NUMPY constructor in UDF Wed, 09 Mar, 23:09
Andy Davidson Re: Spark Streaming, very slow processing and increasing scheduling delay of kafka input stream Thu, 10 Mar, 22:40
Andy Davidson Re: trouble with NUMPY constructor in UDF Thu, 10 Mar, 22:52
Andy Davidson Re: trouble with NUMPY constructor in UDF Thu, 10 Mar, 22:56
Andy Davidson Re: Spark Streaming, very slow processing and increasing scheduling delay of kafka input stream Thu, 10 Mar, 23:06
Andy Davidson newbie HDFS S3 best practices Tue, 15 Mar, 18:45
Andy Davidson Re: newbie HDFS S3 best practices Tue, 15 Mar, 20:43
Andy Davidson what is the pyspark inverse of registerTempTable()? Tue, 15 Mar, 23:40
Andy Davidson Re: what is the pyspark inverse of registerTempTable()? Wed, 16 Mar, 00:27
Andy Davidson best practices: running multi user jupyter notebook server Wed, 16 Mar, 17:36
Andy Davidson unix_timestamp() time zone problem Thu, 17 Mar, 20:28
Andy Davidson sql timestamp timezone bug Thu, 17 Mar, 22:02
Andy Davidson Re: sql timestamp timezone bug Thu, 17 Mar, 22:25
Andy Davidson Re: sql timestamp timezone bug Fri, 18 Mar, 18:10
Andy Davidson bug spark should not use java.sql.timestamp was: sql timestamp timezone bug Fri, 18 Mar, 19:16
Andy Davidson pyspark sql convert long to timestamp? Mon, 21 Mar, 23:19
Andy Davidson Re: pyspark sql convert long to timestamp? Wed, 23 Mar, 00:36
Andy Davidson Re: --packages configuration equivalent item name? Mon, 28 Mar, 15:44
Andy Davidson looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 00:15
Andy Davidson pyspark unable to convert dataframe column to a vector: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient Tue, 29 Mar, 01:29
Andy Davidson Re: Sending events to Kafka from spark job Tue, 29 Mar, 16:40
Andy Davidson Re: looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 16:58
Andy Davidson Re: looking for an easy to to find the max value of a column in a data frame Tue, 29 Mar, 17:50
Andy Davidson Vectors.sparse exception: TypeError: indices array must be sorted Tue, 29 Mar, 18:42
Andy Davidson data frame problem preserving sort order with repartition() and coalesce() Tue, 29 Mar, 21:49
Andy Sloane getPreferredLocations race condition in spark 1.6.0? Wed, 02 Mar, 23:46
Andy Sloane Re: getPreferredLocations race condition in spark 1.6.0? Thu, 03 Mar, 00:28
Andy Sloane Saving multiple outputs in the same job Wed, 09 Mar, 01:31
Andy Sloane Re: binary file deserialization Wed, 09 Mar, 22:57
Angel Angel Connect the two tables in spark sql Wed, 02 Mar, 03:13
Angel Angel Spark sql query taking long time Thu, 03 Mar, 05:33
Angel Angel Sorting the RDD Thu, 03 Mar, 07:39
Angel Angel Sorting the dataframe Fri, 04 Mar, 08:18
Angel Angel Add the sql record having same field. Sun, 06 Mar, 06:28