spark-user mailing list archives: May 2015

Tamas Jambor Re: store hive metastore on persistent store Sat, 16 May, 13:07
Antony Mayi IF in SQL statement Sat, 16 May, 14:21
Don Drake Re: Spark sql and csv data processing question Sat, 16 May, 14:45
smazumder Spark SQL is not able to connect to hive metastore Sat, 16 May, 15:14
ayan guha Re: IF in SQL statement Sat, 16 May, 15:17
ayan guha Re: Spark SQL is not able to connect to hive metastore Sat, 16 May, 15:48
ayan guha Re: Spark SQL is not able to connect to hive metastore Sat, 16 May, 15:49
Sourav Mazumder Re: Spark SQL is not able to connect to hive metastore Sat, 16 May, 16:40
ayan guha Re: How to reshape RDD/Spark DataFrame Sat, 16 May, 17:12
Ram Sriharsha Re: Getting the best parameter set back from CrossValidatorModel Sat, 16 May, 17:44
ayan guha Re: Spark SQL is not able to connect to hive metastore Sat, 16 May, 17:44
Nisrina Luthfiyati Re: Grouping and storing unordered time series data stream to HDFS Sat, 16 May, 18:00
Yana Kadiyska Re: store hive metastore on persistent store Sat, 16 May, 18:41
Fernando O. Problem building master on 2.11 Sat, 16 May, 19:09
Tamas Jambor Re: store hive metastore on persistent store Sat, 16 May, 19:10
jaredtims RE: Running Spark/YARN on AWS EMR - Issues finding file on hdfs? Sat, 16 May, 20:52
jaredtims Re: zip files submitted with --py-files disappear from hdfs after a while on EMR Sat, 16 May, 20:57
yaochunnan How can I do pair-wise computation between RDD feature columns? Sun, 17 May, 01:13
Davies Liu Re: PySpark: slicing issue with dataframes Sun, 17 May, 08:05
Davies Liu Re: how to set random seed Sun, 17 May, 08:12
Davies Liu Re: Multiple DataFrames per Parquet file? Sun, 17 May, 08:18
xiaohe lan println in spark-shell Sun, 17 May, 09:01
Sean Owen Re: println in spark-shell Sun, 17 May, 10:07
Slim Baltagi Big Data Day LA: FREE Big Data Conference in Los Angeles on June 27, 2015 Sun, 17 May, 13:47
dgoldenberg Spark Streaming and reducing latency Sun, 17 May, 13:51
Akhil Das Re: Spark Streaming and reducing latency Sun, 17 May, 15:04
Akhil Das Re: number of executors Sun, 17 May, 15:16
Akhil Das Re: Forbidded : Error Code: 403 Sun, 17 May, 15:21
Akhil Das Re: textFileStream Question Sun, 17 May, 15:25
Akhil Das Re: [SparkStreaming] Is it possible to delay the start of some DStream in the application? Sun, 17 May, 15:28
mas Effecient way to fetch all records on a particular node/partition in GraphX Sun, 17 May, 15:32
xiaohe lan Re: number of executors Sun, 17 May, 15:50
xiaohe lan Re: number of executors Sun, 17 May, 15:59
Jan-Paul Bultmann Re: Best practice to avoid ambiguous columns in DataFrame.join Sun, 17 May, 16:31
Evo Eftimov RE: [SparkStreaming] Is it possible to delay the start of some DStream in the application? Sun, 17 May, 16:39
Evo Eftimov RE: Spark Streaming and reducing latency Sun, 17 May, 16:55
Justin Pihony Trying to understand sc.textFile better Sun, 17 May, 17:01
MUHAMMAD AAMIR Re: Data partitioning and node tracking in Spark-GraphX Sun, 17 May, 17:55
Ankur Dave Re: Effecient way to fetch all records on a particular node/partition in GraphX Sun, 17 May, 18:45
Vadim Bichutskiy Re: textFileStream Question Sun, 17 May, 19:24
Michael Armbrust Re: Best practice to avoid ambiguous columns in DataFrame.join Sun, 17 May, 19:41
Peng Cheng Union of checkpointed RDD in Apache Spark has long (> 10 hour) between-stage latency Sun, 17 May, 21:58
Peng Cheng Re: Union of checkpointed RDD in Apache Spark has long (> 10 hour) between-stage latency Sun, 17 May, 22:37
Peng Cheng Re: Union of checkpointed RDD in Apache Spark has long (> 10 hour) between-stage latency Sun, 17 May, 23:15
Peng Cheng Re: Union of checkpointed RDD in Apache Spark has long (> 10 hour) between-stage latency Sun, 17 May, 23:19
Rajdeep Dua InferredSchema Example in Spark-SQL Mon, 18 May, 00:07
Cheng, Hao RE: InferredSchema Example in Spark-SQL Mon, 18 May, 00:28
Ram Sriharsha Re: InferredSchema Example in Spark-SQL Mon, 18 May, 00:30
Cheng, Hao RE: InferredSchema Example in Spark-SQL Mon, 18 May, 00:41
Haopu Wang RE: [SparkStreaming] Is it possible to delay the start of some DStream in the application? Mon, 18 May, 02:51
Rajdeep Dua Re: InferredSchema Example in Spark-SQL Mon, 18 May, 03:05
Rajdeep Dua Re: InferredSchema Example in Spark-SQL Mon, 18 May, 03:05
Ram Sriharsha Re: InferredSchema Example in Spark-SQL Mon, 18 May, 03:07
Fengyun RAO Re: --jars works in "yarn-client" but not "yarn-cluster" mode, why? Mon, 18 May, 03:49
Justin Yip Re: Getting the best parameter set back from CrossValidatorModel Mon, 18 May, 05:17
Justin Yip Implementing custom metrics under MLPipeline's BinaryClassificationEvaluator Mon, 18 May, 05:35
Simon Elliston Ball Re: InferredSchema Example in Spark-SQL Mon, 18 May, 05:40
MEETHU MATHEW Re: How to run multiple jobs in one sparkcontext from separate threads in pyspark? Mon, 18 May, 06:35
Xiangrui Meng Re: StandardScaler failing with OOM errors in PySpark Mon, 18 May, 06:49
Xiangrui Meng Re: bug: numClasses is not a valid argument of LogisticRegressionWithSGD Mon, 18 May, 06:50
Xiangrui Meng Re: MLLib SVMWithSGD is failing for large dataset Mon, 18 May, 06:55
ayan guha Re: How to run multiple jobs in one sparkcontext from separate threads in pyspark? Mon, 18 May, 07:34
Yi.Zhang How to debug spark in IntelliJ Idea Mon, 18 May, 07:37
Pa Rö Fwd: Spark and Flink Mon, 18 May, 08:00
patcharee AccessControlException hive table created from spark shell Mon, 18 May, 09:27
Pa Rö k-means core function for temporal geo data Mon, 18 May, 09:30
Steve Loughran Re: Spark's Guava pieces cause exceptions in non-trivial deployments Mon, 18 May, 10:05
hotienvu NullPointerException when accessing broadcast variable in DStream Mon, 18 May, 10:07
MEETHU MATHEW Re: Restricting the number of iterations in Mllib Kmeans Mon, 18 May, 10:22
Chandra Mohan, Ananda Vel Murugan Spark sql error while writing Parquet file- Trying to write more fields than contained in row Mon, 18 May, 10:29
Dmitry Goldenberg Re: Spark Streaming and reducing latency Mon, 18 May, 10:46
Evo Eftimov RE: Spark Streaming and reducing latency Mon, 18 May, 11:12
ayan guha Re: Spark sql error while writing Parquet file- Trying to write more fields than contained in row Mon, 18 May, 11:49
Evo Eftimov RE: Spark Streaming and reducing latency Mon, 18 May, 11:50
Robert Metzger Re: Spark and Flink Mon, 18 May, 12:05
Akhil Das Re: Spark Streaming and reducing latency Mon, 18 May, 13:03
Evo Eftimov RE: Spark Streaming and reducing latency Mon, 18 May, 13:23
Akhil Das Re: Spark Streaming and reducing latency Mon, 18 May, 13:28
Guillermo Ortiz Working with slides. How do I know how many times a RDD has been processed? Mon, 18 May, 13:36
Laeeq Ahmed Processing multiple columns in parallel Mon, 18 May, 13:37
Evo Eftimov RE: Spark Streaming and reducing latency Mon, 18 May, 13:38
Mohammad Tariq Re: Forbidded : Error Code: 403 Mon, 18 May, 13:49
juandasgandaras Spark streaming over a rest API Mon, 18 May, 14:21
Oleg Ruchovets pass configuration parameters to PySpark job Mon, 18 May, 14:26
Ricardo Goncalves da Silva parsedData option Mon, 18 May, 14:32
ayan guha Re: Processing multiple columns in parallel Mon, 18 May, 14:46
Needham, Guy RE: Processing multiple columns in parallel Mon, 18 May, 15:50
Akhil Das RE: Spark Streaming and reducing latency Mon, 18 May, 15:56
Sandy Ryza Re: number of executors Mon, 18 May, 16:07
Sandy Ryza Re: number of executors Mon, 18 May, 16:07
Rajdeep Dua Re: InferredSchema Example in Spark-SQL Mon, 18 May, 16:19
Shay Rojansky py-files (and others?) not properly set up in cluster-mode Spark Yarn job? Mon, 18 May, 16:38
edward cui Re: number of executors Mon, 18 May, 16:53
edward cui Re: number of executors Mon, 18 May, 16:55
Akhil Das Re: Reading Real Time Data only from Kafka Mon, 18 May, 17:00
Akhil Das Re: Spark streaming over a rest API Mon, 18 May, 17:02
zia_kayani org.apache.spark.shuffle.FetchFailedException :: Migration from Spark 1.2 to 1.3 Mon, 18 May, 17:19
tomboyle Spark groupByKey, does it always create at least 1 partition per key? Mon, 18 May, 17:38
Marcelo Vanzin Re: py-files (and others?) not properly set up in cluster-mode Spark Yarn job? Mon, 18 May, 18:02
Evo Eftimov RE: Spark Streaming and reducing latency Mon, 18 May, 18:28