spark-user mailing list archives: July 2015

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · 24 · Next »Thread · Author · Date
foobar Multiple operations on same DStream in Spark Streaming Sat, 25 Jul, 22:07
fordfarline Problem submiting an script .py against an standalone cluster. Fri, 31 Jul, 02:19
gaurav sharma Worker dies with java.io.IOException: Stream closed Sat, 11 Jul, 17:48
gaurav sharma Re: Worker dies with java.io.IOException: Stream closed Sun, 12 Jul, 11:04
gaurav sharma Re: createDirectStream and Stats Sun, 12 Jul, 11:30
gaurav sharma Re: Spark Streaming Kafka could not find leader offset for Set() Fri, 31 Jul, 05:40
gen tang Strange behavoir of pyspark with --jars option Wed, 15 Jul, 06:15
gireeshp Spark equivalent for Oracle's analytical functions Mon, 06 Jul, 11:06
gisleyt Tasks unevenly distributed in Spark 1.4.0 Wed, 15 Jul, 15:48
glen java.io.IOException: failure to login Tue, 28 Jul, 08:18
gulyasm Iterating over values by Key Tue, 28 Jul, 11:15
guoqing0...@yahoo.com.hk streaming issue Tue, 28 Jul, 02:48
hans ziqiu li TFIDF Transformation Thu, 30 Jul, 17:45
harirajaram Share RDD from SparkR and another application Mon, 13 Jul, 12:30
harirajaram Re: Share RDD from SparkR and another application Tue, 14 Jul, 14:32
harirajaram Re: Share RDD from SparkR and another application Tue, 14 Jul, 14:33
harirajaram Re: SparkR sqlContext or sc not found in RStudio Tue, 21 Jul, 15:05
harirajaram Re: SparkR sqlContext or sc not found in RStudio Tue, 21 Jul, 19:32
harirajaram Re: SparkR sqlContext or sc not found in RStudio Tue, 21 Jul, 20:02
harirajaram Re: How to keep RDDs in memory between two different batch jobs? Wed, 22 Jul, 19:28
harris Re: Reading Avro files from Streaming Wed, 08 Jul, 17:58
hbogert Scheduler delay vs. Getting result time Thu, 09 Jul, 18:44
hermansc Running Spark on user-provided Hadoop installation Thu, 30 Jul, 08:48
huanglr Spark Broadcasting large dataset Fri, 10 Jul, 13:52
huanglr Re: RE: Spark Broadcasting large dataset Fri, 10 Jul, 15:13
huanglr Broadcast HashMap much slower than Array Fri, 24 Jul, 14:12
igor.berman upload to s3, UI Total Duration and Sum of Job Durations Wed, 01 Jul, 07:14
igor.berman 1.4.1 in production Mon, 20 Jul, 08:03
igor.berman log4j.xml bundled in jar vs log4.properties in spark/conf Tue, 21 Jul, 07:57
jamaica Re: Solving Systems of Linear Equations Using Spark? Fri, 03 Jul, 06:17
jan.zi...@centrum.cz spark-ec2 credentials using aws_security_token Fri, 24 Jul, 12:30
jan.zi...@centrum.cz spark spark-ec2 credentials using aws_security_token Mon, 27 Jul, 07:43
java8964 RE: Use rank with distribute by in HiveContext Thu, 16 Jul, 13:41
jay vyas Re: How to build Spark with my own version of Hadoop? Wed, 22 Jul, 12:51
jegordon Remote spark-submit not working with YARN Wed, 08 Jul, 22:19
jegordon Pyspark not working on yarn-cluster mode Thu, 09 Jul, 21:23
jianshu [SparkR] creating dataframe from json file Wed, 15 Jul, 08:42
jianshu Weng Re: [SparkR] creating dataframe from json file Wed, 15 Jul, 13:37
jitender Re: java.lang.OutOfMemoryError: PermGen space Tue, 07 Jul, 16:53
k0ala DataFrame more efficient than RDD? Wed, 15 Jul, 14:41
kachau How to call hiveContext.sql() on all the Hive partitions in parallel? Mon, 06 Jul, 17:21
kachau How do we control output part files created by Spark job? Mon, 06 Jul, 17:23
kachau SparkR Error in sparkR.init(master=“local”) in RStudio Fri, 10 Jul, 16:30
kachau dataFrame.colaesce(1) or dataFrame.reapartition(1) does not seem work for me Fri, 10 Jul, 16:48
kaushal Re: SparkSQL 1.4 can't accept registration of UDF? Thu, 30 Jul, 06:43
keegan pyspark equivalent to Extends Serializable Tue, 21 Jul, 15:50
khaledh Are Spark Streaming RDDs always processed in order? Sat, 04 Jul, 02:12
leonida.gianfagna Re: Sum elements of an iterator inside an RDD Sat, 11 Jul, 18:02
lfiaschi BigQuery connector for pyspark via Hadoop Input Format example Sat, 18 Jul, 11:19
lokeshkumar Spark 1.4.0 compute-classpath.sh Wed, 15 Jul, 16:43
lokeshkumar Spark 1.4.0 org.apache.spark.sql.AnalysisException: cannot resolve 'probability' given input columns Thu, 16 Jul, 10:41
luohui20...@sina.com 回复:Re: got "java.lang.reflect.UndeclaredThrowableException" when running multiply APPs in spark Wed, 01 Jul, 01:58
luohui20...@sina.com All master are unreponsive issue Thu, 02 Jul, 09:31
luohui20...@sina.com 回复:All master are unreponsive issue Fri, 03 Jul, 03:35
luohui20...@sina.com How to shut down spark web UI? Mon, 06 Jul, 09:05
luohui20...@sina.com 回复:Re: How to shut down spark web UI? Tue, 07 Jul, 02:25
luohui20...@sina.com Hibench build fail Tue, 07 Jul, 06:50
luohui20...@sina.com 回复:RE: Hibench build fail Wed, 08 Jul, 07:38
luohui20...@sina.com 回复:回复:RE: Hibench build fail Wed, 08 Jul, 08:14
luohui20...@sina.com HiBench test for hadoop/hive/spark cluster Thu, 16 Jul, 03:53
luohui20...@sina.com 回复:Re: HiBench test for hadoop/hive/spark cluster Thu, 16 Jul, 04:38
madhu phatak Running mllib from R in Spark 1.4 Wed, 15 Jul, 13:00
madhu phatak Re: Running mllib from R in Spark 1.4 Thu, 16 Jul, 02:12
man june looking for helps in using graphx aggregateMessages Fri, 31 Jul, 10:27
manohar override/update options in Dataframe/JdbcRdd Thu, 02 Jul, 13:10
martinibus77 Spark SQL DataFrame: Nullable column and filtering Thu, 30 Jul, 18:19
matd spark ec2 as non-root / any plan to improve that in the future ? Thu, 09 Jul, 07:24
matd what is metadata in StructField ? Wed, 15 Jul, 09:48
mathewvinoj spark cache issue while doing saveAsTextFile and saveAsParquetFile Wed, 15 Jul, 05:03
maxdml Master doesn't start, no logs Mon, 06 Jul, 19:24
maxdml Re: Spark standalone cluster - Output file stored in temporary directory in worker Tue, 07 Jul, 00:48
maxdml Re: Spark standalone cluster - Output file stored in temporary directory in worker Tue, 07 Jul, 15:14
maxdml Re: akka.remote.transport.Transport$InvalidAssociationException: The remote system terminated the association because it is shutting down Wed, 08 Jul, 11:56
maxdml Re: Issues when combining Spark and a third party java library Fri, 10 Jul, 13:21
maxdml Re: Issues when combining Spark and a third party java library Fri, 10 Jul, 21:47
maxdml Re: Is it possible to change the default port number 7077 for spark? Sun, 12 Jul, 18:15
maxdml HDFS performances + unexpected death of executors. Mon, 13 Jul, 20:48
maxdml Re: How to make my spark implementation parallel? Mon, 13 Jul, 21:03
maxdml Re: How to make my spark implementation parallel? Mon, 13 Jul, 21:19
michal.klo...@gmail.com Re: Reading SequenceFiles from S3 with PySpark on EMR causes RACK_LOCAL locality Sat, 18 Jul, 02:04
micvog Spark Streaming broadcast to all keys Fri, 03 Jul, 11:34
micvog Does spark guarantee that the same task will process the same key over time? Thu, 09 Jul, 19:30
micvog Best way to avoid updateStateByKey from running without data Fri, 10 Jul, 11:30
mike streamingContext.stop(true,true) doesn't end the job Wed, 29 Jul, 16:31
n...@reactor8.com RE: Benchmark results between Flink and Spark Mon, 06 Jul, 04:10
nib...@free.fr Best practice for transforming and storing from Spark to Mongo/HDFS Sat, 25 Jul, 14:14
nitinkalra2000 Re: JdbcRDD and ClassTag issue Mon, 20 Jul, 12:11
nitinkalra2000 Apache Spark : spark.eventLog.dir on Windows Environment Mon, 20 Jul, 12:15
nizang cores and resource management Sun, 05 Jul, 19:52
nizang SnappyCompressionCodec on the master Wed, 08 Jul, 07:47
nkd output folder structure not getting commited and remains as _temporary Wed, 01 Jul, 01:28
ogoh SparkSQL 1.4 can't accept registration of UDF? Wed, 15 Jul, 00:10
pedro Re: .NET on Apache Spark? Thu, 02 Jul, 19:26
pedro Misaligned Rows with UDF Tue, 14 Jul, 21:34
pedro Python DataFrames, length of array Wed, 15 Jul, 23:05
pedro Python DataFrames: length of ArrayType Wed, 15 Jul, 23:31
phagunbaya Scaling spark cluster for a running application Wed, 22 Jul, 11:20
pjmccarthy Twitter4J streaming question Thu, 23 Jul, 19:23
plazaster Re: Kmeans Labeled Point RDD Mon, 20 Jul, 06:37
pnpritchard DataFrame.withColumn() recomputes columns even after cache() Tue, 14 Jul, 20:30
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · 24 · Next »Thread · Author · Date
Box list
Sep 202071
Aug 2020166
Jul 2020248
Jun 2020263
May 2020282
Apr 2020335
Mar 2020232
Feb 2020136
Jan 2020141
Dec 2019138
Nov 2019125
Oct 2019124
Sep 2019160
Aug 2019187
Jul 2019193
Jun 2019265
May 2019317
Apr 2019263
Mar 2019248
Feb 2019186
Jan 2019244
Dec 2018202
Nov 2018235
Oct 2018275
Sep 2018235
Aug 2018262
Jul 2018309
Jun 2018377
May 2018386
Apr 2018410
Mar 2018444
Feb 2018383
Jan 2018332
Dec 2017350
Nov 2017267
Oct 2017410
Sep 2017452
Aug 2017525
Jul 2017520
Jun 2017645
May 2017549
Apr 2017564
Mar 2017621
Feb 2017744
Jan 2017889
Dec 2016865
Nov 20161118
Oct 20161115
Sep 20161402
Aug 20161564
Jul 20161684
Jun 20161457
May 20161496
Apr 20161411
Mar 20162044
Feb 20161799
Jan 20161740
Dec 20151870
Nov 20151541
Oct 20152041
Sep 20152125
Aug 20151978
Jul 20152343
Jun 20152366
May 20151864
Apr 20152314
Mar 20152577
Feb 20152187
Jan 20152152
Dec 20141937
Nov 20142024
Oct 20142244
Sep 20142094
Aug 20141949
Jul 20142389
Jun 20141773
May 20141397
Apr 20141459
Mar 20141286
Feb 20141029
Jan 2014925
Dec 2013611
Nov 2013558
Oct 2013505
Sep 2013235
Aug 201397
Jul 20137