Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23994) Add Host To Blacklist If Shuffle Cannot Complete |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24904) Join with broadcasted dataframe causes shuffle of redundant data |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25377) spark sql dataframe cache is invalid |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23258) Should not split Arrow record batches based on row count |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16707) TransportClientFactory.createClient may throw NPE |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23981) ShuffleBlockFetcherIterator - Spamming Logs |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25361) Support for Kinesis Client Library 2.0 |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24474) Cores are left idle when there are a lot of tasks to run |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24560) Fix some getTimeAsMs as getTimeAsSeconds |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15882) Discuss distributed linear algebra in spark.ml package |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16534) Kafka 0.10 Python support |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24095) Spark Streaming performance drastically drops when when saving dataframes with withColumn |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21624) Optimize communication cost of RF/GBT/DT |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23442) Reading from partitioned and bucketed table uses only bucketSpec.numBuckets partitions in all cases |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-14585) Provide accessor methods for Pipeline stages |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25230) Upper behavior incorrect for string contains "ß" |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25215) Make PipelineModel public |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23073) Fix incorrect R doc page header for generated sql functions |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22943) OneHotEncoder supports manual specification of categorySizes |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23571) Delete auxiliary Kubernetes resources upon application completion |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24946) PySpark - Allow np.Arrays and pd.Series in df.approxQuantile |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24440) When use constant as column we may get wrong answer versus impala |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24059) When blacklist disable always hash to a bad local directory may cause job failure |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24986) OOM in BufferHolder during writes to a stream |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25329) Support passing Kerberos configuration information |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22461) Move Spark ML model summaries into a dedicated package |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22823) Race Condition when reading Broadcast shuffle input. Failed to get broadcast piece |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24830) Problem with logging on Glassfish |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24074) Maven package resolver downloads javadoc instead of jar |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17877) Can not checkpoint connectedComponents resulting graph |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23485) Kubernetes should support node blacklist |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21972) Allow users to control input data persistence in ML Estimators via a handlePersistence ml.Param |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24202) Separate SQLContext dependency from SparkSession.implicits |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25059) Exception while executing an action on DataFrame that read Json |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23058) Show create table can't show non printable field delim |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-3723) DecisionTree, RandomForest: Add more instrumentation |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24210) incorrect handling of boolean expressions when using column in expressions in pyspark.sql.DataFrame filter function |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25480) Dynamic partitioning + saveAsTable with multiple partition columns create empty directory |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-10413) ML models should support prediction on single instances |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20598) Iterative checkpoints do not get removed from HDFS |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23612) Specify formats for individual DateType and TimestampType columns in schemas |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24200) Read subdirectories with out asterisks |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24406) Exposing custom spark scala ml transformers in pyspark |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25024) Update mesos documentation to be clear about security supported |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25244) [Python] Setting `spark.sql.session.timeZone` only partially respected |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24608) report number of iteration/progress for ML training |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22868) 64KB JVM bytecode limit problem with aggregation |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25125) Spark SQL percentile_approx takes longer than Hive version for large datasets |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22055) Port release scripts |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-7206) Gaussian Mixture Model (GMM) improvements |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25165) Cannot parse Hive Struct |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17129) Support statistics collection and cardinality estimation for partitioned tables |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24265) lintr checks not failing PR build |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25232) Support Full-Text Search in Spark SQL |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24568) Code refactoring for DataType equalsXXX methods |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23322) Launcher handles can miss application updates if application finishes too quickly |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22054) Allow release managers to inject their keys |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23452) Extend test coverage to all ORC readers |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23632) sparkR.session() error with spark packages - JVM is not ready after 10 seconds |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20007) Make SparkR apply() functions robust to workers that return empty data.frame |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24461) Snapshot Cache |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21812) PySpark ML Models should not depend transfering params from Java |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-12449) Pushing down arbitrary logical plans to data sources |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-8696) Streaming API for Online LDA |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24650) GroupingSet |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24862) Spark Encoder is not consistent to scala case class semantic for multiple argument lists |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24597) Spark ML Pipeline Should support non-linear models => DAGPipeline |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15694) Implement ScriptTransformation in sql/core |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24845) spark distribution generate exception while locally worked correctly |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22658) SPIP: TeansorFlowOnSpark as a Scalable Deep Learning Lib of Apache Spark |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24623) Hadoop - Spark Cluster - Python XGBoost - Not working in distributed mode |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19498) Discussion: Making MLlib APIs extensible for 3rd party libraries |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17602) PySpark - Performance Optimization Large Size of Broadcast Variable |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24016) Yarn does not update node blacklist in static allocation |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23068) Jekyll doc build error does not fail build |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22780) make insert commands have real children to fix UI issues |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-8767) Abstractions for InputColParam, OutputColParam |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20885) JDBC predicate pushdown uses hardcoded date format |
Tue, 08 Oct, 05:45 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20782) Dataset's isCached operator |
Tue, 08 Oct, 05:45 |
Maxim Gekk (Jira) |
[jira] [Commented] (SPARK-24640) size(null) returns null |
Tue, 08 Oct, 05:53 |
angerszhu (Jira) |
[jira] [Created] (SPARK-29379) SHOW FUNCTIONS don't show '!=', '<>' , 'between', 'case' |
Tue, 08 Oct, 05:59 |
zhengruifeng (Jira) |
[jira] [Resolved] (SPARK-29269) Pyspark ALSModel support getters/setters |
Tue, 08 Oct, 06:07 |
Maxim Gekk (Jira) |
[jira] [Comment Edited] (SPARK-24640) size(null) returns null |
Tue, 08 Oct, 06:12 |
zhengruifeng (Jira) |
[jira] [Commented] (SPARK-29269) Pyspark ALSModel support getters/setters |
Tue, 08 Oct, 06:14 |
angerszhu (Jira) |
[jira] [Commented] (SPARK-29379) SHOW FUNCTIONS don't show '!=', '<>' , 'between', 'case' |
Tue, 08 Oct, 06:18 |
Xiao Li (Jira) |
[jira] [Resolved] (SPARK-29366) Subqueries created for DPP are not printed in EXPLAIN FORMATTED |
Tue, 08 Oct, 06:40 |
zhengruifeng (Jira) |
[jira] [Issue Comment Deleted] (SPARK-29269) Pyspark ALSModel support getters/setters |
Tue, 08 Oct, 06:44 |
zhengruifeng (Jira) |
[jira] [Assigned] (SPARK-29269) Pyspark ALSModel support getters/setters |
Tue, 08 Oct, 06:44 |
zhengruifeng (Jira) |
[jira] [Commented] (SPARK-29212) Add common classes without using JVM backend |
Tue, 08 Oct, 07:37 |
Maxim Gekk (Jira) |
[jira] [Reopened] (SPARK-24640) size(null) returns null |
Tue, 08 Oct, 07:39 |
zhengruifeng (Jira) |
[jira] [Created] (SPARK-29380) RFormula avoid repeated 'first' jobs to get vector size |
Tue, 08 Oct, 07:43 |
zhengruifeng (Jira) |
[jira] [Created] (SPARK-29381) Add 'private' _XXXParams classes for classification & regression |
Tue, 08 Oct, 07:49 |
huangtianhua (Jira) |
[jira] [Commented] (SPARK-29222) Flaky test: pyspark.mllib.tests.test_streaming_algorithms.StreamingLinearRegressionWithTests.test_parameter_convergence |
Tue, 08 Oct, 08:21 |
angerszhu (Jira) |
[jira] [Issue Comment Deleted] (SPARK-29379) SHOW FUNCTIONS don't show '!=', '<>' , 'between', 'case' |
Tue, 08 Oct, 09:07 |
Philipp Angerer (Jira) |
[jira] [Reopened] (SPARK-9636) Treat $SPARK_HOME as write-only |
Tue, 08 Oct, 09:18 |
Maxim Gekk (Jira) |
[jira] [Created] (SPARK-29382) Support the `INTERVAL` type by Parquet datasource |
Tue, 08 Oct, 09:39 |
Maxim Gekk (Jira) |
[jira] [Created] (SPARK-29383) Support the optional prefix `@` in interval strings |
Tue, 08 Oct, 09:45 |
Maxim Gekk (Jira) |
[jira] [Created] (SPARK-29384) Support `ago` in interval strings |
Tue, 08 Oct, 09:48 |