Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24122) Allow automatic driver restarts on K8s |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24617) Spark driver not requesting another executor once original executor exits due to 'lost worker' |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24585) Adding ability to audit file system before and after test to ensure all files are cleaned up. |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21076) R dapply doesn't return array or raw columns when array have different length |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24750) HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE permission denied even if select table operation |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18245) Improving support for bucketed table |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25107) Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy, tree: CatalogRelation Errors |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15516) Schema merging in driver fails for parquet when merging LongType and IntegerType |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15691) Refactor and improve Hive support |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24266) Spark client terminates while driver is still running |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25219) KMeans Clustering - Text Data - Results are incorrect |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15777) Catalog federation |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24550) Add support for Kubernetes specific metrics |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24114) improve instrumentation for spark.ml.recommendation |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20618) Support Custom Partitioners in PySpark |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21016) Improve code fault tolerance for converting string to number |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24118) Support lineSep format independent from encoding |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22964) don't allow task restarts for continuous processing |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-12014) Spark SQL query containing semicolon is broken in Beeline (related to HIVE-11100) |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23498) Accuracy problem in comparison with string and integer |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24051) Incorrect results for certain queries using Java and Python APIs on Spark 2.3.0 |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22963) Make failure recovery global and automatic for continuous processing. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21122) Address starvation issues when dynamic allocation is enabled |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20624) Add better handling for node shutdown |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24733) Dataframe saved to parquet can have different metadata then the resulting parquet file |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24426) Unexpected combination of cache and join on DataFrame |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20691) Difference between Storage Memory as seen internally and in web UI |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24448) File not found on the address SparkFiles.get returns on standalone cluster |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25459) Add viewOriginalText back to CatalogTable |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21927) scalastyle 1.0.0 generates SBT warnings |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24306) Sort a Dataset with a lambda (like RDD.sortBy) |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23210) Introduce the concept of default value to schema |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting) |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-3727) Trees and ensembles: More prediction functionality |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17248) Add native Scala enum support to Dataset Encoders |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24258) SPIP: Improve PySpark support for ML Matrix and Vector types |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25263) Add scheduler integration test for SPARK-24909 |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23689) Spark 2.3.0/2.2.1 Some changes cause org.apache.spark.sql.catalyst.errors.package$TreeNodeException: |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24011) Cache rdd's immediate parent ShuffleDependencies to accelerate getShuffleDependencies() |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18649) sc.textFile(my_file).collect() raises socket.timeout on large files |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24866) Artifactual ROC scores when scaling up Random Forest classifier |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22391) add `MetadataCreationSupport` trait to separate data and metadata handling at write path |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24218) Allow Configuration of DynamoDbEndpointUrl in KinesisReceiver |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25193) insert overwrite doesn't throw exception when drop old data fails |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16217) Support SELECT INTO statement |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23879) Introduce MemoryBlock API instead of Platform API with Object |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25198) org.apache.spark.sql.catalyst.parser.ParseException: DataType json is not supported. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23705) dataframe.groupBy() may inadvertently receive sequence of non-distinct strings |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-13346) Using DataFrames iteratively leads to slow query planning |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20174) Analyzer gives mysterious AnalysisException when posexplode used in withColumn |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19536) Improve capability to merge SQL data types |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21068) SparkR error message when passed an R object rather than Java object could be more informative |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24735) Improve exception when mixing up pandas_udf types |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24618) Allow ability to consume driver memory on worker hosts not master (option for clustermode to wait for returncode?) |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24362) SUM function precision issue |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20915) lpad/rpad with empty pad string different from MySQL |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22925) ml model persistence creates a lot of small files |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23954) Converting spark dataframe containing int64 fields to R dataframes leads to impredictable errors. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25242) Suggestion to make sql config setting fluent |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21722) Enable timezone-aware timestamp type when creating Pandas DataFrame. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23742) Filter out redundant AssociationRules |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22600) Fix 64kb limit for deeply nested expressions under wholestage codegen |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22743) Consolidate logic for handling spark.driver.memoryOverhead and spark.executor.memoryOverhead |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22926) Respect table-level conf compression codec `Compression` in multiple scenarios |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24841) Memory leak in converting spark dataframe to pandas dataframe |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17265) EdgeRDD Difference throws an exception |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23982) NoSuchMethodException: There is no startCredentialUpdater method in the object YarnSparkHadoopUtil |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23673) PySpark dayofweek does not conform with ISO 8601 |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22632) Fix the behavior of timestamp values for R's DataFrame to respect session timezone |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25070) BlockFetchingListener#onBlockFetchSuccess throw "java.util.NoSuchElementException: key not found: shuffle_8_68_113" on ShuffleBlockFetcherIterator caused stage hang long time |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23952) remove type parameter in DataReaderFactory |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23350) [SS]Exception when stopping continuous processing application |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24189) Spark Strcutured Streaming not working with the Kafka Transactions |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-2620) case class cannot be used as key for reduce |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25316) Spark error - ERROR ContextCleaner: Error cleaning broadcast 22, Exception thrown in awaitResult: |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24693) Row order preservation for operations on MLlib IndexedRowMatrix |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-13998) HashingTF should extend UnaryTransformer |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24554) Add MapType Support for Arrow in PySpark |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24826) Self-Join not working in Apache Spark 2.2.2 |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23968) allow reading JSON that is composed of pure maps |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24084) Add job group id for query through spark-sql |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23056) parse_url regression when switched to using java.net.URI instead of java.net.URL |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24430) CREATE VIEW with UNION statement: Failed to recognize predicate 'UNION'. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24298) PCAModel Memory in Pipeline |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17181) [Spark2.0 web ui]The status of the certain jobs is still displayed as running even if all the stages of this job have already finished |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24651) Add ability to write null values while writing JSON |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23171) Reduce the time costs of the rule runs that do not change the plans |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22869) 64KB JVM bytecode limit problem with filter |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23995) initial job has not accept any resources and executor keep exit |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-9140) Replace TimeTracker by Stopwatch |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-10795) FileNotFoundException while deploying pyspark job on cluster |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22004) CrossValidator, TrainValidationSplit dump sub models to disk when fitting: Scala API |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23681) Switch OrcFileFormat to newer hadoop.mapreduce output classes |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24844) spark REST API need to add ipFilter |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23337) withWatermark raises an exception on struct objects |
Tue, 08 Oct, 05:43 |