Kevin Ma (Jira) |
[jira] [Created] (SPARK-31143) Spark 2.4.4 count distinct query much slower than Spark 1.6.2 and Hive 1.2.1 |
Fri, 13 Mar, 06:29 |
Kevin Ma (Jira) |
[jira] [Resolved] (SPARK-31143) Spark 2.4.4 count distinct query much slower than Spark 1.6.2 and Hive 1.2.1 |
Fri, 13 Mar, 07:03 |
Pavol Vidlička (Jira) |
[jira] [Created] (SPARK-31196) Server-side processing of History UI list of applications |
Thu, 19 Mar, 20:59 |
Pavol Vidlička (Jira) |
[jira] [Updated] (SPARK-31196) Server-side processing of History UI list of applications |
Thu, 19 Mar, 21:00 |
Pavol Vidlička (Jira) |
[jira] [Updated] (SPARK-31196) Server-side processing of History UI list of applications |
Thu, 19 Mar, 21:02 |
Pavol Vidlička (Jira) |
[jira] [Updated] (SPARK-31196) Server-side processing of History UI list of applications |
Thu, 19 Mar, 21:03 |
Pavol Vidlička (Jira) |
[jira] [Updated] (SPARK-31196) Server-side processing of History UI list of applications |
Thu, 19 Mar, 21:32 |
Pavol Vidlička (Jira) |
[jira] [Resolved] (SPARK-31196) Server-side processing of History UI list of applications |
Wed, 25 Mar, 15:49 |
Piotr Skąpski (Jira) |
[jira] [Created] (SPARK-31174) unix_timestamp() function returning NULL values for corner cases (daylight saving) |
Tue, 17 Mar, 16:05 |
Aditya Addepalli (Jira) |
[jira] [Created] (SPARK-31094) Removing redundant rules in the output of Frequent Pattern Growth Algorithm |
Mon, 09 Mar, 13:00 |
Aki Ariga (Jira) |
[jira] [Commented] (SPARK-30966) spark.createDataFrame fails with pandas DataFrame including pandas.NA |
Wed, 04 Mar, 05:44 |
Alex Favaro (Jira) |
[jira] [Created] (SPARK-31122) Add support for sparse matrix multiplication |
Wed, 11 Mar, 12:02 |
Alexander Tronchin-James (Jira) |
[jira] [Commented] (SPARK-29367) pandas udf not working with latest pyarrow release (0.15.0) |
Fri, 06 Mar, 19:19 |
Alfred Davidson (Jira) |
[jira] [Commented] (SPARK-31281) Hit OOM Error - GC Limit |
Sun, 29 Mar, 18:47 |
Alfred Davidson (Jira) |
[jira] [Comment Edited] (SPARK-31281) Hit OOM Error - GC Limit |
Sun, 29 Mar, 18:55 |
Alfred Davidson (Jira) |
[jira] [Comment Edited] (SPARK-31281) Hit OOM Error - GC Limit |
Sun, 29 Mar, 18:55 |
Ali Afroozeh (Jira) |
[jira] [Created] (SPARK-31192) Introduce PushProjectThroughLimit |
Thu, 19 Mar, 14:37 |
Anton Okolnychyi (Jira) |
[jira] [Commented] (SPARK-23889) DataSourceV2: Add interfaces to pass required sorting and clustering for writes |
Fri, 27 Mar, 19:59 |
Anton Okolnychyi (Jira) |
[jira] [Comment Edited] (SPARK-23889) DataSourceV2: Add interfaces to pass required sorting and clustering for writes |
Fri, 27 Mar, 20:00 |
Arsenii Venherak (Jira) |
[jira] [Created] (SPARK-31149) PySpark job not killing Spark Daemon processes after the executor is killed due to OOM |
Fri, 13 Mar, 15:28 |
Arsenii Venherak (Jira) |
[jira] [Updated] (SPARK-31149) PySpark job not killing Spark Daemon processes after the executor is killed due to OOM |
Fri, 13 Mar, 15:33 |
Arsenii Venherak (Jira) |
[jira] [Commented] (SPARK-31149) PySpark job not killing Spark Daemon processes after the executor is killed due to OOM |
Fri, 13 Mar, 15:36 |
Attila Zsolt Piros (Jira) |
[jira] [Commented] (SPARK-27651) Avoid the network when block manager fetches shuffle blocks from the same host |
Thu, 05 Mar, 10:46 |
Attila Zsolt Piros (Jira) |
[jira] [Updated] (SPARK-27651) Avoid the network when block manager fetches shuffle blocks from the same host |
Thu, 05 Mar, 10:49 |
Attila Zsolt Piros (Jira) |
[jira] [Commented] (SPARK-27651) Avoid the network when block manager fetches shuffle blocks from the same host |
Thu, 05 Mar, 17:16 |
Ayoub Omari (Jira) |
[jira] [Created] (SPARK-31194) spark sql runs successfully with query not specifying condition next to where |
Thu, 19 Mar, 15:09 |
Ayoub Omari (Jira) |
[jira] [Updated] (SPARK-31194) spark sql runs successfully with query not specifying condition next to where |
Thu, 19 Mar, 15:10 |
Ayoub Omari (Jira) |
[jira] [Comment Edited] (SPARK-31194) spark sql runs successfully with query not specifying condition next to where |
Mon, 23 Mar, 10:29 |
Ayoub Omari (Jira) |
[jira] [Closed] (SPARK-31194) spark sql runs successfully with query not specifying condition next to where |
Mon, 23 Mar, 10:29 |
Ben (Jira) |
[jira] [Created] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:29 |
Ben (Jira) |
[jira] [Updated] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:32 |
Ben (Jira) |
[jira] [Updated] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:33 |
Ben (Jira) |
[jira] [Updated] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:33 |
Ben (Jira) |
[jira] [Updated] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:34 |
Ben (Jira) |
[jira] [Updated] (SPARK-31306) rand() function documentation suggests an inclusive upper bound of 1.0 |
Mon, 30 Mar, 17:35 |
Bruce Robbins (Jira) |
[jira] [Commented] (SPARK-30951) Potential data loss for legacy applications after switch to proleptic Gregorian calendar |
Tue, 24 Mar, 20:10 |
Bruce Robbins (Jira) |
[jira] [Created] (SPARK-31238) Incompatible ORC dates with Spark 2.4 |
Tue, 24 Mar, 20:51 |
Bruce Robbins (Jira) |
[jira] [Commented] (SPARK-30951) Potential data loss for legacy applications after switch to proleptic Gregorian calendar |
Tue, 24 Mar, 20:55 |
Bryan Cutler (Jira) |
[jira] [Resolved] (SPARK-30961) Arrow enabled: to_pandas with date column fails |
Fri, 06 Mar, 19:23 |
Bryan Cutler (Jira) |
[jira] [Commented] (SPARK-30961) Arrow enabled: to_pandas with date column fails |
Fri, 06 Mar, 19:25 |
Burak Yavuz (Jira) |
[jira] [Created] (SPARK-31061) Impossible to change the provider of a table in the HiveMetaStore |
Fri, 06 Mar, 00:50 |
Burak Yavuz (Jira) |
[jira] [Created] (SPARK-31178) sql("INSERT INTO v2DataSource ...").collect() double inserts |
Wed, 18 Mar, 01:22 |
Burak Yavuz (Jira) |
[jira] [Commented] (SPARK-31178) sql("INSERT INTO v2DataSource ...").collect() double inserts |
Wed, 18 Mar, 01:23 |
Burak Yavuz (Jira) |
[jira] [Resolved] (SPARK-31178) sql("INSERT INTO v2DataSource ...").collect() double inserts |
Thu, 19 Mar, 01:13 |
Burak Yavuz (Jira) |
[jira] [Created] (SPARK-31278) numOutputRows shows value from last micro batch when there is no new data |
Thu, 26 Mar, 19:40 |
CacheCheck (Jira) |
[jira] [Commented] (SPARK-29878) Improper cache strategies in GraphX |
Sun, 22 Mar, 14:42 |
CacheCheck (Jira) |
[jira] [Created] (SPARK-31216) RDDs in GradientBoostedTress can be unpersisted earlier for saving memory |
Sun, 22 Mar, 15:05 |
CacheCheck (Jira) |
[jira] [Created] (SPARK-31217) Unnecessary persist on cumulativeCounts in BinaryClassificationMetrics |
Sun, 22 Mar, 15:33 |
CacheCheck (Jira) |
[jira] [Created] (SPARK-31218) counts in BinaryClassificationMetrics should be cached |
Sun, 22 Mar, 16:08 |
CacheCheck (Jira) |
[jira] [Commented] (SPARK-31217) Unnecessary persist on cumulativeCounts in BinaryClassificationMetrics |
Sun, 22 Mar, 16:28 |
CacheCheck (Jira) |
[jira] [Comment Edited] (SPARK-31217) Unnecessary persist on cumulativeCounts in BinaryClassificationMetrics |
Sun, 22 Mar, 16:30 |
CacheCheck (Jira) |
[jira] [Commented] (SPARK-31218) counts in BinaryClassificationMetrics should be cached |
Wed, 25 Mar, 08:21 |
CacheCheck (Jira) |
[jira] [Comment Edited] (SPARK-31218) counts in BinaryClassificationMetrics should be cached |
Wed, 25 Mar, 08:38 |
CacheCheck (Jira) |
[jira] [Comment Edited] (SPARK-31218) counts in BinaryClassificationMetrics should be cached |
Wed, 25 Mar, 08:41 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31026) Enable Parquet predicate pushdown on columns with dots |
Tue, 03 Mar, 19:57 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31027) Refactor `DataSourceStrategy.scala` to minimize the changes to support nested predicate pushdown |
Tue, 03 Mar, 20:08 |
DB Tsai (Jira) |
[jira] [Updated] (SPARK-31026) Parquet predicate pushdown on columns with dots |
Tue, 03 Mar, 23:56 |
DB Tsai (Jira) |
[jira] [Updated] (SPARK-31027) Refactor `DataSourceStrategy.scala` to minimize the changes to support nested predicate pushdown |
Wed, 04 Mar, 19:08 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31058) Consolidate the implementation of quoteIfNeeded |
Thu, 05 Mar, 19:14 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31060) Handle column names containing `dots` in data source `Filter` |
Thu, 05 Mar, 22:44 |
DB Tsai (Jira) |
[jira] [Assigned] (SPARK-31058) Consolidate the implementation of quoteIfNeeded |
Fri, 06 Mar, 00:15 |
DB Tsai (Jira) |
[jira] [Resolved] (SPARK-31058) Consolidate the implementation of quoteIfNeeded |
Fri, 06 Mar, 00:15 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31064) New Parquet Predicate Filter APIs with multi-part Identifier Support |
Fri, 06 Mar, 04:09 |
DB Tsai (Jira) |
[jira] [Assigned] (SPARK-31064) New Parquet Predicate Filter APIs with multi-part Identifier Support |
Fri, 06 Mar, 21:11 |
DB Tsai (Jira) |
[jira] [Resolved] (SPARK-31064) New Parquet Predicate Filter APIs with multi-part Identifier Support |
Fri, 06 Mar, 21:11 |
DB Tsai (Jira) |
[jira] [Assigned] (SPARK-22231) Support of map, filter, withColumn, dropColumn in nested list of structures |
Tue, 31 Mar, 16:42 |
DB Tsai (Jira) |
[jira] [Created] (SPARK-31317) Add withField method to Column class |
Tue, 31 Mar, 16:51 |
DB Tsai (Jira) |
[jira] [Commented] (SPARK-22231) Support of map, filter, withColumn, dropColumn in nested list of structures |
Tue, 31 Mar, 16:53 |
DB Tsai (Jira) |
[jira] [Commented] (SPARK-17636) Parquet predicate pushdown for nested fields |
Tue, 31 Mar, 20:43 |
Dan Ziemba (Jira) |
[jira] [Commented] (SPARK-16859) History Server storage information is missing |
Mon, 30 Mar, 21:45 |
Dongjoon Hyun (Jira) |
[jira] [Commented] (SPARK-30986) Structured Streaming: mapGroupsWithState UDT serialization does not work |
Sun, 01 Mar, 23:47 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-30986) Structured Streaming: mapGroupsWithState UDT serialization does not work |
Sun, 01 Mar, 23:51 |
Dongjoon Hyun (Jira) |
[jira] [Commented] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length |
Sun, 01 Mar, 23:52 |
Dongjoon Hyun (Jira) |
[jira] [Commented] (SPARK-29969) parse_url function result in incorrect result |
Mon, 02 Mar, 01:15 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-29419) Seq.toDS / spark.createDataset(Seq) is not thread-safe |
Mon, 02 Mar, 01:19 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-29419) Seq.toDS / spark.createDataset(Seq) is not thread-safe |
Mon, 02 Mar, 01:20 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-29419) Seq.toDS / spark.createDataset(Seq) is not thread-safe |
Mon, 02 Mar, 01:21 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-29419) Seq.toDS / spark.createDataset(Seq) is not thread-safe |
Mon, 02 Mar, 01:22 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-25206) wrong records are returned when Hive metastore schema and parquet schema are in different letter cases |
Mon, 02 Mar, 05:42 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value |
Mon, 02 Mar, 07:20 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-24957) Decimal arithmetic can lead to wrong values using codegen |
Mon, 02 Mar, 07:54 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-24957) Decimal arithmetic can lead to wrong values using codegen |
Mon, 02 Mar, 07:55 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-25114) RecordBinaryComparator may return wrong result when subtraction between two words is divisible by Integer.MAX_VALUE |
Mon, 02 Mar, 08:02 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-25591) PySpark Accumulators with multiple PythonUDFs |
Mon, 02 Mar, 08:12 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-25591) PySpark Accumulators with multiple PythonUDFs |
Mon, 02 Mar, 08:22 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-25708) HAVING without GROUP BY means global aggregate |
Mon, 02 Mar, 08:23 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26154) Stream-stream joins - left outer join gives inconsistent output |
Mon, 02 Mar, 08:45 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26352) join reordering should not change the order of output attributes |
Mon, 02 Mar, 08:55 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26352) join reordering should not change the order of output attributes |
Mon, 02 Mar, 08:58 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26366) Except with transform regression |
Mon, 02 Mar, 09:00 |
Dongjoon Hyun (Jira) |
[jira] [Resolved] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length |
Mon, 02 Mar, 19:27 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length |
Mon, 02 Mar, 19:28 |
Dongjoon Hyun (Jira) |
[jira] [Resolved] (SPARK-30986) Structured Streaming: mapGroupsWithState UDT serialization does not work |
Mon, 02 Mar, 19:28 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26572) Join on distinct column with monotonically_increasing_id produces wrong output |
Mon, 02 Mar, 19:37 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26572) Join on distinct column with monotonically_increasing_id produces wrong output |
Mon, 02 Mar, 19:37 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26682) Task attempt ID collision causes lost data |
Mon, 02 Mar, 19:46 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-23523) Incorrect result caused by the rule OptimizeMetadataOnlyQuery |
Mon, 02 Mar, 19:53 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26709) OptimizeMetadataOnlyQuery does not correctly handle the files with zero record |
Mon, 02 Mar, 19:53 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-23523) Incorrect result caused by the rule OptimizeMetadataOnlyQuery |
Mon, 02 Mar, 19:54 |
Dongjoon Hyun (Jira) |
[jira] [Updated] (SPARK-26812) PushProjectionThroughUnion nullability issue |
Mon, 02 Mar, 20:07 |