Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24998) spark-sql will scan the same table repeatedly when doing multi-insert |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22907) MetadataFetchFailedException broadcast is already present |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24503) Implement SparkSQL authorization plugin in Apache Ranger |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20891) Reduce duplicate code in typedaggregators.scala |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-10817) ML abstraction umbrella |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-8799) OneVsRestModel should extend ClassificationModel |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25171) After restart, StreamingContext is replaying the last successful micro-batch right before the stop |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24280) Speed up indexing of files in object stores by using listFiles(path, recursive=true) |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25065) Driver and executors pick the wrong logging configuration file. |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23560) Group by on struct field can add extra shuffle |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22748) Error in query: grouping_id() can only be used with GroupingSets/Cube/Rollup; |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22657) Hadoop fs implementation classes are not loaded if they are part of the app jar or other jar when --packages flag is used |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25367) The column attributes obtained by Spark sql are inconsistent with hive |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24164) Support column list as the pivot column in Pivot |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24483) enableHiveSupport doesn't work with Spark 2.3 on EMR |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24449) ApplicationMaster reporter thread failure counter is not effective |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22402) Allow fetcher URIs to be downloaded to specific locations relative to Mesos Sandbox |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23543) Automatic Module creation fails in Java 9 |
Tue, 08 Oct, 05:43 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23531) When explain, plan's output should include attribute type info |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22741) Add global aggregate for typed aggregation |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-7424) spark.ml classification, regression abstractions should add metadata to output column |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19680) Offsets out of range with no configured reset policy for partitions |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24088) only HadoopRDD leverage HDFS Cache as preferred location |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23369) HiveClientSuites fails with unresolved dependency |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-5158) Allow for keytab-based HDFS security in Standalone mode |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22035) the value of statistical logicalPlan.stats.sizeInBytes which is not expected |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24162) Support aliased literal values for Pivot "IN" clause |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24842) self-join query fails on different letter case for same field |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-14834) Force adding doc for new api in pyspark with @since annotation |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19609) Broadcast joins should pushdown join constraints as Filter to the larger relation |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21389) ALS recommendForAll optimization uses Native BLAS |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24827) Some memory waste in History Server by strings in AccumulableInfo objects |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25593) JDBC write Impala, `truncate` true option in Overwrite mode for JDBC DataFrameWriter is dropping and creating the table instead of truncating. |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24524) Improve aggregateMetrics: less memory usage and loops |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23575) ERROR RetryingHMSHandler:159 - AlreadyExistsException(message: |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24728) org.apache.spark.repl.ExecutorClassLoader with cache |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-9120) Add multivariate regression (or prediction) interface |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24264) [Structured Streaming] Remove 'mergeSchema' option from Parquet source configuration |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18082) Locality Sensitive Hashing (LSH) - SignRandomProjection |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21707) Improvement a special case for non-deterministic filters in optimizer |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24269) Infer nullability rather than declaring all columns as nullable |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19903) Watermark metadata is lost when using resolved attributes |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16483) Unifying struct fields and columns |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23833) Incorrect primitive type check for input arguments of udf |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25008) Add memory mode info to showMemoryUsage in TaskMemoryManager |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23832) Adding possibility to set timestamp into KafkaRowWriter |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25397) SparkSession.conf fails when given default value with Python 3 |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21302) history server WebUI show HTTP ERROR 500 |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23074) Dataframe-ified zipwithindex |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16203) regexp_extract to return an ArrayType(StringType()) |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23221) Fix KafkaContinuousSourceStressForDontFailOnDataLossSuite to run with enough cores |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24922) Iterative rdd union + reduceByKey operations on small dataset leads to "No space left on device" error on account of lot of shuffle spill. |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22202) Release tgz content differences for python and R |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24447) Pyspark RowMatrix.columnSimilarities() loses spark context |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20592) Alter table concatenate is not working as expected. |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-11136) Warm-start support for ML estimator |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20819) Enhance ColumnVector to keep UnsafeArrayData for other types |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20153) Support Multiple aws credentials in order to access multiple Hive on S3 table in spark application |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17694) convert DataFrame to DataSet should check columns match |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25032) Create table is failing, after dropping the database . It is not falling back to default database |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24410) Missing optimization for Union on bucketed tables |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24604) upgrade to spark 2.3.0 makes MPC model training slower |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23704) PySpark access of individual trees in random forest is slow |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24756) Incorrect Statistics |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20074) Make buffer size in unsafe external sorter configurable |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24587) RDD.takeOrdered uses reduce, pulling all partition data to the driver |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24457) Performance improvement while converting stringToTimestamp in DateTimeUtils |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24382) Spark Structured Streaming aggregation on old timestamp data |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25340) Pushes down Sample beneath deterministic Project |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24616) Need to retreive free memory on command prompt on DSE cluster |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21166) Automated ML persistence |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24442) Add configuration parameter to adjust the numbers of records and the characters per row before truncation when a user runs.show() |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23790) proxy-user failed connecting to a kerberos configured metastore |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23983) Disable X-Frame-Options from Spark UI response headers if explicitly configured |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21443) Very long planning duration for queries with lots of operations |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24180) Using another dynamodb endpoint for kinesis |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22005) CrossValidator, TrainValidationSplit dump sub models to disk when fitting: Python API |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22132) Document the Dispatcher REST API |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24463) Add catalyst rule to reorder TypedFilters separated by Filters to reduce serde operations |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22954) ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views") |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24405) parameter for python worker timeout |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24456) Spark submit - server environment variables are overwritten by client environment variables |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24260) Support for multi-statement SQL in SparkSession.sql API |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24784) Retraining (each document as separate file) creates OOME |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-9775) Query Mesos for number of CPUs to set default parallelism |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24394) Nodes in decision tree sometimes have negative impurity values |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25537) spark.pyspark.driver.python when set in code doesnt work |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21730) Consider officially dropping PyPy pre-2.5 support |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25109) spark python should retry reading another datanode if the first one fails to connect |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23777) Missing DAG arrows between stages |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21405) Add LBFGS solver for GeneralizedLinearRegression |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18492) GeneratedIterator grows beyond 64 KB |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24342) Large Task prior scheduling to Reduce overall execution time |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-17570) Avoid Hash and Exchange in Sort Merge join if bucketing factor is multiple for tables |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23650) Slow SparkR udf (dapply) |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23744) Memory leak in ReadableChannelFileRegion |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24144) monotonically_increasing_id on streaming dataFrames |
Tue, 08 Oct, 05:44 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24838) Support uncorrelated IN/EXISTS subqueries for more operators |
Tue, 08 Oct, 05:44 |