Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25087) update/port all builds to run on ubuntu 16.04 LTS |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23910) Publish executor memory utilisation in heartbeat events |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24163) Support "ANY" or sub-query for Pivot "IN" clause |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25022) Add spark.executor.pyspark.memory support to Mesos |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24273) Failure while using .checkpoint method to private S3 store via S3A connector |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21024) CSV parse mode handles Univocity parser exceptions |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22359) Improve the test coverage of window functions |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24494) Give users possibility to skip own classes in SparkContext.getCallSite() |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24955) spark continuing to execute on a task despite not reading all data from a downed machine |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24764) Add ServiceLoader implementation for SparkHadoopUtil |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-16418) DataFrame.filter fails if it references a window function |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23796) There's no API to change state RDD's name |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22114) The condition of OnlineLDAOptimizer convergence should be configurable |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23800) Support partial function and callable object with pandas UDF |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22728) Unify artifact access for (mesos, standalone and yarn) when HDFS is available |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-9636) Treat $SPARK_HOME as write-only |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23537) Logistic Regression without standardization |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23730) Save and expose "in bag" tracking for random forest model |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23631) Add summary to RandomForestClassificationModel |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21940) Support timezone for timestamps in SparkR |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23987) Unused mailing lists |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20443) The blockSize of MLLIB ALS should be setting by the User |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21199) Its not possible to impute Vector types |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24848) When a stage fails onStageCompleted is called before onTaskEnd |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24974) Spark put all file's paths into SharedInMemoryCache even for unused partitions. |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24425) Regression from 1.6 to 2.x - Spark no longer respects input partitions, unnecessary shuffle required |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21084) Improvements to dynamic allocation for notebook use cases |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24837) Add kafka as spark metrics sink |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23996) Implement the optimal KLL algorithms for quantiles in streams |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-12878) Dataframe fails with nested User Defined Types |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24729) Spark - stackoverflow error - org.apache.spark.sql.catalyst.plans.QueryPlan |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15573) Backwards-compatible persistence for spark.ml |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23839) consider bucket join in cost-based JoinReorder rule |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24843) Spark2 job (in cluster mode) is unable to execute steps in HBase (error# java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/CompatibilityFactory) |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-12126) JDBC datasource processes filters only commonly pushed down. |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25180) Spark standalone failure in Utils.doFetchFile() if nslookup of local hostname fails |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21040) On executor/worker decommission consider speculatively re-launching current tasks |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25103) CompletionIterator may delay GC of completed resources |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24738) [HistoryServer] FsHistoryProvider clean outdated event logs at start |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-5572) LDA improvement listing |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-8614) Row order preservation for operations on MLlib IndexedRowMatrix |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-14604) Modify design of ML model summaries |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23292) python tests related to pandas are skipped with python 2 |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21962) Distributed Tracing in Spark |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18822) Support ML Pipeline in SparkR |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24473) It is no need to clip the predictive value by maxValue and minValue when computing gradient on SVDplusplus model |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25030) SparkSubmit.doSubmit will not return result if the mainClass submitted creates a Timer() |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-4285) Transpose RDD[Vector] to column store for ML |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25311) `SPARK_LOCAL_HOSTNAME` unsupport IPV6 when do host checking |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23740) Add FPGrowth Param for filtering out very common items |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25351) Handle Pandas category type when converting from Python with Arrow |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20295) when spark.sql.adaptive.enabled is enabled, have conflict with Exchange Resue |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15041) adding mode strategy for ml.feature.Imputer for categorical features |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-15690) Fast single-node (single-process) in-memory shuffle |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22918) sbt test (spark - local) fail after upgrading to 2.2.1 with: java.security.AccessControlException: access denied org.apache.derby.security.SystemPermission( "engine", "usederbyinternals" ) |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24607) Distribute by rand() can lead to data inconsistency |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23328) Disallow default value None in na.replace/replace when 'to_replace' is not a dictionary |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25020) Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8 |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24081) Spark SQL drops the table while writing into table in "overwrite" mode. |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-8582) Optimize checkpointing to avoid computing an RDD twice |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24431) wrong areaUnderPR calculation in BinaryClassificationEvaluator |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20629) Copy shuffle data when nodes are being shut down |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23181) Add compatibility tests for SHS serialized data / disk format |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23368) Avoid unnecessary Exchange or Sort after projection |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24293) Serialized shuffle supports mapSideCombine |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23837) Create table as select gives exception if the spark generated alias name contains comma |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25217) Error thrown when creating BlockMatrix |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23236) Make it easier to find the rest API, especially in local mode |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23669) Executors fetch jars and name the jars with md5 prefix |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21353) add checkValue in spark.internal.config about how to correctly set configurations |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22415) lint-r fails if lint-r.R installs any new packages |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-18600) BZ2 CRC read error needs better reporting |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25585) Allow users to specify scale of result in Decimal arithmetic |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22031) KMeans - Compute cost for a single vector |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-20732) Copy cache data when node is being shut down |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23858) Need to apply pyarrow adjustments to complex types with DateType/TimestampType |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-19241) remove hive generated table properties if they are not useful in Spark |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24656) SparkML Transformers and Estimators with multiple columns |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24689) java.io.NotSerializableException: org.apache.spark.mllib.clustering.DistributedLDAModel |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24049) Add a feature to not start speculative tasks when average task duration is less than a configurable absolute number |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22245) dataframe should always put partition columns at the end |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24748) Support for reporting custom metrics via Streaming Query Progress |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-25428) Support plain Kerberos Authentication with Spark |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24240) Add a config to control whether InMemoryFileIndex should update cache when refresh. |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22731) Add a test for ROWID type to OracleIntegrationSuite |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23797) SparkSQL performance on small TPCDS tables is very low when compared to Drill or Presto |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24910) Spark Bloom Filter Closure Serialization improvement for very high volume of Data |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24964) Please add OWASP Dependency Check to all comonent builds(pom.xml) |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24905) Spark 2.3 Internal URL env variable |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-23664) Add interface to collect query result through file iterator |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21536) Remove the workaroud to allow dots in field names in R's createDataFame |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24301) Add Instrumentation test coverage |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24221) Retry spark app submission to k8 in KubernetesClientApplication |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-21406) Add logLikelihood to GLR families |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24939) Support YARN Shared Cache in Spark |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-22565) Session-based windowing |
Tue, 08 Oct, 05:42 |
Hyukjin Kwon (Jira) |
[jira] [Resolved] (SPARK-24745) Map function does not keep rdd name |
Tue, 08 Oct, 05:42 |