spark-issues mailing list archives: October 2019

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · 24 · 25 · 26 · 27 · Next »Thread · Author · Date
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23730) Save and expose "in bag" tracking for random forest model Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23631) Add summary to RandomForestClassificationModel Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21940) Support timezone for timestamps in SparkR Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23987) Unused mailing lists Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-20443) The blockSize of MLLIB ALS should be setting by the User Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21199) Its not possible to impute Vector types Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24848) When a stage fails onStageCompleted is called before onTaskEnd Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24974) Spark put all file's paths into SharedInMemoryCache even for unused partitions. Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24425) Regression from 1.6 to 2.x - Spark no longer respects input partitions, unnecessary shuffle required Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21084) Improvements to dynamic allocation for notebook use cases Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24837) Add kafka as spark metrics sink Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23996) Implement the optimal KLL algorithms for quantiles in streams Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-12878) Dataframe fails with nested User Defined Types Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24729) Spark - stackoverflow error - org.apache.spark.sql.catalyst.plans.QueryPlan Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15573) Backwards-compatible persistence for spark.ml Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23839) consider bucket join in cost-based JoinReorder rule Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24843) Spark2 job (in cluster mode) is unable to execute steps in HBase (error# java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/CompatibilityFactory) Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-12126) JDBC datasource processes filters only commonly pushed down. Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25180) Spark standalone failure in Utils.doFetchFile() if nslookup of local hostname fails Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21040) On executor/worker decommission consider speculatively re-launching current tasks Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25103) CompletionIterator may delay GC of completed resources Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24738) [HistoryServer] FsHistoryProvider clean outdated event logs at start Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-5572) LDA improvement listing Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-8614) Row order preservation for operations on MLlib IndexedRowMatrix Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-14604) Modify design of ML model summaries Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23292) python tests related to pandas are skipped with python 2 Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21962) Distributed Tracing in Spark Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-18822) Support ML Pipeline in SparkR Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24473) It is no need to clip the predictive value by maxValue and minValue when computing gradient on SVDplusplus model Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25030) SparkSubmit.doSubmit will not return result if the mainClass submitted creates a Timer() Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-4285) Transpose RDD[Vector] to column store for ML Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25311) `SPARK_LOCAL_HOSTNAME` unsupport IPV6 when do host checking Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23740) Add FPGrowth Param for filtering out very common items Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25351) Handle Pandas category type when converting from Python with Arrow Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-20295) when spark.sql.adaptive.enabled is enabled, have conflict with Exchange Resue Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15041) adding mode strategy for ml.feature.Imputer for categorical features Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15690) Fast single-node (single-process) in-memory shuffle Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22918) sbt test (spark - local) fail after upgrading to 2.2.1 with: java.security.AccessControlException: access denied org.apache.derby.security.SystemPermission( "engine", "usederbyinternals" ) Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24607) Distribute by rand() can lead to data inconsistency Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23328) Disallow default value None in na.replace/replace when 'to_replace' is not a dictionary Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25020) Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8 Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24081) Spark SQL drops the table while writing into table in "overwrite" mode. Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-8582) Optimize checkpointing to avoid computing an RDD twice Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24431) wrong areaUnderPR calculation in BinaryClassificationEvaluator Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-20629) Copy shuffle data when nodes are being shut down Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23181) Add compatibility tests for SHS serialized data / disk format Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23368) Avoid unnecessary Exchange or Sort after projection Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24293) Serialized shuffle supports mapSideCombine Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23837) Create table as select gives exception if the spark generated alias name contains comma Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25217) Error thrown when creating BlockMatrix Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23236) Make it easier to find the rest API, especially in local mode Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23669) Executors fetch jars and name the jars with md5 prefix Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21353) add checkValue in spark.internal.config about how to correctly set configurations Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22415) lint-r fails if lint-r.R installs any new packages Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-18600) BZ2 CRC read error needs better reporting Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25585) Allow users to specify scale of result in Decimal arithmetic Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22031) KMeans - Compute cost for a single vector Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-20732) Copy cache data when node is being shut down Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23858) Need to apply pyarrow adjustments to complex types with DateType/TimestampType Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-19241) remove hive generated table properties if they are not useful in Spark Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24656) SparkML Transformers and Estimators with multiple columns Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24689) java.io.NotSerializableException: org.apache.spark.mllib.clustering.DistributedLDAModel Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24049) Add a feature to not start speculative tasks when average task duration is less than a configurable absolute number Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22245) dataframe should always put partition columns at the end Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24748) Support for reporting custom metrics via Streaming Query Progress Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25428) Support plain Kerberos Authentication with Spark Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24240) Add a config to control whether InMemoryFileIndex should update cache when refresh. Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22731) Add a test for ROWID type to OracleIntegrationSuite Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23797) SparkSQL performance on small TPCDS tables is very low when compared to Drill or Presto Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24910) Spark Bloom Filter Closure Serialization improvement for very high volume of Data Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24964) Please add OWASP Dependency Check to all comonent builds(pom.xml) Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24905) Spark 2.3 Internal URL env variable Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23664) Add interface to collect query result through file iterator Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21536) Remove the workaroud to allow dots in field names in R's createDataFame Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24301) Add Instrumentation test coverage Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24221) Retry spark app submission to k8 in KubernetesClientApplication Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21406) Add logLikelihood to GLR families Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24939) Support YARN Shared Cache in Spark Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-22565) Session-based windowing Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24745) Map function does not keep rdd name Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24122) Allow automatic driver restarts on K8s Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24617) Spark driver not requesting another executor once original executor exits due to 'lost worker' Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24585) Adding ability to audit file system before and after test to ensure all files are cleaned up. Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21076) R dapply doesn't return array or raw columns when array have different length Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24750) HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE permission denied even if select table operation Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-18245) Improving support for bucketed table Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25107) Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy, tree: CatalogRelation Errors Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15516) Schema merging in driver fails for parquet when merging LongType and IntegerType Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15691) Refactor and improve Hive support Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24266) Spark client terminates while driver is still running Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-25219) KMeans Clustering - Text Data - Results are incorrect Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-15777) Catalog federation Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24550) Add support for Kubernetes specific metrics Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24114) improve instrumentation for spark.ml.recommendation Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-20618) Support Custom Partitioners in PySpark Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them Tue, 08 Oct, 05:42
Hyukjin Kwon (Jira) [jira] [Resolved] (SPARK-21016) Improve code fault tolerance for converting string to number Tue, 08 Oct, 05:43
Message list« Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · 9 · 10 · 11 · 12 · 13 · 14 · 15 · 16 · 17 · 18 · 19 · 20 · 21 · 22 · 23 · 24 · 25 · 26 · 27 · Next »Thread · Author · Date
Box list
Mar 2021753
Feb 20212404
Jan 20213118
Dec 20203114
Nov 20202574
Oct 20202364
Sep 20202465
Aug 20202548
Jul 20202987
Jun 20202495
May 20202296
Apr 20202233
Mar 20202569
Feb 20201689
Jan 20202030
Dec 20191984
Nov 20191999
Oct 20192605
Sep 20192562
Aug 20191971
Jul 20192560
Jun 20192148
May 20196634
Apr 20191770
Mar 20192729
Feb 20191990
Jan 20192470
Dec 20183548
Nov 20182719
Oct 20183017
Sep 20182838
Aug 20183000
Jul 20182380
Jun 20182087
May 20182671
Apr 20182287
Mar 20182229
Feb 20182319
Jan 20183319
Dec 20172281
Nov 20172408
Oct 20172299
Sep 20172506
Aug 20173310
Jul 20172915
Jun 20173101
May 20173314
Apr 20173090
Mar 20173629
Feb 20173151
Jan 20173354
Dec 20163726
Nov 20164722
Oct 20164453
Sep 20163741
Aug 20164396
Jul 20164608
Jun 20165512
May 20165539
Apr 20166102
Mar 20166117
Feb 20164280
Jan 20164706
Dec 20154974
Nov 20156194
Oct 20154747
Sep 20154598
Aug 20157076
Jul 20156772
Jun 20156218
May 20156438
Apr 20155360
Mar 20154473
Feb 20154636
Jan 20153246
Dec 20142821
Nov 20143092
Oct 20142923
Sep 20142921
Aug 20142949
Jul 20142711
Jun 20141751
May 20141587
Apr 20141553
Mar 2014189