spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Owen (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (SPARK-3439) Add Canopy Clustering Algorithm
Date Sat, 23 Jan 2016 12:55:39 GMT

     [ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sean Owen resolved SPARK-3439.
------------------------------
    Resolution: Won't Fix

> Add Canopy Clustering Algorithm
> -------------------------------
>
>                 Key: SPARK-3439
>                 URL: https://issues.apache.org/jira/browse/SPARK-3439
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Yu Ishikawa
>            Assignee: Muhammad-Ali A'rabi
>            Priority: Minor
>              Labels: clustering
>
> The canopy clustering algorithm is an unsupervised pre-clustering algorithm. It is often
used as a preprocessing step for the K-means algorithm or the Hierarchical clustering algorithm.
It is intended to speed up clustering operations on large data sets, where using another algorithm
directly may be impractical due to the size of the data set.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message