spark-issues mailing list archives

From "Joseph K. Bradley (JIRA)"
Subject [jira] [Commented] (SPARK-14503) Scala API for FPGrowth
Date Tue, 28 Feb 2017 23:14:45 GMT


Joseph K. Bradley commented on SPARK-14503:

Sorry for the slow reply.  I actually haven't read enough about PrefixSpan to say, and the
original paper doesn't seem to cover rule generation.

We're going with PrefixSpanModel currently.  The benefit from combining the models seems pretty
small given the unknowns here, and if they can share implementations, we can do so internally
in the future.

> Scala API for FPGrowth
> -------------------------------
>                 Key: SPARK-14503
>                 URL:
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Joseph K. Bradley
>            Assignee: yuhao yang
> This task is the first port of spark.mllib.fpm functionality to (Scala).
> This will require a brief design doc to confirm a reasonable DataFrame-based API, with
> details for this class.  The doc could also look ahead to the other fpm classes, especially
> if their API decisions will affect FPGrowth.
