mahout-user mailing list archives

From Danny Bickson <danny.bick...@gmail.com>
Subject Re: how to run PCA from Mahout
Date Tue, 06 Sep 2011 12:52:57 GMT
Hi,
As far as I know, PCA is not implemented yet, but if your goal is
dimensionality reduction you may be able to use SVD (the Lanczos
algorithm) instead.
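
To illustrate why SVD can stand in for PCA here, below is a minimal sketch
in plain Python, not Mahout code. It assumes the data fits in memory as a
list of rows, and it uses simple power iteration in place of the Lanczos
algorithm to find only the leading direction. The function names (center,
top_component, project) are made up for this example. The key point is
that the principal components are the right singular vectors of the
mean-centered data matrix, so the centering step must be done before any
SVD is run.

```python
import math
import random


def center(rows):
    """Subtract the per-column mean so the data is zero-mean, as PCA requires."""
    n = len(rows)
    d = len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def top_component(rows, iters=200, seed=0):
    """Leading right singular vector of the centered matrix A.

    Uses power iteration on A^T A (a stand-in for Lanczos), without
    ever forming A^T A explicitly.
    """
    a = center(rows)
    d = len(a[0])
    rng = random.Random(seed)
    v = [rng.gauss(0, 1) for _ in range(d)]
    for _ in range(iters):
        # av = A v, then w = A^T av
        av = [sum(x * y for x, y in zip(row, v)) for row in a]
        w = [sum(a[i][j] * av[i] for i in range(len(a))) for j in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # data has no variance; nothing to find
        v = [x / norm for x in w]
    return v


def project(rows, v):
    """Reduce each centered row to a single coordinate along direction v."""
    return [sum(x * y for x, y in zip(row, v)) for row in center(rows)]


if __name__ == "__main__":
    data = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
    v = top_component(data)
    print(v)                 # unit vector along (1, 2), up to sign
    print(project(data, v))  # one coordinate per input row
```

For a data set of the size mentioned in the question, the same two steps
(center, then take the leading singular vectors) would have to be done by
a distributed solver; the sketch only shows the relationship between the
two methods.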

Best,

DB

On Tue, Sep 6, 2011 at 3:33 PM, Amr Desoky <amr_desoky@yahoo.com> wrote:

> Hi,
> It is mentioned on the web site
> https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms
> that the following algorithms are implemented within Mahout:
>   Gaussian Discriminative Analysis
>   Independent Component Analysis
>   Principal Components Analysis
>
> Unfortunately, I could not find any help or documentation on how to
> use these algorithms. In particular, I would like to try PCA on a huge
> data set of ~10 million vectors of 400 components each.
>
> Please give me some help on how to run PCA (and also ICA and GDA,
> whichever is available).
>
> Best regards,
> Amr
>
>
> Amr Ibrahim El-Desoky, Mousa
> PhD Student, Computer Science (i6),
> RWTH-Aachen University,
> Aachen, Germany
> Cel.     : +49 0176 56418470
> Office : +49 241 8021620
> Fax      : +49 241 8022219
