mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Regarding the scalability of SVD code in Mahout
Date Wed, 08 Sep 2010 01:50:30 GMT
Just to cross-check, is it true that your data has 35 x 100 million
non-zeros in it?

On Tue, Sep 7, 2010 at 6:16 PM, Akshay Bhat <akshayubhat@gmail.com> wrote:

> > - the total number of non-zero elements.  This drives the scan time and,
> to
> > some extent the cost of the multiplies.
> >
> The total number of non-zero elements are small since, most of the twitter
> users follow on average around 100 other users
>
> ...
> > - the number of rows in the original matrix.  This is a secondary factor
> > that can drive some intermediate products in the random projection.
> >
> > The number of rows is around 35 Million

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message