mahout-user mailing list archives

From Florian Leibert
Subject Vector creation - out of memory error
Date Mon, 20 Jul 2009 18:40:21 GMT
I'm trying to create vectors with Mahout as explained in,
but I keep running out of heap space. My heap is already set to 2 GB, and I
use these parameters:
"java org.apache.mahout.utils.vectors.Driver --dir /LUCENE/ind --output
/user/florian/index-vectors-01 --field content --dictOut
/user/florian/index-dict-01 --weight TF".

My index is currently about 6 GB. Is there any way to compute the
vectors in a distributed manner? What's the largest index anyone has
created vectors from?


