mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vincent Xue <>
Subject Transposing a matrix is limited by how large a node is.
Date Fri, 06 May 2011 13:01:42 GMT
Dear Mahout Users,

I am using Mahout-0.5-SNAPSHOT to transpose a dense matrix of 55000 x 31000.
My matrix is in stored on the HDFS as a
SequenceFile<IntWritable,VectorWritable>, consuming just about 13 GB. When I
run the transpose function on my matrix, the function falls over during the
reduce phase. With closer inspection, I noticed that I was receiving the
following error:

FSError: No space left on device

I thought this was not possible considering that I was only using 15% of the
2.5 TB in the cluster but when I closely monitored the disk space, it was
true that the 40 GB hard drive on the node was running out of space.
Unfortunately, all of my nodes are limited to 40 GB and I have not been
successful in transposing my matrix.

>From this observation, I would like to know if there is any alternative
method to transpose my matrix or if there is something I am missing?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message