spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 李承霖 <>
Subject Imporvement the cube with the Fast Cubing In apache Kylin
Date Tue, 15 Mar 2016 12:52:52 GMT

I tried to build a cube on a 100 million data set.
When I set 9 fields to build the cube with 10 cores.
It nearly coast me a whole day to finish the job.
At the same time, it generate almost 1”TB“ data in the "/tmp“ folder.
Could we refer to the ”fast cube“ algorithm in apache Kylin

To make the cube builder more quickly???

even run the group by first and generate the cube is more quilk.
View raw message