drill-dev mailing list archives

From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: A list of questions on Dremel (or Apache Drill)'s columnar storage
Date Tue, 28 Aug 2012 11:39:28 GMT
Can't do variable block size in vanilla Hadoop.  That is part of the whole
namenode legacy.

On Tue, Aug 28, 2012 at 2:56 AM, Min Zhou <coderplay@gmail.com> wrote:

> 1. If there is one data file for each column, data locality is difficult to
>    guarantee when rebuilding a row from column files, unless
>    GFS can keep all fields from the same row in files on the
>    same node. Moreover, a data block can't be a fixed
>    size like 1MB/64MB/128MB, cuz
>
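A rough sketch of the alignment problem raised in the quoted point, not taken from the thread itself. It assumes fixed-width values and a 64MB block size; the column names and byte widths are made up for illustration, and real column files would typically be compressed and variable-width, which only makes the misalignment less predictable.

```python
# Illustration only: with one file per column and a fixed block size,
# the same row lands in a different block index in each column file.

BLOCK_SIZE = 64 * 1024 * 1024  # 64MB, one of the sizes mentioned in the quote


def block_of_row(row, bytes_per_value, block_size=BLOCK_SIZE):
    """Index of the fixed-size block that holds `row` in one column file."""
    return (row * bytes_per_value) // block_size


# Hypothetical columns: name -> bytes per value (assumed fixed width).
columns = {"id": 8, "flag": 1, "payload": 100}

row = 10_000_000
placement = {
    name: block_of_row(row, width) for name, width in columns.items()
}
print(placement)  # {'id': 1, 'flag': 0, 'payload': 14}
```

Since each block is placed independently, the three fields of that row can end up on three different nodes, so rebuilding the row is no longer a local read.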
