hbase-user mailing list archives

From Bill Graham <billgra...@gmail.com>
Subject Re: MapReduce job reading directly from the HBase files in HDFS
Date Fri, 06 May 2011 21:18:27 GMT
One big reason is that there will be updates in the MemStore that haven't
yet been flushed to HFiles. A job that reads only the HFiles will miss these.
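
To make the point concrete, here is a minimal sketch of the write path as a
toy model. It assumes nothing beyond what is said above: writes land in
memory first and only reach files on a flush. The class ToyStore and its
methods are hypothetical names made up for illustration; they are not part
of the HBase API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: this class is hypothetical and is NOT the HBase API.
public class ToyStore {
    // Recent writes held in memory (stands in for the memory store).
    private final TreeMap<String, String> memstore = new TreeMap<>();
    // Immutable sorted snapshots (stand in for HFiles), oldest first.
    private final List<TreeMap<String, String>> files = new ArrayList<>();

    public void put(String row, String value) {
        memstore.put(row, value);
    }

    // Write the in-memory contents out as a new file, then clear memory.
    public void flush() {
        if (memstore.isEmpty()) {
            return;
        }
        files.add(new TreeMap<>(memstore));
        memstore.clear();
    }

    // What a job reading only the files would see.
    public Map<String, String> readFilesOnly() {
        TreeMap<String, String> view = new TreeMap<>();
        for (TreeMap<String, String> file : files) {
            view.putAll(file); // later (newer) files overwrite older values
        }
        return view;
    }

    // What a read through the server would see: files plus unflushed writes.
    public Map<String, String> readThroughServer() {
        TreeMap<String, String> view = new TreeMap<>(readFilesOnly());
        view.putAll(memstore); // unflushed writes are the newest data
        return view;
    }

    public static void main(String[] args) {
        ToyStore store = new ToyStore();
        store.put("row1", "v1");
        store.flush();            // row1=v1 is now in a file
        store.put("row2", "v1");  // new row, not flushed
        store.put("row1", "v2");  // update to an existing row, not flushed

        System.out.println("files only:     " + store.readFilesOnly());
        System.out.println("through server: " + store.readThroughServer());
        // prints:
        // files only:     {row1=v1}
        // through server: {row1=v2, row2=v1}
    }
}
```

In this model the files-only view is missing row2 entirely and still shows
the stale value for row1, which is the gap described above.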

On Fri, May 6, 2011 at 12:27 PM, Jason Rutherglen <
jason.rutherglen@gmail.com> wrote:

> Is there an issue open, or any particular reason, that an MR job needs to
> access the HBase data directly from the region server? It seems possible
> to also provide functionality such that MR can execute over the HFile(s)
> stored in HDFS, thereby giving performance characteristics comparable to
> typical MR jobs that execute against files in HDFS.
>
> Jason
>
