hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 曹楠楠 <michael.c...@gmail.com>
Subject About the memory file system, any suggestions?
Date Mon, 12 Oct 2009 05:37:35 GMT
Hi all :
I try to use the memory file system in hadoop. the idea is very simple. I
want to use memory file system to  the map intermediate file. It is like
this; 1. the memory is limited, the data will be written into the disk. 2.If
the file in memory is deleted and there are space in memory, the data will
be prefetched by a thread into the memory.3.If the data is not in memory,
then read it directly from disk.

But when I try to implement it in hadoop. I find that when the tasktracker
receive a new map or reduce task, it will start a new process. If I use the
memory file system, the intermediate file will be written into map task
process address space. And task tracker can't access to it. So any

Thanks a lot :)

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message