falcon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "sandeep samudrala (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (FALCON-1368) Improve Falcon server restart time
Date Wed, 05 Aug 2015 03:26:05 GMT

     [ https://issues.apache.org/jira/browse/FALCON-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

sandeep samudrala reassigned FALCON-1368:

    Assignee: sandeep samudrala

> Improve Falcon server restart  time
> -----------------------------------
>                 Key: FALCON-1368
>                 URL: https://issues.apache.org/jira/browse/FALCON-1368
>             Project: Falcon
>          Issue Type: Improvement
>            Reporter: Ajay Yadava
>            Assignee: sandeep samudrala
> Currently on restart, Falcon server loads all the entities from HDFS one by one. In a
large set up like the one at Inmobi, where we have several thousands of feeds and processes,
this takes several minutes to increase the start up time.
> Since this is an IO intensive task(reading a file from HDFS into memory), having multiple
threads to load entities in parallel will improve the start up time of the server. 
> Two points need to be taken care of:
> 1. Only a single type of entities should be load in parallel to preserve the order of
load of entities.
> 2. Currently Falcon server fails to start if there is an error in loading any entity,
it will be slightly tricky but we should preserve the same behaviour in case of parallel uploads
via threads.

This message was sent by Atlassian JIRA

View raw message