falcon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sowmya Ramesh (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FALCON-325) Process lineage information for Replication policies
Date Wed, 13 Aug 2014 00:04:12 GMT

    [ https://issues.apache.org/jira/browse/FALCON-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094925#comment-14094925

Sowmya Ramesh commented on FALCON-325:

On the source cluster there already exists a relationship between Feed entity, Source cluster
and Feed Instance as shown below

FeedEntity <-(instance-of) FeedInstance (stored-in) -> Srccluster

When replication policy is run; edge with relationship label replicated-to has to be added
between FeedInstance and TargetCluster. 
Also FeedInstance vertex property has to be updated with timestamp of replication. 

To improve the performance for the retrieval query : say get me all the replicated instances
for past 3 months
Add new REPLICATED property to FeedInstance vertex which was replicated. This would avoid
going and looking for out edge with label replicated-to to get the replicated instances.

Please comment if you have any concerns about this approach. Thanks!       

> Process lineage information for Replication policies
> ----------------------------------------------------
>                 Key: FALCON-325
>                 URL: https://issues.apache.org/jira/browse/FALCON-325
>             Project: Falcon
>          Issue Type: Sub-task
>    Affects Versions: 0.5
>            Reporter: Venkatesh Seetharam
>            Assignee: Sowmya Ramesh
>              Labels: lineage
> The current patch only addresses process executions and not data lifecycle policies.
This task should address adding this information.

This message was sent by Atlassian JIRA

View raw message