[ https://issues.apache.org/jira/browse/FALCON-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094925#comment-14094925
]
Sowmya Ramesh commented on FALCON-325:
--------------------------------------
On the source cluster there already exists a relationship between Feed entity, Source cluster
and Feed Instance as shown below
FeedEntity <-(instance-of) FeedInstance (stored-in) -> Srccluster
When replication policy is run; edge with relationship label replicated-to has to be added
between FeedInstance and TargetCluster.
Also FeedInstance vertex property has to be updated with timestamp of replication.
To improve the performance for the retrieval query : say get me all the replicated instances
for past 3 months
Add new REPLICATED property to FeedInstance vertex which was replicated. This would avoid
going and looking for out edge with label replicated-to to get the replicated instances.
Please comment if you have any concerns about this approach. Thanks!
> Process lineage information for Replication policies
> ----------------------------------------------------
>
> Key: FALCON-325
> URL: https://issues.apache.org/jira/browse/FALCON-325
> Project: Falcon
> Issue Type: Sub-task
> Affects Versions: 0.5
> Reporter: Venkatesh Seetharam
> Assignee: Sowmya Ramesh
> Labels: lineage
>
> The current patch only addresses process executions and not data lifecycle policies.
This task should address adding this information.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
|