atlas-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ATLAS-3132) Data Patch Fx: Improve Data Patching Performance
Date Tue, 16 Apr 2019 09:22:00 GMT

    [ https://issues.apache.org/jira/browse/ATLAS-3132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16818818#comment-16818818
] 

ASF subversion and git services commented on ATLAS-3132:
--------------------------------------------------------

Commit efc4bebc1623c9d00fe4fdf0df424918654a73df in atlas's branch refs/heads/master from Ashutosh
Mestry
[ https://gitbox.apache.org/repos/asf?p=atlas.git;h=efc4beb ]

ATLAS-3132: performance improvements in UniqueAttributesPatch

Signed-off-by: Madhan Neethiraj <madhan@apache.org>


> Data Patch Fx: Improve Data Patching Performance
> ------------------------------------------------
>
>                 Key: ATLAS-3132
>                 URL: https://issues.apache.org/jira/browse/ATLAS-3132
>             Project: Atlas
>          Issue Type: Improvement
>          Components:  atlas-core
>    Affects Versions: trunk
>            Reporter: Ashutosh Mestry
>            Assignee: Ashutosh Mestry
>            Priority: Major
>             Fix For: trunk
>
>
> *Background*
> The Java patch framework (now called data patching framework) introduced recently performs
patching at the rate of 1 million entities per 15 hrs. This can be improved.
> *Proposed Solution*
>  * Use the Producer-Consumer framework to spawn multiple workers to perform concurrent
updates to entity vertices.
>  * Use _AtlasGraph_ in bulk loading mode to further gain performance.
>  * Perform duplicate data checks during processing.
> *Projected Performance Improvement*
>  * Based on various tests, these give increased throughput. New rate can be ~300K entities
per 5 mins.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message