jackrabbit-oak-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Dürig (JIRA) <j...@apache.org>
Subject [jira] [Commented] (OAK-3177) Compaction slow on repository with continuous writes
Date Thu, 06 Aug 2015 14:51:05 GMT

    [ https://issues.apache.org/jira/browse/OAK-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14660113#comment-14660113

Michael Dürig commented on OAK-3177:

OAK-2734 needs to be merged first should we decide to merge this.

> Compaction slow on repository with continuous writes
> ----------------------------------------------------
>                 Key: OAK-3177
>                 URL: https://issues.apache.org/jira/browse/OAK-3177
>             Project: Jackrabbit Oak
>          Issue Type: Sub-task
>          Components: segmentmk
>            Reporter: Michael Dürig
>            Assignee: Michael Dürig
>              Labels: compaction, gc
>             Fix For: 1.3.4
>         Attachments: OAK-3177.patch, OAK-3177.png
> OAK-2734 introduced retry cycles and the option to force compaction when all cycles fail.
However OAK-2192 introduced a performance regression: each compaction cycle takes in the order
of the size of the repository to complete instead of in the order of the number of remaining
changes to compact. This is caused by comparing compacted with pre-compacted node states,
which is necessary to avoid mixed segments (aka OAK-2192). To fix the performance regression
I propose to pass the compactor an additional node state (the 'onto' state). The diff would
then be calculated across the pre compacted states, which performs in the order of number
of changes. The changes would then be applied to the 'onto' state, which is a compacted state
to avoid mixed segments. 

This message was sent by Atlassian JIRA

View raw message