jackrabbit-oak-issues mailing list archives

From "Alex Parvulescu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (OAK-1804) TarMK compaction
Date Thu, 05 Jun 2014 10:12:02 GMT

     [ https://issues.apache.org/jira/browse/OAK-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alex Parvulescu updated OAK-1804:

    Attachment: compaction-map-as-bytebuffer.patch

To address some of the memory-related concerns, I've serialized the record id map to a byte
buffer. This should free the records to be GC'ed.
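
For readers unfamiliar with the idea, here is a minimal sketch of packing an old-to-new
record id map into a single byte buffer, so that lookups no longer keep per-record objects
alive. It is not the code from the attached patch: the class name PackedRecordIdMap, the
nested RecordId type, and the 20-byte id layout (segment UUID plus int offset) are all
assumptions made for illustration.

```java
import java.nio.ByteBuffer;
import java.util.Map;
import java.util.TreeMap;
import java.util.UUID;

/** Hypothetical sketch: an immutable old-to-new record id map packed into one ByteBuffer. */
public final class PackedRecordIdMap {

    /** Hypothetical record id: a segment UUID plus an offset within that segment. */
    public static final class RecordId implements Comparable<RecordId> {
        final UUID segment;
        final int offset;

        public RecordId(UUID segment, int offset) {
            this.segment = segment;
            this.offset = offset;
        }

        @Override
        public int compareTo(RecordId o) {
            int c = segment.compareTo(o.segment);
            return c != 0 ? c : Integer.compare(offset, o.offset);
        }
    }

    private static final int ID_SIZE = 16 + 4;        // UUID (two longs) + int offset
    private static final int ENTRY_SIZE = 2 * ID_SIZE; // "before" id followed by "after" id

    private final ByteBuffer buffer;
    private final int size;

    /** Serializes the entries in key order, so lookups can binary-search the buffer. */
    public PackedRecordIdMap(TreeMap<RecordId, RecordId> entries) {
        size = entries.size();
        buffer = ByteBuffer.allocate(size * ENTRY_SIZE);
        for (Map.Entry<RecordId, RecordId> e : entries.entrySet()) {
            write(e.getKey());
            write(e.getValue());
        }
    }

    private void write(RecordId id) {
        buffer.putLong(id.segment.getMostSignificantBits());
        buffer.putLong(id.segment.getLeastSignificantBits());
        buffer.putInt(id.offset);
    }

    /** Reads one id at an absolute position without moving the buffer's cursor. */
    private RecordId read(int position) {
        UUID segment = new UUID(buffer.getLong(position), buffer.getLong(position + 8));
        return new RecordId(segment, buffer.getInt(position + 16));
    }

    /** Returns the compacted id for the given id, or null if it was not mapped. */
    public RecordId get(RecordId before) {
        int low = 0;
        int high = size - 1;
        while (low <= high) {
            int mid = (low + high) >>> 1;
            int c = read(mid * ENTRY_SIZE).compareTo(before);
            if (c == 0) {
                return read(mid * ENTRY_SIZE + ID_SIZE);
            }
            if (c < 0) {
                low = mid + 1;
            } else {
                high = mid - 1;
            }
        }
        return null;
    }
}
```

In this sketch each mapping costs a fixed 40 bytes, and once the buffer is built the source
TreeMap and its RecordId objects can be dropped, which is the effect described above.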

Unfortunately, I'm still seeing the same growth problem. The standalone tool works fine, so
this points to some segments not being cleared right away because they are still referenced,
for example by the async indexing and some observation threads. It remains to be seen whether
the repo will eventually decrease in size as the refs disappear, or whether something else
is still going on.

> TarMK compaction
> ----------------
>                 Key: OAK-1804
>                 URL: https://issues.apache.org/jira/browse/OAK-1804
>             Project: Jackrabbit Oak
>          Issue Type: New Feature
>          Components: segmentmk
>            Reporter: Jukka Zitting
>            Assignee: Alex Parvulescu
>              Labels: production, tools
>             Fix For: 1.0.1, 1.1
>         Attachments: SegmentNodeStore.java.patch, compact-on-flush.patch, compaction-map-as-bytebuffer.patch, compaction.patch, fast-equals.patch
>
> The TarMK would benefit from periodic "compact" operations that would traverse and recreate (parts of) the content tree in order to optimize the storage layout. More specifically, such compaction would:
> * Optimize performance by increasing locality and reducing duplication, both of which improve the effectiveness of caching.
> * Allow the garbage collector to release more unused disk space by removing references to segments where only a subset of content is reachable.

This message was sent by Atlassian JIRA
