jackrabbit-oak-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Parvulescu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (OAK-1877) Hourly async reindexing on an idle instance
Date Mon, 16 Jun 2014 13:44:01 GMT

     [ https://issues.apache.org/jira/browse/OAK-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Alex Parvulescu updated OAK-1877:
---------------------------------

    Attachment: updates-without-indexed-changes.patch

there's a regression from the latest changes: content changes that are not indexed would generate
a new checkpoint, but fail to properly link it from the 'async' reference, resulting in a
large number of warning messages because of the missing initial checkpoint.

attached a proposed patch with some updates tests that also check the 'async' reference to
the checkpoints.

[~jukkaz] can you take a look?

> Hourly async reindexing on an idle instance
> -------------------------------------------
>
>                 Key: OAK-1877
>                 URL: https://issues.apache.org/jira/browse/OAK-1877
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.0
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>            Priority: Critical
>             Fix For: 1.0.1, 1.1
>
>         Attachments: updates-without-indexed-changes.patch
>
>
> OAK-1292 introduced the following interesting but not very nice behavior:
> On an idle system with no changes for an extended amount of time, the OAK-1292 change
blocks the async indexer from updating the reference to the last indexed checkpoint. After
one hour (the default checkpoint lifetime), the referenced checkpoint will expire, and the
indexer will fall back to full reindexing.
> The result of this behavior is that once every hour, the size of an idle instance will
grow with dozens or hundreds of megabytes of new index data generated by reindexing. Older
index data becomes garbage, but the compaction code from OAK-1804 is needed to make it collectable.
A better solution would be to prevent the reindexing from happening in the first place.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message