jackrabbit-oak-issues mailing list archives

From "Thomas Mueller (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (OAK-2466) DataStoreBlobStore: chunk ids should not contain the size
Date Wed, 22 Jul 2015 11:51:05 GMT

     [ https://issues.apache.org/jira/browse/OAK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Mueller updated OAK-2466:
    Fix Version/s:     (was: 1.3.4)

> DataStoreBlobStore: chunk ids should not contain the size
> ---------------------------------------------------------
>                 Key: OAK-2466
>                 URL: https://issues.apache.org/jira/browse/OAK-2466
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: core
>            Reporter: Thomas Mueller
>            Assignee: Thomas Mueller
>              Labels: datastore, performance
>             Fix For: 1.4
> The blob store garbage collection (data store garbage collection) uses the chunk ids
> to identify binaries to be deleted. Blob ids now contain the size (<contentHash>#<size>),
> and the blob id is currently identical to the chunk id.
> It would be more efficient to _not_ include the size in the chunk ids, and instead use
> just the content hash. That way, enumerating the entries in the store is potentially
> faster. It also allows us to change the blob id in the future, for example to add more
> information to it (such as the creation time, or the first few bytes of the content),
> if we ever want to.
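
To make the proposal concrete, here is a minimal Java sketch of deriving a size-free chunk id from a blob id of the form <contentHash>#<size>, and of using such chunk ids to pick garbage collection candidates. The class ChunkIds and its methods toChunkId and findGarbage are hypothetical names invented for this illustration; they are not part of the Oak or Jackrabbit API, and the actual DataStoreBlobStore code may handle this quite differently.

```java
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, for illustration only; not an Oak class.
final class ChunkIds {

    private ChunkIds() {
    }

    // Returns the content hash part of a blob id such as "<contentHash>#<size>".
    // A blob id without a '#' separator is returned unchanged.
    static String toChunkId(String blobId) {
        int separator = blobId.lastIndexOf('#');
        return separator < 0 ? blobId : blobId.substring(0, separator);
    }

    // Returns the stored chunk ids that no referenced blob id maps to.
    // Assumes the store enumerates plain content hashes as chunk ids.
    static Set<String> findGarbage(Iterable<String> storedChunkIds,
                                   Iterable<String> referencedBlobIds) {
        Set<String> candidates = new LinkedHashSet<>();
        for (String chunkId : storedChunkIds) {
            candidates.add(chunkId);
        }
        for (String blobId : referencedBlobIds) {
            // The size suffix is dropped before comparing against the store.
            candidates.remove(toChunkId(blobId));
        }
        return candidates;
    }
}
```

With chunk ids reduced to the content hash, the comparison above does not depend on the size suffix at all, so the blob id format could later carry extra fields after the separator without affecting which chunks are considered referenced.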

This message was sent by Atlassian JIRA
