lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rutherglen (JIRA)" <>
Subject [jira] Commented: (LUCENE-2897) apply delete-by-Term and docID immediately to newly flushed segments
Date Sat, 29 Jan 2011 16:33:44 GMT


Jason Rutherglen commented on LUCENE-2897:

bq. Well... I think we can't [easily] skip writing the postings, because could result in non-deterministic
behavior (I put a comment on this in the patch).

Instead we're building the deleted docs BV as we're flushing.

> apply delete-by-Term and docID immediately to newly flushed segments
> --------------------------------------------------------------------
>                 Key: LUCENE-2897
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: 3.2, 4.0
>         Attachments: LUCENE-2897.patch
> Spinoff from LUCENE-2324.
> When we flush deletes today, we keep them as buffered Term/Query/docIDs that need to
be deleted.  But, for a newly flushed segment (ie fresh out of the DWPT), this is silly, because
during flush we visit all terms and we know their docIDs.  So it's more efficient to apply
the deletes (for this one segment) at that time.
> We still must buffer deletes for all prior segments, but these deletes don't need to
map to a docIDUpto anymore; ie we just need a Set.
> This issue should wait until LUCENE-1076 is in since that issue cuts over buffered deletes
to a transactional stream.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message