lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <>
Subject [jira] [Commented] (LUCENE-5667) Optimize common-prefix across all terms in a field
Date Fri, 16 May 2014 11:24:06 GMT


ASF subversion and git services commented on LUCENE-5667:

Commit 1594846 from [~mikemccand] in branch 'dev/trunk'
[ ]

LUCENE-5667: add test case

> Optimize common-prefix across all terms in a field
> --------------------------------------------------
>                 Key: LUCENE-5667
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: 4.9, 5.0
> I tested different UUID sources in Lucene
> and I was surprised to see that Flake IDs were slower than UUID V1.
> They use the same raw sources of info (timestamp, node id, sequence
> counter) but Flake ID preserves total order by keeping the timestamp
> "intact" in the leading 64 bits.
> I think the reason might be because a Flake ID will typically have a
> longish common prefix for all docs, and I think we might be able to
> optimize this in block-tree by storing that common prefix outside of
> the FST, or maybe just pre-computing the common prefix on init and
> storing the "effective" start node for the FST.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message