lucene-dev mailing list archives

From "Adrien Grand (JIRA)" <>
Subject [jira] [Commented] (LUCENE-4599) Compressed term vectors
Date Fri, 07 Dec 2012 17:21:20 GMT


Adrien Grand commented on LUCENE-4599:

bq. I think we waste space with the terms, especially prefix/suffix lengths [..] these should
likely be bulk-compressed

Good point.
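
To illustrate the suggestion, here is a minimal sketch of collecting the prefix/suffix lengths of sorted terms into plain int arrays, so that they could be handed to a bulk integer encoder instead of being written one by one. The class and method names (TermLengths, prefixSuffixLengths, bitsRequired) are hypothetical and are not taken from the patch; the sketch assumes terms arrive in sorted order and are encoded as UTF-8.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the LUCENE-4599 patch.
final class TermLengths {

    /** Number of leading bytes that a and b have in common. */
    static int sharedPrefix(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        int i = 0;
        while (i < n && a[i] == b[i]) {
            i++;
        }
        return i;
    }

    /**
     * For each term, computes the length of the prefix shared with the
     * previous term and the length of the remaining suffix.
     * result[0] holds the prefix lengths, result[1] the suffix lengths.
     */
    static int[][] prefixSuffixLengths(String[] sortedTerms) {
        int[] prefix = new int[sortedTerms.length];
        int[] suffix = new int[sortedTerms.length];
        byte[] prev = new byte[0]; // the first term shares nothing
        for (int i = 0; i < sortedTerms.length; i++) {
            byte[] cur = sortedTerms[i].getBytes(StandardCharsets.UTF_8);
            prefix[i] = sharedPrefix(prev, cur);
            suffix[i] = cur.length - prefix[i];
            prev = cur;
        }
        return new int[][] { prefix, suffix };
    }

    /** Bits needed to store the largest value in the array (at least 1). */
    static int bitsRequired(int[] values) {
        int max = 0;
        for (int v : values) {
            max = Math.max(max, v);
        }
        return Math.max(1, 32 - Integer.numberOfLeadingZeros(max));
    }
}
```

Once the lengths sit in arrays like these, a whole chunk can be encoded with a fixed small number of bits per value rather than at least one byte per length. Whether that pays off in practice depends on the data.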

bq. flags are wasteful and stupid, but it seems like you already tried to address that to
some extent

I'm storing them in a packed ints array that uses 3 bits per value. I'll try to
optimize the case where a field always has the same flags.
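
For readers unfamiliar with the idea, here is a rough sketch of what packing 3-bit flag values into an array of longs can look like, together with a check for the "all flags are the same" case mentioned above. This is not the code from the patch and does not use Lucene's packed ints classes; FlagPacking and its methods are hypothetical names chosen for illustration.

```java
// Hypothetical illustration, not the implementation in the patch.
final class FlagPacking {
    static final int BITS_PER_VALUE = 3;
    static final long MASK = (1L << BITS_PER_VALUE) - 1;

    /** Packs each flag value (0..7) into 3 consecutive bits of a long[]. */
    static long[] pack(int[] flags) {
        long[] blocks = new long[(flags.length * BITS_PER_VALUE + 63) / 64];
        for (int i = 0; i < flags.length; i++) {
            long v = flags[i] & MASK;
            int bitPos = i * BITS_PER_VALUE;
            int block = bitPos >>> 6;  // which long holds the first bit
            int shift = bitPos & 63;   // bit offset inside that long
            blocks[block] |= v << shift;
            if (shift > 64 - BITS_PER_VALUE) {
                // the value straddles two longs: spill the high bits over
                blocks[block + 1] |= v >>> (64 - shift);
            }
        }
        return blocks;
    }

    /** Reads back the flag value stored at the given index. */
    static int get(long[] blocks, int index) {
        int bitPos = index * BITS_PER_VALUE;
        int block = bitPos >>> 6;
        int shift = bitPos & 63;
        long v = blocks[block] >>> shift;
        if (shift > 64 - BITS_PER_VALUE) {
            v |= blocks[block + 1] << (64 - shift);
        }
        return (int) (v & MASK);
    }

    /** True if every entry is identical, so a single value could be stored instead. */
    static boolean allEqual(int[] flags) {
        for (int i = 1; i < flags.length; i++) {
            if (flags[i] != flags[0]) {
                return false;
            }
        }
        return true;
    }
}
```

With this layout, 30 flag values fit in two longs (90 bits), and when allEqual returns true a writer could presumably store one value plus a marker rather than the whole array.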

> Compressed term vectors
> -----------------------
>                 Key: LUCENE-4599
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Task
>          Components: core/codecs, core/termvectors
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>             Fix For: 4.1
>         Attachments: LUCENE-4599.patch
> We should have codec-compressed term vectors similarly to what we have with stored fields.
