lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-5798) minor optimizations to MultiDocs(AndPositions)Enum.reset()
Date Tue, 01 Jul 2014 11:53:26 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048784#comment-14048784
] 

ASF subversion and git services commented on LUCENE-5798:
---------------------------------------------------------

Commit 1607055 from [~rcmuir] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1607055 ]

LUCENE-5798: Optimize MultiDocsEnum reuse

> minor optimizations to MultiDocs(AndPositions)Enum.reset()
> ----------------------------------------------------------
>
>                 Key: LUCENE-5798
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5798
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Robert Muir
>             Fix For: 5.0, 4.10
>
>         Attachments: LUCENE-5798.patch
>
>
> This method is called by merging for each term, potentially many times, but only returning
a few docs for each invocation (e.g. imagine high cardinality fields, unique id fields, normal
zipf distribution on full text).
> Today we create a new EnumWithSlice[] array and new EnumWithSlice entry for each term,
but this creates a fair amount of unnecessary garbage: instead we can just make this array
up-front as size subReaderCount and reuse it.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message