lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-5512) Optimize DocValuesFacets
Date Thu, 28 Nov 2013 05:53:35 GMT

     [ https://issues.apache.org/jira/browse/SOLR-5512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Robert Muir updated SOLR-5512:
------------------------------

    Attachment: SOLR-5512.patch

patch.

also fixes a few unrelated bugs that got in the way:
* BitDocSet's DISI's cost() method doesnt prorate for the size of the segment relative to
the whole thing (since the bitset is unfortunately top-level)
* If you hit an exception during faceting, the stacktrace is completely lost because of a
bug in SimpleFacets (from SOLR-2548)

> Optimize DocValuesFacets 
> -------------------------
>
>                 Key: SOLR-5512
>                 URL: https://issues.apache.org/jira/browse/SOLR-5512
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Robert Muir
>         Attachments: SOLR-5512.patch
>
>
> This works well in the general case (esp with huge numbers of unique values), but the
SortedSetDocValuesAccumulator in lucene/facets does the algorithm better for typical cases
(smaller number of unique values wrt the size of the document set).
> In this case, it collects directly with per-segment local ords, then remaps as a second
step. So this is a lot less remapping.
> Its too bad the code is separate at the moment, for now lets steal the heuristic.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message