www-announce mailing list archives

From jim...@apache.org
Subject [ANNOUNCE] Apache Lucene 8.0.0 released
Date Thu, 21 Mar 2019 15:07:18 GMT
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

14 March 2019, Apache Lucene™ 8.0.0 available

The Lucene PMC is pleased to announce the release of Apache Lucene 8.0.0.

Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full-text search, especially cross-platform.

This release contains numerous bug fixes, optimizations, and improvements,
some of which are highlighted below. The release is available for immediate
download at:

 http://lucene.apache.org/core/downloads.html

Lucene 8.0.0 Release Highlights:

Query execution
Term queries, phrase queries and boolean queries gained a new
optimization that enables efficient skipping over non-competitive documents
when the total hit count is not needed. Depending on the exact query and
data distribution, queries might run between a few percent slower and many
times faster, especially term queries and pure disjunctions.
In order to support this enhancement, some API changes have been made:
 * TopDocs.totalHits is no longer a long but an object that gives a lower
bound of the actual hit count.
 * IndexSearcher's search and searchAfter methods now only compute total
hit counts accurately up to 1,000 in order to enable this optimization by
default.

 * Queries are now required to produce non-negative scores.
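
To illustrate why the hit count becomes a lower bound, here is a minimal
toy model in plain Java. It is not Lucene code: the HitCountSketch class,
its Hits and Relation types, and the collect method are hypothetical names
invented for this sketch. It assumes that hits are counted exactly up to a
threshold, after which documents that cannot enter the top K are skipped
and therefore never counted.

```java
import java.util.PriorityQueue;

// Toy model only: none of these types are Lucene classes.
final class HitCountSketch {
    enum Relation { EQUAL_TO, GREATER_THAN_OR_EQUAL_TO }

    static final class Hits {
        final long value;         // number of hits actually counted
        final Relation relation;  // whether value is exact or a lower bound
        Hits(long value, Relation relation) {
            this.value = value;
            this.relation = relation;
        }
    }

    // scores: one score per matching document, in doc ID order.
    // topK must be at least 1.
    static Hits collect(float[] scores, int topK, int threshold) {
        PriorityQueue<Float> best = new PriorityQueue<>(); // min-heap of top-K scores
        long counted = 0;
        boolean skipped = false;
        for (float score : scores) {
            // Skipping is only allowed once the count exceeds the threshold
            // and the top-K queue is full.
            boolean maySkip = counted > threshold && best.size() == topK;
            if (maySkip && score <= best.peek()) {
                skipped = true; // non-competitive: neither scored nor counted
                continue;
            }
            counted++;
            best.add(score);
            if (best.size() > topK) {
                best.poll(); // drop the lowest score to keep only the top K
            }
        }
        Relation relation = skipped ? Relation.GREATER_THAN_OR_EQUAL_TO : Relation.EQUAL_TO;
        return new Hits(counted, relation);
    }
}
```

With the scores {1, 2, 3, 4, 5, 0.5, 0.1, 6}, topK = 2 and a threshold of 3,
this sketch counts 6 hits and reports a lower bound although 8 documents
match. With a threshold of 1,000 it counts all 8 exactly.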

Codecs
 * Postings now index score impacts alongside skip data. This is how term
queries optimize collection of top hits when hit counts are not needed.
 * Doc values introduced jump tables, so that advancing runs in constant
time. This is especially helpful on sparse fields.
 * The terms index FST is now loaded off-heap for non-primary-key fields
using MMapDirectory, reducing heap usage for such fields.
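
The jump table idea can be sketched in plain Java as follows. This is a toy
model under stated assumptions, not the on-disk format Lucene uses: the
JumpTableSketch class, its fields, and the block size of 8 are all
hypothetical. It shows how a per-block offset table lets advance() start
inside the right block instead of scanning every earlier entry of a sparse
field.

```java
// Toy model only: this is not Lucene's doc values format.
final class JumpTableSketch {
    static final int BLOCK_SIZE = 8; // doc IDs per block (arbitrary toy value)

    private final int[] docs; // sorted doc IDs that have a value
    private final int[] jump; // jump[b] = index in docs of the first ID >= b * BLOCK_SIZE

    JumpTableSketch(int[] sortedDocs, int maxDoc) {
        docs = sortedDocs.clone();
        int blocks = (maxDoc + BLOCK_SIZE - 1) / BLOCK_SIZE;
        jump = new int[blocks + 1];
        int i = 0;
        for (int b = 0; b <= blocks; b++) {
            while (i < docs.length && docs[i] < b * BLOCK_SIZE) {
                i++;
            }
            jump[b] = i;
        }
    }

    // Returns the first doc ID >= target that has a value, or -1 if none.
    // Assumes target >= 0.
    int advance(int target) {
        int block = target / BLOCK_SIZE;
        if (block >= jump.length) {
            return -1;
        }
        // Start at the block's first entry. Because docs is sorted, at most
        // BLOCK_SIZE + 1 entries are examined, whatever the field's size.
        for (int i = jump[block]; i < docs.length; i++) {
            if (docs[i] >= target) {
                return docs[i];
            }
        }
        return -1;
    }
}
```

For a field with values on documents 3, 40, 41 and 97 out of 100, a call to
advance(42) starts at the entry for document 40 rather than at the
beginning of the list.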

Custom scoring
The new FeatureField allows efficient integration of static features such
as a pagerank into the score. Furthermore, the new
LongPoint#newDistanceFeatureQuery and LatLonPoint#newDistanceFeatureQuery
methods allow boosting by recency and geo-distance respectively. These new
helpers are optimized for the case when total hit counts are not needed.
For instance, if the pagerank has a significant weight in your scores, then
Lucene might be able to skip over documents that have a low pagerank value.
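
As a rough illustration of how such features can contribute to a score,
the sketch below computes two functions in plain Java, assuming the forms
given in the Lucene Javadoc: the saturation function, one of the options
FeatureField offers, and the distance feature function. The
FeatureScoreSketch class and its method names are hypothetical, and the
code does not call Lucene.

```java
// Toy arithmetic only: these helpers are not part of the Lucene API.
final class FeatureScoreSketch {
    // Saturation: weight * S / (S + pivot). The result grows with the
    // feature value S but never exceeds weight.
    static double saturation(double weight, double feature, double pivot) {
        return weight * feature / (feature + pivot);
    }

    // Distance feature: weight * pivotDistance / (pivotDistance + distance).
    // The result equals weight at the origin and weight / 2 at pivotDistance.
    static double distanceFeature(double weight, double pivotDistance, long origin, long value) {
        double distance = Math.abs((double) value - (double) origin);
        return weight * pivotDistance / (pivotDistance + distance);
    }
}
```

Both functions are bounded by their weight and change monotonically with
the feature value or the distance, which is the kind of property that
makes it possible to bound the best score a group of documents can reach.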

Further details of changes are available in the change log at:
http://lucene.apache.org/core/8_0_0/changes/Changes.html

Please report any feedback to the mailing lists
(http://lucene.apache.org/core/discussion.html)

Note: The Apache Software Foundation uses an extensive mirroring network
for distributing releases. It is possible that the mirror you are using may
not have replicated the release yet. If that is the case, please try
another mirror. This also applies to Maven access.
-----BEGIN PGP SIGNATURE-----
Version: FlowCrypt 6.6.7 Gmail Encryption
Comment: Seamlessly send and receive encrypted email

wsFcBAEBCAAGBQJck6ikAAoJEPmME8+loTXYe9IQAIObndDG2H4pNT8YE345
3KJDBgvtm9Ge3yIpdXQd5M7f8SJKKiHSK+UCJP+5CfWKakm4Ook5XlAKuIrN
GEcIxaGgkvkizYOZI6nkRrJsfE5dFVofZcOp/5+dhoqXx7AlQfkIGH8kOEoD
7GyJlAxy4Nh9SbhELjWeN6scThGzHGOlvAflcUFh8OyDw4hNyUkyI34SARtA
cqG4uGQ9fUIoDw8tTVn8YtnwIdgkO7ON2fV2HklICJ/YDOof7+lZmZEg8h4R
RkAkT/hW68cTvDwZWmmWMLOq0dY35gNrIOule/gir+djc5HymUd0l7CWBd5/
L7Iorr5eXk77nDeCcTKOQEtfcyCNINi8Pkmlyi/mAPHEQDCax3QpqaT6+++7
Tnm4A9/4vFD0dpswAe0EJ2/p4VfU4hV0GPiD5VB67gk4B+gv+wnRD901Gt1w
qt7cJXDkZJsUMwyId0uuNAJq15pWluT+1fBFAjW6UOWrAMgYUvXbZ8I8UbbC
UDBTROd6DJzmVlN7Sf9zIQ0OWs6+cf7SH2+Uuv/oxlooD18Cw9TZdC7FxfFY
h7Mdtvev3JNzXNOWWhhL5QrNPdkFjXJRrjOhzW+NR6uK1ohIyE8rkNLlNaBC
MBufZYoSpUu/HQQ4lo8rnvmEMsCde/UHuLxC8+bwsptcwIl1BDIwwjv0mi3Y
n1hv
=kRTR
-----END PGP SIGNATURE-----
