lucene-java-user mailing list archives

From Martynas L <martynas....@gmail.com>
Subject Re: Slower document retrieval in 8.7.0 comparing to 7.5.0
Date Tue, 22 Dec 2020 14:05:15 GMT
Query is fast, but document retrieval is "slow".
We call:
1) IndexSearcher#search(Query, Collector) to collect docIDs, and then
2) retrieve documents with IndexSearcher#doc(int).

In our case, (1) takes less than 0.5 s, while (2) takes almost 1.5 min (4 times
slower than with 7.5.0).
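
As a rough sanity check on the "4 times slower" figure, the timings quoted further down in this thread can be turned into slowdown ratios and per-document costs. The following is only a sketch in plain Java: the class and method names are made up for illustration, the numbers are copied from the earlier message, and it assumes that in each "X vs Y" pair the first value is 8.7.0 and the second is 7.5.0.

```java
import java.util.Locale;

public class RetrievalTimings {
    // Number of documents retrieved, as reported in the quoted message.
    public static final long DOC_COUNT = 3_402_519L;

    // Ratio of the newer version's wall time to the older version's.
    public static double slowdown(double newSeconds, double oldSeconds) {
        return newSeconds / oldSeconds;
    }

    // Average retrieval cost per document, in microseconds.
    public static double microsPerDoc(double totalSeconds, long docCount) {
        return totalSeconds * 1_000_000.0 / docCount;
    }

    public static void main(String[] args) {
        // Each row: {doc size, seconds on 8.7.0 (assumed), seconds on 7.5.0 (assumed)}.
        double[][] runs = {
            {903, 88.0, 22.0},
            {36, 78.0, 16.0},
            {439, 46.0, 14.5},
        };
        for (double[] r : runs) {
            System.out.printf(Locale.ROOT,
                "doc size %.0f: %.2fx slower, %.1f vs %.1f us/doc%n",
                r[0],
                slowdown(r[1], r[2]),
                microsPerDoc(r[1], DOC_COUNT),
                microsPerDoc(r[2], DOC_COUNT));
        }
    }
}
```

Under that assumption the three runs come out at roughly 3.2x to 4.9x, and the ratio stays about the same even when only one small field is loaded.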

On Tue, Dec 22, 2020 at 3:23 PM Vincenzo D'Amore <v.damore@gmail.com> wrote:

> I think it would be useful to have an example of a document and, if
> possible, an example of query that takes too long.
>
> On Mon, Dec 21, 2020 at 1:47 PM Martynas L <martynas.sub@gmail.com> wrote:
>
> > Hello,
> >
> > I am sorry for the delay.
> >
> > Not sure what you mean by "workload". We have performance tests, which
> > started failing after upgrading to 8.7.0.
> > So I just tried to query the index (built from the same source) to get all
> > documents and compare the performance with 7.5.0.
> >
> > Document "size" is the sum of all stored string lengths (3402519 documents):
> >
> > doc size 903 - 88s vs 22s
> >
> > doc size 36 (only one field loaded, used searcher.doc(docID,
> > Collections.singleton("fieldName"))) - 78s vs 16s
> >
> > doc size 439 (some fields made not stored) - 46s vs 14.5s
> >
> > Best regards,
> > Martynas
> >
> > On Fri, Dec 4, 2020 at 12:06 AM Adrien Grand <jpountz@gmail.com> wrote:
> >
> > > Hello Martynas,
> > >
> > > There have indeed been changes related to stored fields in 8.7. What does
> > > your workload look like and how large are your documents on average?
> > >
> > > On Thu, Dec 3, 2020 at 3:04 PM Martynas L <martynas.sub@gmail.com> wrote:
> > >
> > > > Hi,
> > > > We've migrated from 7.5.0 to 8.7.0 and found that index "searching"
> > > > is significantly (4-5 times) slower in the latest version.
> > > > It seems that
> > > > org.apache.lucene.search.IndexSearcher#doc(int)
> > > > is slower.
> > > >
> > > > Is it possible to have similar performance with 8.7.0?
> > > >
> > > > Best regards,
> > > > Martynas
> > > >
> > >
> > >
> > > --
> > > Adrien
> > >
> >
>
>
> --
> Vincenzo D'Amore
>
