lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Imbeault <michael.imbea...@sympatico.ca>
Subject Re: Facet performance with heterogeneous 'facets'?
Date Tue, 19 Sep 2006 02:14:37 GMT
Just a little follow-up - I did a little more testing, and the query 
takes 20 seconds no matter what - If there's one document in the results 
set, or if I do a query that returns all 130000 documents.

It seems something isn't right... it looks like solr is doing faceted 
search on the whole index no matter what's the result set when doing 
facets on a string field. I must be doing something wrong?

Michael Imbeault
CHUL Research Center (CHUQ)
2705 boul. Laurier
Ste-Foy, QC, Canada, G1V 4G2
Tel: (418) 654-2705, Fax: (418) 654-2212



Michael Imbeault wrote:
> Been playing around with the news 'facets search' and it works very 
> well, but it's really slow for some particular applications. I've been 
> trying to use it to display the most frequent authors of articles; 
> this is from a huge (15 millions articles) database and names of 
> authors are rare and heterogeneous. On a query that takes (without 
> facets) 0.1 seconds, it jumps to ~20 seconds with just 1% of the 
> documents indexed (I've been getting java.lang.OutOfMemoryError with 
> the full index). ~40 seconds for a faceted search on 2 (string) 
> fields. Range queries on a slong field is more acceptable (even with a 
> dozen of them, query time is still in the subsecond range).
>
> I'm I trying to do something which isn't what faceted search was made 
> for? It would be understandable, after all, I guess the facets engine 
> has to check very doc in the index and sort... which shouldn't yield 
> good performance no matter what, sadly.
>
> Is there any other way I could achieve what I'm trying to do? Just a 
> list of the most frequent (top 5) authors present in the results of a 
> query.
>
> Thanks,
>

Mime
View raw message