lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandr Bocharov <bocharov.alexa...@gmail.com>
Subject Can Solr solve this simple problem?
Date Mon, 16 Apr 2012 20:30:55 GMT
Hi everyone :)
Our company is very interesting in Solr engine for searching people.
I have 3 questions below about extended capabilities of Solr, but first I'd
like to present you the problem

Let's say we have ~100 mln users with many characteristics - some of them
described below.
We want to search users by any set of these characteristics (of course we
should use index clustering, replication, query distribution)

   - country - text (alpha-iso-3 country code)
   - language - text (alpha-iso-3 country code)
   - has_photo - boolean
   - has_video - boolean
   - lastvisit - date
   - gender - int
   - age - int
   - latitude - float
   - longitude - float
   - height - int
   - updated - date
   - 100+ other boolean fields to store and search by it - profile has some
   property or don't

*
Prototype SQL query looks like this:*

SELECT user_id
FROM users
WHERE
    AND country = 'USA'
    AND language = 'SPA'
    AND gender = 1
    AND age BETWEEN 30 AND 40
    AND latitude BETWEEN 39.0 AND 41.0
    AND longitude BETWEEN 73.0 AND 75.0
    AND height BETWEEN 170 AND 180
    AND has_photo = 1
    AND has_video = 0
    AND (bool_field1 = 1 OR bool_field2 = 1)
    AND (bool_fieldN = 0 OR bool_fieldM = 1 OR bool_fieldK = 0)
    ...
ORDER BY
    IF(has_photo = 1, 100, 0) +
    IF(language = 'FRA', 50, 0) +
    IF(has_description = 1, 150, 0) +
    IF(has_video = 1, 50, 0) +
    IF(lastvisit > NOW() - interval 1 month, 200, 0) DESC,
    IF(age > 35, 20, 0) +
    IF(gender = 2, 30, 0) +
    IF(bool_field1 = 1, 50, 0) +
    IF(bool_fieldN = 0, 100, 0) ASC
LIMIT 200;

So, these are my 3 questions:
1. Does Solr provide searching among different count fields with different
types like in WHERE condition?
2. Does Solr provide such sorting, that depends on other fields (like sums
in ORDER BY), other words - does it provide any kind of function, which is
used to sort results from q1?
3. Does Solr provide realtime index updating or updating every N minutes?

What advices can you give to provide this scheme searching with Solr?

Best regards.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message