lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pablo Anzorena (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-8768) Wrong behaviour in json facets
Date Tue, 01 Mar 2016 16:51:18 GMT

    [ https://issues.apache.org/jira/browse/SOLR-8768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15174008#comment-15174008
] 

Pablo Anzorena commented on SOLR-8768:
--------------------------------------

UPDATE:
It does not depend on the range facets. The problem is on the faceting with stats across multiple
shards. Following with the example,
'sellers_2014'
Tyrion, 1
Jon, 50
PoorNed, 4

'sellers_2015'
Tyrion, 100
Jon, 50
PoorNed, 4

'sellers_2016'
Tyrion, 1
Jon, 50
PoorNed, 4

If the request is top 2 sellers order by seller_measure desc, the response would be:
{code}
1)Jon 150
2)Tyrion 100
{code}

But the correct response would be Tyrion with 102, but it is not counting the +2 because Tyrion
is not in the top 2 of the other two shards.

> Wrong behaviour in json facets
> ------------------------------
>
>                 Key: SOLR-8768
>                 URL: https://issues.apache.org/jira/browse/SOLR-8768
>             Project: Solr
>          Issue Type: Bug
>          Components: Facet Module
>            Reporter: Pablo Anzorena
>
> This bug is quite difficult to explain it, so I will first show it with an example and
then explain it.
> I have a core splitted into three shards, let's call them 'sellers_2014', 'sellers_2015',
'sellers_2016'.
> The schema has the following fields:
> seller_name, string
> seller_measure, double
> seller_date, date
> With the following data.
> 'sellers_2014'
> Tyrion, 1, 2014-01-01T00:00:00Z
> Jon, 50, 2014-01-01T00:00:00Z
> PoorNed, 4, 2014-01-01T00:00:00Z
> 'sellers_2015'
> Tyrion, 100, 2015-01-01T00:00:00Z
> Jon, 50, 2015-01-01T00:00:00Z
> PoorNed, 4, 2015-01-01T00:00:00Z
> 'sellers_2016'
> Tyrion, 1, 2015-01-01T00:00:00Z
> Jon, 50, 2015-01-01T00:00:00Z
> PoorNed, 4, 2015-01-01T00:00:00Z
> Request:
> http://localhost:8983/solr/sellers_2016/select?q=*:*&shards=localhost:8983/solr/sellers_2014,localhost:8983/solr/sellers_2015,localhost:8983/solr/sellers_2016&json.facet=
> {code}
> {
>   top_sellers: {
>     type: terms,
>     field: seller_name,
>     limit: 2,
>     offset: 0,
>     sort: "seller_measure desc",
>     facet: {
>       seller_measure: "sum(seller_measure)",
>       seller_dates: {
>         type: range,
>         field: seller_date,
>         start: "2014-01-01T00:00:00Z",
>         end: "2016-12-31T00:00:00Z",
>         gap: "+1YEARS",
>         facet: {
>           seller_measure: "sum(seller_measure)"
>         }
>       }
>     }
>   }
> }
> {code}
> So... With the request I want to know the top 2 sellers across the three shards and for
each seller, their seller_measure for each year.
> The response I'm getting is:
> {code}
> "val": "Jon",
> "count": 3,
> "seller_measure": 150,
> "seller_dates": {
>   "buckets": [
>     {
>       "val": "2014-01-01T00:00:00Z",
>       "count": 1,
>       "seller_measure": 50
>     },
>     {
>       "val": "2015-01-01T00:00:00Z",
>       "count": 1,
>       "seller_measure": 50
>     },
>     {
>       "val": "2016-01-01T00:00:00Z",
>       "count": 1,
>       "seller_measure": 50
>     }
>   ]
> },
> "val": "Tyrion",
> "count": 3,
> "seller_measure": 102,
> "seller_dates": {
>   "buckets": [
>     {
>       "val": "2015-01-01T00:00:00Z",
>       "count": 1,
>       "seller_measure": 100
>     }
>   ]
> }
> {code}
> which is incorrect, because the two buckets of 2014 and 2016 in Tyrion are missing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message