jackrabbit-oak-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chetan Mehrotra (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (OAK-2469) Restrict the maximum number of terms that will be indexed for a single field
Date Mon, 02 Feb 2015 09:27:34 GMT

     [ https://issues.apache.org/jira/browse/OAK-2469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chetan Mehrotra updated OAK-2469:
---------------------------------
    Attachment: OAK-2469.patch

Patch for the same.

So far with Lucene 4.x my understanding was there was no actual limit on number of token for
a given field. While for 3.x it was probably 10000 and with LUCENE-2295 this change moved
to analyzer and by default it was again unlimited

However JR2 used to configure a {{LimitTokenCountAnalyzer}} with 10000 as default.

So with this change we would be changing the default from unlimited -> 10000. We can change
the default to unlimited however I would prefer we specify a default of 10000

[~alex.parvulescu] [~teofili]  [~mreutegg] Thoughts?

> Restrict the maximum number of terms that will be indexed for a single field
> ----------------------------------------------------------------------------
>
>                 Key: OAK-2469
>                 URL: https://issues.apache.org/jira/browse/OAK-2469
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: oak-lucene
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>            Priority: Minor
>             Fix For: 1.2
>
>         Attachments: OAK-2469.patch
>
>
> JR2 used support a {{maxFieldLength}} [1] to limit the numbers of terms indexed per field
with default value of 10000. Similar support should be provided in Oak Lucene
> [1] https://github.com/apache/jackrabbit/blob/trunk/jackrabbit-core/src/main/java/org/apache/jackrabbit/core/query/lucene/SearchIndex.java#L299



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message