jackrabbit-oak-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chetan Mehrotra (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (OAK-2470) Support for maxExtractLength while parsing binaries with Tika
Date Mon, 02 Feb 2015 17:45:35 GMT

     [ https://issues.apache.org/jira/browse/OAK-2470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chetan Mehrotra resolved OAK-2470.
----------------------------------
       Resolution: Fixed
    Fix Version/s:     (was: 1.2)
                   1.1.6

Fixed in trunk with http://svn.apache.org/r1656526

Config can be done via tika/@maxExtractLength. A negative value indicates 
a multiple of maxFieldLength and a positive value is used as is

So 
* maxExtractLength = -10, maxFieldLength = 10000 -> Actual value = 100000
* maxExtractLength = 1000 -> Actual value = 1000

This is similar to current JR2 support

> Support for maxExtractLength while parsing binaries with Tika
> -------------------------------------------------------------
>
>                 Key: OAK-2470
>                 URL: https://issues.apache.org/jira/browse/OAK-2470
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: oak-lucene
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>            Priority: Minor
>             Fix For: 1.1.6
>
>
> JR2 had support for {{maxExtractLength}} [1] which restricts the number of charachters
extracted by Tika parser. Similar support should be provided in Oak Lucene
> [1] https://github.com/apache/jackrabbit/blob/trunk/jackrabbit-core/src/main/java/org/apache/jackrabbit/core/query/lucene/SearchIndex.java#L2074



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message