tika-dev mailing list archives

From Luís Filipe Nassif <lfcnas...@gmail.com>
Subject Re: Guidance to avoid Tika's integration with Solr's ExtractingRequestHandler in production
Date Tue, 29 May 2018 19:21:58 GMT
Related to this, do we have any guidance to help Java users choose
between ForkParser and TikaServer?

2018-05-29 16:18 GMT-03:00 Luís Filipe Nassif <lfcnassif@gmail.com>:

> Hi Ken,
>
> Threads will not help with OutOfMemoryErrors or crashes caused by native
> libs. ForkParser can help, after the refactoring started by Tim to handle
> some of its limitations. See TIKA-2653
>
> 2018-05-29 16:11 GMT-03:00 Ken Krugler <kkrugler_lists@transpac.com>:
>
>> Thanks for the ref, Tim.
>>
>> I’m curious why SolrCell doesn’t fire up threads when parsing docs with
>> Tika (or use the fork parser), to mitigate issues with hangs & crashes?
>>
>> — Ken
>>
>> > On May 29, 2018, at 11:54 AM, Tim Allison <tallison@apache.org> wrote:
>> >
>> > All,
>> >
>> >  Over the weekend, Shawn Heisey very kindly drafted a wikipage about the
>> > challenges of using Solr's ExtractingRequestHandler and the guidance to
>> > avoid it in production.
>> >
>> >   I completely agree with this point, and I think that Shawn did a very
>> > nice job of capturing some of the challenges.  If you have any feedback or
>> > would like to make edits, see:
>> >
>> > https://wiki.apache.org/solr/RecommendCustomIndexingWithTika
>> >
>> >   Cheers,
>> >
>> >                 Tim
>>
>> --------------------------------------------
>> http://about.me/kkrugler
>> +1 530-210-6378
>>
>>
>
