tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Luís Filipe Nassif <lfcnas...@gmail.com>
Subject Re: Guidance to avoid Tika's integration with Solr's ExtractingRequestHandler in production
Date Tue, 29 May 2018 19:18:29 GMT
Hi Ken,

Threads will not help with OutOfMemoryErrors or crashes caused by native
libs. ForkParser can help, after the refactoring started by Tim to handle
some of its limitations. See TIKA-2653

2018-05-29 16:11 GMT-03:00 Ken Krugler <kkrugler_lists@transpac.com>:

> Thanks for the ref, Tim.
>
> I’m curious why SolrCell doesn’t fire up threads when parsing docs with
> Tika (or use the fork parser), to mitigate issues with hangs & crashes?
>
> — Ken
>
> > On May 29, 2018, at 11:54 AM, Tim Allison <tallison@apache.org> wrote:
> >
> > All,
> >
> >  Over the weekend, Shawn Heisey very kindly drafted a wikipage about the
> > challenges of using Solr's ExtractingRequestHandler and the guidance to
> > avoid it in production.
> >
> >   I completely agree with this point, and I think that Shawn did a very
> > nice job of capturing some of the challenges.  If you have any feedback
> or
> > would like to make edits, see:
> >
> > https://wiki.apache.org/solr/RecommendCustomIndexingWithTika
> >
> >   Cheers,
> >
> >                 Tim
>
> --------------------------------------------
> http://about.me/kkrugler
> +1 530-210-6378
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message