tika-dev mailing list archives

From Carsten Ziegeler <cziege...@apache.org>
Subject Re: Questions
Date Mon, 02 Jul 2007 10:05:36 GMT
Bertrand Delacretaz wrote:
> On 6/30/07, Grant Ingersoll <gsingers@apache.org> wrote:
>> ...My main concern w/ extracting Nutch is all the dependencies on
>> Hadoop, etc.  But it does seem like the shortest path for me....
> I've mentioned Tika to a few colleagues lately, and one thing that
> comes up often is that there are many document/format parsing
> libraries around, which should ideally be usable as Tika plugins with
> as little changes as possible.
> But these libraries' dependencies are all around the place, and
> probably conflicting in many cases.
> It might be good to take that into account in the design of Tika, and
> use solid classloading and isolation mechanisms. OSGI comes to mind,
> assuming it doesn't bloat the whole thing.
Yes, in many cases a solid classloading mechanism is a must, and OSGi
definitely implements this properly.
I think we can leave this open (i.e. not require OSGi) if we
have an open way of registering the plugins. Registering in an OSGi
environment might then be slightly different from registering in
a non-OSGi environment. Of course, using the latter might result in
classloading problems :) But then it's up to the developer to decide in
which environment Tika should run, with all the pros and cons that come
with this decision.
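
To make the "open way of registering the plugins" a bit more concrete, here is a rough sketch of what an environment-neutral registry might look like. Everything in it is an assumption for illustration only: `ParserPlugin`, `PluginRegistry` and `PlainTextPlugin` are hypothetical names, not an existing or proposed Tika API. The idea is only that the registry does not know who calls `register`, so an OSGi bundle could call it when it starts, and a plain application could call it directly at startup.

```java
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical plugin contract (an assumption, not a Tika interface).
interface ParserPlugin {
    String mimeType();
    String extractText(byte[] content);
}

// Environment-neutral registry: it stores plugins by MIME type and
// does not care whether the caller runs inside OSGi or not.
final class PluginRegistry {
    private final Map<String, ParserPlugin> plugins = new ConcurrentHashMap<>();

    // In an OSGi setup this could be called when a bundle starts;
    // in a plain setup the application calls it itself.
    void register(ParserPlugin plugin) {
        plugins.put(plugin.mimeType(), plugin);
    }

    // Removes the plugin only if it is still the one registered for its type.
    void unregister(ParserPlugin plugin) {
        plugins.remove(plugin.mimeType(), plugin);
    }

    Optional<ParserPlugin> lookup(String mimeType) {
        return Optional.ofNullable(plugins.get(mimeType));
    }
}

// Trivial example plugin, used only to exercise the registry.
final class PlainTextPlugin implements ParserPlugin {
    public String mimeType() {
        return "text/plain";
    }

    public String extractText(byte[] content) {
        return new String(content, StandardCharsets.UTF_8);
    }
}
```

In the non-OSGi case all plugins would share one classloader, which is where the classloading problems mentioned above could show up; the registry itself would stay the same in both environments.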

