tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Carsten Ziegeler <cziege...@apache.org>
Subject Re: Support for document libraries
Date Tue, 10 Jul 2007 08:00:01 GMT
Bertrand Delacretaz wrote:
> On 7/10/07, Carsten Ziegeler <cziegeler@apache.org> wrote:
>> ... Although Tika is more the framework for plugin in such stuff, it
>> perhaps
>> makes sense to try to start something like that as sub projects of
>> Tika?...
> I would agree, although IMHO Tika should reuse existing libraries as
> much as possible.
Yes, it doesn't make sense to reinvent the wheel if there are
good-enough libraries out there. But afaik for several formats there
aren't suitable libs available, so these are the cases where I think
that it makes sense to "drag them in".

> In some cases, the Tika part could just consist of automated tests for
> existing libraries, to help in selecting and validating them.
> -Bertrand

Carsten Ziegeler

View raw message