tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Mattmann <chris.mattm...@jpl.nasa.gov>
Subject Re: Third Tika report
Date Wed, 13 Jun 2007 13:45:04 GMT
Jukka,

 Thanks for putting this together. +1, looks great, as usual.

Cheers,
  Chris



On 6/12/07 12:48 PM, "Jukka Zitting" <jukka.zitting@gmail.com> wrote:

> Hi,
> 
> This is my draft for the third Tika report. This report completest the
> initial three-month period after which we will be reporting only once
> per quarter.
> 
> I'm hoping that we could have our first release out by the next report
> in September, but I guess it's safer not to set any expectations in
> the official report at this point.
> 
> <report>
> Tika is a toolkit for detecting and extracting metadata and structured
> text content from various documents using existing parser libraries.
> Tika entered incubation on March 22nd, 2007.
> 
> Community
> 
> The Tika mailing lists have been relatively quiet lately, probably
> because with little code we don't yet have many concrete issues to
> talk about.
> 
> Development
> 
> We saw the first piece of Tika code when Chris A. Mattmann ported the
> Nutch metadata framework to Tika. Rida Benjelloun is currently working
> on bringing Lius code into Tika but the initial commits on that front
> have not yet happened.
> 
> Issues before graduation
> 
> The Tika project is still at an early stage of incubation. We need to
> continue bringing in the initial codebases and probably target for an
> initial incubating release later this year. We also need to work on
> growing the community and figuring out how to best interact with
> external parser projects.
> </report>
> 
> BR,
> 
> Jukka Zitting



Mime
View raw message