tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: Specify HTMLHandler via Context
Date Wed, 07 Jul 2010 15:11:10 GMT
+1, I think this is a good idea. Why not make it override-able and fallback on the (existing)
default mechanism for back compat and for API/end-user stability.


On 7/7/10 8:08 AM, "Julien Nioche" <lists.digitalpebble@gmail.com> wrote:

Hi guys,

One of the recent changes on Tika is the possibility to specify a custom
HTMLMapper via the Context - which I think is an elegant mechanism. I was
wondering whether there would be a reason NOT to be able to do the same for
the HTMLHandler and if nothing is passed via the Context, rely on the
current implementation. This would give more control to the user on what to
do with the SAX events while at the same time preserving the functionality
by default.

Any thoughts on this?


DigitalPebble Ltd

Open Source Solutions for Text Engineering

Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message