tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: Additional search for tika.a.o
Date Tue, 10 Aug 2010 04:11:35 GMT
+1 to Jukka's comments.


On 8/9/10 3:13 PM, "Jukka Zitting" <jukka.zitting@gmail.com> wrote:


On Sun, Aug 8, 2010 at 7:06 AM, Otis Gospodnetic
<otis_gospodnetic@yahoo.com> wrote:
> Would the community be interested in a patch that adds another search option to
> the search box on tika.apache.org?

Sure. I think the earlier consensus within the Lucene PMC was that any
reasonable Lucene-based search engines would be given equal standing
as our site search providers. It's a bit unfortunate that this may
lead to some inconsistency in the user interface, but I think we can
live with that until someone steps up to implement and maintain such a
search engine on Apache hardware.

> Assuming people are for this, any suggestions for how the search should function
> by default or any specific instructions for how the search box should be
> modified would be great!

Ideally the search box would allow the user to choose which provider
to use. Something like a cookie that remembers the user's selection
would be nice. If the user doesn't make an explicit selection of the
provider, then one should be selected randomly for the first search
and remembered afterwards for consistency.

It would be great if the required logic was implemented on the client
side using javascript, as otherwise we'd need to start messing up with
CGI scripts, etc.

The current site template can be found at [1].

[1] https://svn.apache.org/repos/asf/tika/site/src/site/site.vm


Jukka Zitting

Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message