nutch-dev mailing list archives

From Andrzej Bialecki
Subject Re: servlet
Date Wed, 23 Mar 2005 10:19:36 GMT
John X wrote:
> Hi, All,
> Attached please find servlet that serves raw Content
> of any mime type. Current cached.jsp handles mime type text/* only.
> If no objection, it is going to be committed in a few days.

I think this would be quite useful.

However, what I think is ultimately needed to match the features of 
other search engines is not the ability to return the cached non-html 
content (there might even be copyright issues with this function...), 
but an html rendering of non-html content, a la Google's "View as HTML".

Best regards,
Andrzej Bialecki
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
Contact: info at sigram dot com
