lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Koji Sekiguchi <k...@r.email.ne.jp>
Subject Re: HTML Stripping slower in Solr 1.4?
Date Sat, 05 Dec 2009 16:03:47 GMT
Yonik Seeley wrote:
> Is BaseCharFilter required for the html strip filter?
>
> -Yonik
> http://www.lucidimagination.com
>
>   
It could be if HTMLStripCharFilter is reverted to first version.
The first version of HTMLStripCharFilter, for example,
if we have "<p>aaa", it produces "   aaa" (3 space chars prior
to aaa). But after committed SOLR-1394, it produces " aaa"
(1 space) and now it uses correct() method of BaseCharFilter
to correct offsets.

Koji

-- 
http://www.rondhuit.com/en/


Mime
View raw message