lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Sokolov <>
Subject Re: What is correct use of HTMLStripCharFilter in Solr 3.1
Date Thu, 12 May 2011 19:52:42 GMT
It preserves the location of the terms in the original HTML document so 
that you can highlight terms in HTML.  This makes it possible (for 
instance) to display the entire document, with all the search terms 
highlighted, or (with some careful surgery) to display formatted HTML 
(bold, italic, etc) in your search results.


On 05/12/2011 03:42 PM, Jonathan Rochkind wrote:
> On 5/12/2011 2:55 PM, Ahmet Arslan wrote:
>>> I recently upgraded from Solr 1.3 to Solr 3.1 in order to
>>> take advantage of
>>> the HTMLStripCharFilter. But it isn't working as I
>>> expected.
>> You need to strip html tag before analysis phase. If you are using 
>> DIH, you can use stripHTML="true" transformer.
> Wait, then what's the HTMLStripCharFilter for?

View raw message