lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From okayndc <>
Subject escaping HTML tags within XML file
Date Sun, 25 Sep 2011 13:00:22 GMT

Was wondering if it is necessary to escape HTML tags within an XML file for
indexing?  If so, seems like a large XML files with tons of HTML tags could
get really messy (using CDATA).
Has this been your experience?  Do you escape the HTML tags? If so, what
technique do you use? Or do you leave the HTML tags in place without
escaping them?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message