lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hausherr, Jens" <jens.haush...@logicacmg.com>
Subject AW: What is the best way to index xml data preserving the mark up?
Date Thu, 08 Nov 2007 09:03:02 GMT
Hi, 

if you just need to preserve the xml for storing you could simply wrap the xml markup in CDATA.
Splitting your structure beforehand and using dynamic fields might be a viable solution...

eg. 
<add>
  <doc>
    <field name="foo1">value 1</field>
    <field name="foo2">value 2</field>
    ....
    <field name="content"><![CDATA[<an xml stream with embedded source markup>]]></field>
  </doc>
</add>


 

Mit freundlichen Grüßen / Best Regards / Avec mes meilleures salutations

 
Jens Hausherr 
 
Dipl.-Wirtsch.Inf. (Univ.) 
Senior Consultant 
 
Tel: 040-27071-233
Fax: 040-27071-244
Fax: +49-(0)178-998866-097
Mobile: +49-(0)178-8866-097
 
mailto: mailto:jens.hausherr@logicacmg.com <mailto:jens.hausherr@unilog.logicacmg.com>

 
Unilog Avinci - a LogicaCMG company
Am Sandtorkai 72
D-20457 Hamburg
http://www.unilog.de <http://www.unilog.de/> 
 
Unilog Avinci GmbH
Zettachring 4, 70567 Stuttgart
Amtsgericht Stuttgart HRB 721369
Geschäftsführer: Torsten Straß / Eric Guyot / Rudolf Kuhn / Olaf Scholz
 


This e-mail and any attachment is for authorised use by the intended recipient(s) only. It
may contain proprietary material, confidential information and/or be subject to legal privilege.
It should not be copied, disclosed to, retained or used by, any other party. If you are not
an intended recipient then please promptly delete this e-mail and any attachment and all copies
and inform the sender. Thank you.

Mime
  • Unnamed multipart/mixed (inline, None, 0 bytes)
View raw message