lucene-solr-user mailing list archives

From Shalin Shekhar Mangar <shalinman...@gmail.com>
Subject Re: Problems with DIH XPath flatten
Date Tue, 06 Oct 2009 21:04:33 GMT
On Tue, Oct 6, 2009 at 9:29 PM, Adam Foltzer <acfoltzer@gmail.com> wrote:

> Hi all,
>
> I'm trying to set up DataImportHandler to index some XML documents
> available over web services. The XML includes both content and metadata,
> so for the indexable content, I'm trying to just index everything under
> the content tag:
>
> <entity dataSource="kbws" name="kbxml" pk="title"
>         url="resturl" processor="XPathEntityProcessor"
>         forEach="/document" transformer="HTMLStripTransformer"
>         flatten="true">
>   <field column="content" name="content" xpath="/document/kbml/body"
>          flatten="true" stripHTML="true" />
>   <field column="title" name="title" xpath="/document/kbml/kbq" />
> </entity>
>
> The result of this is that the title field gets populated and indexed
> (there are no child nodes of /document/kbml/kbq), but content does not
> get indexed at all. Since /document/kbml/body has many children, I
> expected that flatten="true" would store all of the body text in the
> field. Instead, it stores nothing at all. I've tried this with many
> combinations of transformers and flatten options, and the result is the
> same each time.
>
>
Which Solr version are you using? The flatten attribute was introduced after
1.3 was released, so it is not available in 1.3 itself.
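
If you are on a build newer than 1.3, a config along these lines may be worth
trying. This is only a sketch: it assumes flatten is read from the field
element (so the entity-level flatten is dropped here), and it reuses the
dataSource, url and xpath values from your message unchanged.

```xml
<!-- Sketch only: assumes a Solr build newer than 1.3 that supports flatten. -->
<entity dataSource="kbws" name="kbxml" pk="title"
        url="resturl" processor="XPathEntityProcessor"
        forEach="/document" transformer="HTMLStripTransformer">
  <!-- flatten="true" asks for the text under all child tags of body in one field;
       stripHTML="true" is the HTMLStripTransformer flag for this field. -->
  <field column="content" name="content" xpath="/document/kbml/body"
         flatten="true" stripHTML="true" />
  <field column="title" name="title" xpath="/document/kbml/kbq" />
</entity>
```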

-- 
Regards,
Shalin Shekhar Mangar.
