lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Adam Foltzer <acfolt...@gmail.com>
Subject Re: Problems with DIH XPath flatten
Date Tue, 06 Oct 2009 21:13:21 GMT
Hi Shalin,

Good question; sorry I forgot it in the initial post. I have tried with both
a nightly build from earlier this month (Oct 2 I believe) as well as a build
from the trunk as of yesterday afternoon.

Adam

On Tue, Oct 6, 2009 at 5:04 PM, Shalin Shekhar Mangar <
shalinmangar@gmail.com> wrote:

> On Tue, Oct 6, 2009 at 9:29 PM, Adam Foltzer <acfoltzer@gmail.com> wrote:
>
> > Hi all,
> >
> > I'm trying to set up DataImportHandler to index some XML documents
> > available
> > over web services. The XML includes both content and metadata, so for the
> > indexable content, I'm trying to just index everything under the content
> > tag:
> >
> > <entity dataSource="kbws" name="kbxml" pk="title"
> >        url="resturl" processor="XPathEntityProcessor"
> >        forEach="/document" transformer="HTMLStripTransformer"
> > flatten="true">
> > <field column="content" name="content" xpath="/document/kbml/body"
> > flatten="true" stripHTML="true" />
> > <field column="title" name="title" xpath="/document/kbml/kbq" />
> > </entity>
> >
> > The result of this is that the title field gets populated and indexed
> > (there
> > are no child nodes of /document/kbml/kbq), but content does not get
> indexed
> > at all. Since /document/kbml/body has many children, I expected that
> > flatten="true" would store all of the body text in the field. Instead, it
> > stores nothing at all. I've tried this with many combinations of
> > transformers and flatten options, and the result is the same each time.
> >
> >
> Which Solr version are you using? The flatten attribute was introduced
> after
> 1.3 released.
>
> --
> Regards,
> Shalin Shekhar Mangar.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message