lucene-solr-user mailing list archives

From Noble Paul നോബിള്‍ नोब्ळ् <noble.p...@corp.aol.com>
Subject Re: How to import multiple RSS-feeds with DIH
Date Mon, 09 Nov 2009 08:46:09 GMT
On Mon, Nov 9, 2009 at 1:26 PM, Michael Lackhoff <michael@lackhoff.de> wrote:
> [A new thread for this particular problem]
>
> On 09.11.2009 08:44 Noble Paul നോബിള്‍ नोब्ळ् wrote:
>
>> The tried and tested strategy is to post the question in this mailing
>> list w/ your data-config.xml.
>
> See my data-config.xml below. The first is the usual slashdot example
> with my 'id' addition, the second a very simple additional feed. The
> second example works if I delete the slashdot-feed but as I said I would
> like to have them both.
When you say the second example does not work, what exactly happens?
Is there an exception? (If yes, please post the stack trace.)
>
> -Michael
>
> <dataConfig>
>  <dataSource type="HttpDataSource" />
>    <document>
>      <entity name="slashdot"
>        pk="link"
>        url="http://rss.slashdot.org/Slashdot/slashdot"
>        processor="XPathEntityProcessor"
>        forEach="/RDF/channel | /RDF/item"
>        transformer="TemplateTransformer,DateFormatTransformer">
>
>        <field column="source"       xpath="/RDF/channel/title" commonField="true" />
>        <field column="source-link"  xpath="/RDF/channel/link" commonField="true" />
>        <field column="subject"      xpath="/RDF/channel/subject" commonField="true" />
>
>        <field column="title"        xpath="/RDF/item/title" />
>        <field column="link"         xpath="/RDF/item/link" />
>        <field column="id"           template="${slashdot.link}" />
>        <field column="description"  xpath="/RDF/item/description" />
>        <field column="creator"      xpath="/RDF/item/creator" />
>        <field column="item-subject" xpath="/RDF/item/subject" />
>
>        <field column="slash-department" xpath="/RDF/item/department" />
>        <field column="slash-section"    xpath="/RDF/item/section" />
>        <field column="slash-comments"   xpath="/RDF/item/comments" />
>        <field column="date" xpath="/RDF/item/date" dateTimeFormat="yyyy-MM-dd'T'hh:mm:ss" />
>      </entity>
>      <entity name="heise"
>        pk="link"
>        url="http://www.heise.de/newsticker/heise.rdf"
>        processor="XPathEntityProcessor"
>        forEach="/RDF/channel | /RDF/item"
>        transformer="TemplateTransformer">
>        <field column="source"       xpath="/RDF/channel/title" commonField="true" />
>        <field column="source-link"  xpath="/RDF/channel/link" commonField="true" />
>
>        <field column="title"        xpath="/RDF/item/title" />
>        <field column="link"         xpath="/RDF/item/link" />
>        <field column="id"           template="${heise.link}" />
>      </entity>
>    </document>
> </dataConfig>
>



-- 
-----------------------------------------------------
Noble Paul | Principal Engineer| AOL | http://aol.com
