lucene-solr-user mailing list archives

From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: dataimporthandler: nested query is called multiple times
Date Wed, 20 Mar 2013 11:07:35 GMT
There was something like this on Stack Overflow:
http://stackoverflow.com/questions/15164166/solr-filelistentityprocessor-is-executing-sub-entities-multiple-times

Upgrading Solr helped partially, but the conclusion was not fully
satisfactory.
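
One thing that may be worth trying (this is a sketch, not something taken from the Stack Overflow thread): DIH's CachedSqlEntityProcessor runs the sub-entity query once and then looks rows up in an in-memory cache, instead of issuing one select per parent row. The table and column names below are copied from your config. Whether this changes the repeated entries in the debug output is an assumption I have not verified, and the cache holds the whole doc_properties result set in memory.

```xml
<document name="doc">
  <entity name="item" query="select DOCID from documents">
    <field column="DOCID" name="index_id" />
    <!-- Assumption: doc_properties is small enough to cache in memory.
         The query has no per-row filter; rows are matched via "where". -->
    <entity name="attributes"
            processor="CachedSqlEntityProcessor"
            query="select * from doc_properties"
            where="DOCID=item.DOCID">
      <!-- do something -->
    </entity>
  </entity>
</document>
```

If the properties table is large, the original per-row query is probably the safer choice, since it trades extra round trips for lower memory use.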

Regards,
    Alex.

Personal blog: http://blog.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


On Wed, Mar 20, 2013 at 6:48 AM, patrick <preichar@hotmail.com> wrote:

> hi,
>
> the dataimport-config-file i'm using with solr3.6.2 uses a nested select
> statement. the first query retrieves
> the documents while the nested one retrieves the corresponding properties.
>
> <dataConfig>
>   <!--Data source to connect to database-->
>   <dataSource name="db" driver="oracle.jdbc.driver.OracleDriver"
>        url="jdbc:oracle:thin:@alexis:1521:FMSPRF01" user="?????"
>        password="?????"/>
>
>   <document name="doc">
>     <entity name="item" query="select DOCID from documents">
>       <field column="DOCID" name="index_id" />
>       <entity name="attributes"
>          query="select * from doc_properties where DOCID='${item.DOCID}'">
>         <!-- do something -->
>       </entity>
>     </entity>
>   </document>
> </dataConfig>
>
> when running the dataimporthandler with the verbose/debug flag turned on,
> the output lists the same query more than once for 'entity:attributes',
> and the list grows with each 'entity:item' (two entries for document#1,
> four for document#2):
>
> ....
> <arr name="documents"/>
> <lst name="verbose-output">
>   <lst name="entity:item">
>     <lst name="document#1">
>       <str name="query">select DOCID from documents</str>
>       <str name="time-taken">0:0:0.50</str>
>       <str>----------- row #1-------------</str>
>       <str name="DOCID">000emnslnbh88hdd</str>
>       <str>---------------------------------------------</str>
>       <lst name="entity:attributes">
>         <str name="query">select * from doc_properties where DOCID='000emnslnbh88hdd'</str>
>         <str name="query">select * from doc_properties where DOCID='000emnslnbh88hdd'</str>
>         <str name="time-taken">0:0:0.37</str>
>         <str name="time-taken">0:0:0.37</str>
>         <str>----------- row #1-------------</str>
>         <str name="VALUE">I</str>
>         <str name="PROPERTY_KEY">message_direction</str>
>         <str>---------------------------------------------</str>
>         <str>----------- row #2-------------</str>
>         <str name="VALUE">heb@test</str>
>         <str name="PROPERTY_KEY">message_event_source</str>
>         ....
>
>     <lst name="document#2">
>       <str>----------- row #1-------------</str>
>       <str name="DOCID">000hsjunnbh7weq8</str>
>       <str>---------------------------------------------</str>
>       <lst name="entity:attributes">
>         <str name="query">select * from doc_properties where DOCID='000hsjunnbh7weq8'</str>
>         <str name="query">select * from doc_properties where DOCID='000hsjunnbh7weq8'</str>
>         <str name="query">select * from doc_properties where DOCID='000hsjunnbh7weq8'</str>
>         <str name="query">select * from doc_properties where DOCID='000hsjunnbh7weq8'</str>
>         <str name="time-taken">0:0:0.1</str>
>         <str name="time-taken">0:0:0.1</str>
>         <str name="time-taken">0:0:0.1</str>
>         <str name="time-taken">0:0:0.1</str>
>         <str>----------- row #1-------------</str>
>         <str name="VALUE">I</str>
>         <str name="PROPERTY_KEY">message_direction</str>
>         <str>---------------------------------------------</str>
>         <str>----------- row #2-------------</str>
>         <str name="VALUE">heb@test</str>
>         <str name="PROPERTY_KEY">message_event_source</str>
> ...
>
> i was wondering if there's something wrong with my configuration - thank
> you for clarifying,
> patrick
>
