lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sascha Szott <sz...@zib.de>
Subject Re: Hierarchical xml
Date Wed, 02 Dec 2009 09:11:35 GMT
Pooja,

have a look at Solr's DataImportHandler. XPathEntityProcessor [1] should 
suit your needs.

Best,
Sascha

[1] http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor

Pooja Verlani schrieb:
> Hi,
> I want to index an xml like following:
> 
> <officer>
> <name>John</name>
> <dob>1979-29-17T28:14:48Z</dob>
> <collegeGroup>
>         <college>
>                <name>ABC College</name>
>                <year>1998</year>
>          </college>
>          <college>
>                <name>PQRS College</name>
>                <year>2001</year>
>          </college>
>           <college>
>                <name>XYZ College</name>
>                <year>2003</year>
>          </college>
> </collegeGroup>
> </officer>
> 
> I am not able to judge how should be the schema like?
> Also, if I flatten such an xml and make collegename & year as multivalued
> like this:
> <college_name>ABC College, PQRS College, XYZ College</college_name>
> <college_year>1998,2001,2003</year>
> 
> In such a scenario I can't make a coorespondence between ABC college & year
> 1998.
> 
> In case someone has an efficient way out, do share.
> Thanks in anticipation.
> 
> Regards,
> Pooja
> 


Mime
View raw message