lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Péter Király <kirun...@gmail.com>
Subject Re: XPath with ExtractingRequestHandler
Date Thu, 15 Dec 2011 08:50:57 GMT
Hi,

maybe I am wrong, but the // should be at the beggining of the
expression, like
//xhtml:div[@class='bibliographicData']/descendant:node(),
or if you want to search the div inside body, you have to use descendant like
/xhtml:html/xhtml:body/descendant::xhtml:div[@class='bibliographicData']/descendant:node()

Péter

2011/12/14 Michael Kelleher <mj.kelleher@gmail.com>:
> I want to restrict the HTML that is returned by Tika to basically:
>
>
>  /xhtml:html/xhtml:body//xhtml:div[@class='bibliographicData']/descendant:node()
>
>
> and it seems that the XPath class being used does not support the '//'
> syntax.
>
> Is there anyway to configure Tika to use a different XPath evaluation class?
>
>



-- 
Péter Király
eXtensible Catalog
http://eXtensibleCatalog.org
http://drupal.org/project/xc

Mime
View raw message