lucene-solr-user mailing list archives

From Grant Ingersoll
Subject Re: Using SolrJ with Tika
Date Wed, 02 Sep 2009 17:35:19 GMT
Hi Angel,

I'm looking into it.  Might need a new SolrRequest, but still playing  
around and will let you know...


On Sep 2, 2009, at 4:56 AM, Angel Ice wrote:

> Hi everybody.
> I hope this is the right place for questions; if not, sorry.
> I'm trying to index rich documents (PDF, MS docs, etc.) in Solr/Lucene.
> I have seen a few examples explaining how to use Tika to do this,
> but most of them use curl to send documents to Solr
> or an HTML POST with a file input.
> I'd like to do it entirely in Java.
> Is there a way to use SolrJ to index the documents with Solr's
> ExtractingRequestHandler, or at least to get the extracted
> XML back (with the extract.only option)?
> Many thanks.
> Laurent.
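
For illustration only, here is a minimal sketch of what the curl examples do, written in plain Java. It uses the JDK's own HTTP client (Java 11+) rather than SolrJ, so it is not the SolrJ answer the question asks for. The class and method names are hypothetical, and the handler path and parameter spelling are assumptions that depend on your solrconfig.xml and Solr version (the message above spells the option extract.only).

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical helper class; nothing here comes from SolrJ.
class ExtractSketch {

    // Builds the request URI: base URL + handler path + URL-encoded query parameters.
    // The handler path (e.g. "/update/extract") is an assumption about the Solr config.
    static URI buildExtractUri(String solrBase, String handlerPath, Map<String, String> params) {
        StringJoiner query = new StringJoiner("&");
        for (Map.Entry<String, String> e : params.entrySet()) {
            query.add(URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8)
                    + "=" + URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8));
        }
        String suffix = query.length() == 0 ? "" : "?" + query;
        return URI.create(solrBase + handlerPath + suffix);
    }

    // Sends the raw file bytes as the POST body and returns the response body as text.
    // Requires a running Solr instance at the given URI.
    static String postFile(URI uri, Path file, String contentType)
            throws IOException, InterruptedException {
        HttpRequest request = HttpRequest.newBuilder(uri)
                .header("Content-Type", contentType)
                .POST(HttpRequest.BodyPublishers.ofFile(file))
                .build();
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        return response.body();
    }
}
```

A call might look like postFile(buildExtractUri("http://localhost:8983/solr", "/update/extract", Map.of("extractOnly", "true")), Path.of("doc.pdf"), "application/pdf"), where the host, port, file name, and the extractOnly spelling are all placeholders to adjust for your setup.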

Grant Ingersoll

