lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zheng Lin Edwin Yeo <edwinye...@gmail.com>
Subject Re: Unable to index rich-text documents in Solr Cloud
Date Fri, 20 Mar 2015 02:19:32 GMT
Hi Shawn,

Yes, I'm using the /update/extract handler. I'm not sure about the
shards.qt parameter too.

Regards,
Edwin


On 19 March 2015 at 13:18, Shawn Heisey <apache@elyograg.org> wrote:

> On 3/18/2015 1:22 AM, Zheng Lin Edwin Yeo wrote:
> > I'm having some issues with indexing rich-text documents from the Solr
> > Cloud. When I tried to index a pdf or word document, I get the following
> > error:
> >
> >
> > org.apache.solr.common.SolrException: Bad Request
> >
> >
> >
> > request:
> http://192.168.2.2:8984/solr/logmill/update?update.distrib=TOLEADER&distrib.from=http%3A%2F%2F192.168.2.2%3A8983%2Fsolr%2Flogmill%2F&wt=javabin&version=2
>
> This request appears to be one of the requests that SolrCloud makes
> between its different nodes, but it is using the /update handler.  I
> assume that when you sent the request, you sent it to the
> /update/extract handler because it's a rich text document?  The /update
> handler can't do rich text documents, it's only for documents in json,
> xml, csv, javabin, etc that are formatted in specific ways.
>
> One thing I'm wondering is whether the Extracting handler requires a
> shards.qt parameter, also set to /update/extract, to work right with
> SolrCloud.  I have never used that handler myself, so I've got no idea
> what is required to make it work right.
>
> Thanks,
> Shawn
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message