lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zheng Lin Edwin Yeo <>
Subject Unable to index rich-text documents in Solr Cloud
Date Wed, 18 Mar 2015 07:22:51 GMT
Hi everyone,

I'm having some issues with indexing rich-text documents from the Solr
Cloud. When I tried to index a pdf or word document, I get the following

org.apache.solr.common.SolrException: Bad Request

	at org.apache.solr.client.solrj.impl.ConcurrentUpdateSolrClient$
	at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
	at java.util.concurrent.ThreadPoolExecutor$ Source)
	at Source)

I'm able to index .xml and .csv files in Solr Cloud with the same configuration.

I have setup Solr Cloud using the default zookeeper in Solr 5.0.0, and
I have 2 shards with the following details:

Prior to this, I'm already able to index rich-text documents without
the Solr Cloud, and I'm using the same solrconfig.xml and schema.xml,
so my ExtractRequestHandler is already defined.

Is there other settings required in order to index rich-text documents
in Solr Cloud?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message