manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Fred Schmitt" <fredschmit...@web.de>
Subject Re: MCF: XML parsing Error
Date Tue, 12 Oct 2010 13:07:39 GMT
It seems like the parsing from http to xml on the way to Solr did not work
and thus the error was thrown. After a few unsuccesful attempts to fix that
I reinstalled Solr. It works fine now, i guess. Thanks for your help&hints.

best regards,
Fred

-----Ursprüngliche Nachricht-----
Von: "Karl Wright" <daddywri@gmail.com>
Gesendet: 12.10.2010 11:06:25
An: connectors-user@incubator.apache.org
Betreff: Re: MCF: XML parsing Error

>It looks like you are getting back HTML from whatever it is that you
>pointed your solr connection at, rather than the XML that the solr
>connector is expecting.  Specifically, the Solr commit operation is
>failing.  Can you get any feedback from the Solr instance?
>
>Karl
>
>On Tue, Oct 12, 2010 at 2:46 AM, Fred Schmitt <fredschmitt83@web.de> wrote:
>> Hi all,
>> I am having a problem while trying to index/crawl data. I configured a job with a
Solr output connection and a web connection,
>> but after i use the "agentRun"-command and start the job, a few exceptions are thrown.
>>
>>
>> On the console im getting:
>>
>> [Fatal Error] :112:120: The element type "HR" must be terminated by the matching
end-tag "".
>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: XML parsing error: The
element type "HR" must be terminated by th
>>  matching end-tag "".
>>  at org.apache.manifoldcf.core.common.XMLDoc.init(XMLDoc.java:369)
>>  at org.apache.manifoldcf.core.common.XMLDoc.(XMLDoc.java:317)
>>  at org.apache.manifoldcf.agents.output.solr.HttpPoster.getResponse(HttpPoster.java:608)
>>  at org.apache.manifoldcf.agents.output.solr.HttpPoster$CommitThread.run(HttpPoster.java:1647)
>> Caused by: org.xml.sax.SAXParseException: The element type "HR" must be terminated
by the matching end-tag "".
>>  at org.apache.xerces.parsers.DOMParser.parse(Unknown Source)
>>  at org.apache.xerces.jaxp.DocumentBuilderImpl.parse(Unknown Source)
>>  at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
>>  at org.apache.manifoldcf.core.common.XMLDoc.init(XMLDoc.java:365)
>>  ... 3 more
>> Shutting down...
>>
>> and in the log-file i am getting this error:
>>
>> [2010-10-11 15:32:02,515]ERROR Error connecting to update request API: 'HTTP/1.1
500 Internal Server Error
>> '
>> org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>>  Error connecting to update request API: 'HTTP/1.1 500 Internal Server
>> Error
>> '
>> at org.apache.manifoldcf.agents.output.solr.HttpPoster$CommitThread.run(HttpPoster.java:1665)
>>
>>
>> It would be great if you could help me.
>>
>> best regards,
>> Fred
>> ___________________________________________________________
>> WEB.DE DSL Doppel-Flat ab 19,99 €/mtl.! Jetzt auch mit
>> gratis Notebook-Flat! http://produkte.web.de/go/DSL_Doppel_Flatrate/2
>>
___________________________________________________________
GRATIS! Movie-FLAT mit über 300 Videos. 
Jetzt freischalten unter http://movieflat.web.de

Mime
View raw message