Tim, you saved my day ;) now vsdx files were indexed successfully. Thank you very much!!! summary: as a workaround I have in solr-6.4.0\contrib\extraction\lib: 1. ooxml-schemas-1.3.jar instead of poi-ooxml-schemas-3.15.jar 2. curvesapi-1.03.jar So, now I'm waiting when this will be implemented in a official version of solr/tika. Regards, Gytis On Mon, Feb 6, 2017 at 4:16 PM, Allison, Timothy B. wrote: > Argh. Looks like we need to add curvesapi (BSD 3-clause) to Solr. > > For now, add this jar: > https://mvnrepository.com/artifact/com.github.virtuald/curvesapi/1.03 > > See also [1] > > [1] http://apache-poi.1045710.n5.nabble.com/support-for- > reading-Microsoft-Visio-2013-vsdx-format-td5721500.html > > -----Original Message----- > From: Gytis Mikuciunas [mailto:gytmkc@gmail.com] > Sent: Monday, February 6, 2017 8:19 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr 6.4. Can't index MS Visio vsdx files > > sad, but didn't help. > > what I did: > > 1. stopped solr: bin\solr stop -p 80 > 2. removed poi-ooxml-schemas-3.15.jar from contrib\extraction\lib 3. add > ooxml-schemas-1.3.jar to contrib\extraction\lib 4. restarted solr: bin\solr > start -p 80 -m 4g 5. tried again to parse vsdx file: > > java -Dauto -Dc=db_new02 -Dport=80 -Dfiletypes=vsd,vsdx -Drecursive=yes > -jar example/exampledocs/post.jar "I:\Tools" > > SimplePostTool version 5.0.0 > Posting files to [base] url http://localhost:80/solr/db_new02/update... > Entering auto mode. File endings considered are vsd,vsdx Entering > recursive mode, max depth=999, delay=0s Indexing directory I:\Tools (1 > files, depth=0) POSTing file span ports.vsdx (application/octet-stream) to > [base]/extract > SimplePostTool: WARNING: Solr returned an error #500 (Server Error) for > url: > http://localhost:80/solr/db_new02/update/extract?resource. > name=I%3A%5CTools%5Cspan+ports.vsdx > SimplePostTool: WARNING: Response: http-equiv="Content-Type" content="text/html;charset=utf-8"/> > Error 500 Server Error > >

HTTP ERROR 500

>

Problem accessing /solr/db_new02/update/extract. Reason: >

    Server Error

Caused > by:

java.lang.NoClassDefFoundError: com/graphbuilder/curve/Point
>         at java.lang.Class.getDeclaredConstructors0(Native Method)
>         at java.lang.Class.privateGetDeclaredConstructors(Unknown Source)
>         at java.lang.Class.getConstructor0(Unknown Source)
>         at java.lang.Class.getDeclaredConstructor(Unknown Source)
>         at org.apache.poi.xdgf.util.ObjectFactory.put(
> ObjectFactory.java:34)
>         at
> org.apache.poi.xdgf.usermodel.section.geometry.
> GeometryRowFactory.<clinit>(GeometryRowFactory.java:39)
>         at
> org.apache.poi.xdgf.usermodel.section.GeometrySection.<
> init>(GeometrySection.java:55)
>         at
> org.apache.poi.xdgf.usermodel.XDGFSheet.<init>(XDGFSheet.java:77)
>         at
> org.apache.poi.xdgf.usermodel.XDGFShape.<init>(XDGFShape.java:113)
>         at
> org.apache.poi.xdgf.usermodel.XDGFShape.<init>(XDGFShape.java:107)
>         at
> org.apache.poi.xdgf.usermodel.XDGFBaseContents.onDocumentRead(
> XDGFBaseContents.java:82)
>         at
> org.apache.poi.xdgf.usermodel.XDGFMasterContents.onDocumentRead(
> XDGFMasterContents.java:66)
>         at
> org.apache.poi.xdgf.usermodel.XDGFMasters.onDocumentRead(
> XDGFMasters.java:101)
>         at
> org.apache.poi.xdgf.usermodel.XmlVisioDocument.onDocumentRead(
> XmlVisioDocument.java:106)
>         at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:190)
>         at
> org.apache.poi.xdgf.usermodel.XmlVisioDocument.<init>(
> XmlVisioDocument.java:79)
>         at
> org.apache.poi.xdgf.extractor.XDGFVisioExtractor.<init&
> gt;(XDGFVisioExtractor.java:41)
>         at
> org.apache.poi.extractor.ExtractorFactory.createExtractor(
> ExtractorFactory.java:207)
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(
> OOXMLExtractorFactory.java:86)
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParser.
> parse(OOXMLParser.java:87)
>         at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>         at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(
> ExtractingDocumentLoader.java:228)
>         at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(
> ContentStreamHandlerBase.java:68)
>         at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(
> RequestHandlerBase.java:166)
>         at org.apache.solr.core.SolrCore.execute(SolrCore.java:2306)
>         at
> org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:658)
>         at org.apache.solr.servlet.HttpSolrCall.call(
> HttpSolrCall.java:464)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(
> SolrDispatchFilter.java:345)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(
> SolrDispatchFilter.java:296)
>         at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.
> doFilter(ServletHandler.java:1691)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(
> ScopedHandler.java:143)
>         at
> org.eclipse.jetty.security.SecurityHandler.handle(
> SecurityHandler.java:524)
>         at
> org.eclipse.jetty.server.session.SessionHandler.
> doHandle(SessionHandler.java:226)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.
> doHandle(ContextHandler.java:1180)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)
>         at
> org.eclipse.jetty.server.session.SessionHandler.
> doScope(SessionHandler.java:185)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.
> doScope(ContextHandler.java:1112)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(
> ScopedHandler.java:141)
>         at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(
> ContextHandlerCollection.java:213)
>         at
> org.eclipse.jetty.server.handler.HandlerCollection.
> handle(HandlerCollection.java:119)
>         at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(
> HandlerWrapper.java:134)
>         at org.eclipse.jetty.server.Server.handle(Server.java:534)
>         at org.eclipse.jetty.server.HttpChannel.handle(
> HttpChannel.java:320)
>         at
> org.eclipse.jetty.server.HttpConnection.onFillable(
> HttpConnection.java:251)
>         at
> org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(
> AbstractConnection.java:273)
>         at org.eclipse.jetty.io.FillInterest.fillable(
> FillInterest.java:95)
>         at
> org.eclipse.jetty.io.SelectChannelEndPoint$2.run(
> SelectChannelEndPoint.java:93)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.
> executeProduceConsume(ExecuteProduceConsume.java:303)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.
> produceConsume(ExecuteProduceConsume.java:148)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(
> ExecuteProduceConsume.java:136)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(
> QueuedThreadPool.java:671)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(
> QueuedThreadPool.java:589)
>         at java.lang.Thread.run(Unknown Source) Caused by: java.lang.ClassNotFoundException:
> com.graphbuilder.curve.Point
>         at java.net.URLClassLoader.findClass(Unknown Source)
>         at java.lang.ClassLoader.loadClass(Unknown Source)
>         at java.net.FactoryURLClassLoader.loadClass(Unknown Source)
>         at java.lang.ClassLoader.loadClass(Unknown Source)
>         ... 56 more
> 
>

Caused by:

java.lang.ClassNotFoundException:
> com.graphbuilder.curve.Point
>         at java.net.URLClassLoader.findClass(Unknown Source)
>         at java.lang.ClassLoader.loadClass(Unknown Source)
>         at java.net.FactoryURLClassLoader.loadClass(Unknown Source)
>         at java.lang.ClassLoader.loadClass(Unknown Source)
>         at java.lang.Class.getDeclaredConstructors0(Native Method)
>         at java.lang.Class.privateGetDeclaredConstructors(Unknown Source)
>         at java.lang.Class.getConstructor0(Unknown Source)
>         at java.lang.Class.getDeclaredConstructor(Unknown Source)
>         at org.apache.poi.xdgf.util.ObjectFactory.put(
> ObjectFactory.java:34)
>         at
> org.apache.poi.xdgf.usermodel.section.geometry.
> GeometryRowFactory.<clinit>(GeometryRowFactory.java:39)
>         at
> org.apache.poi.xdgf.usermodel.section.GeometrySection.<
> init>(GeometrySection.java:55)
>         at
> org.apache.poi.xdgf.usermodel.XDGFSheet.<init>(XDGFSheet.java:77)
>         at
> org.apache.poi.xdgf.usermodel.XDGFShape.<init>(XDGFShape.java:113)
>         at
> org.apache.poi.xdgf.usermodel.XDGFShape.<init>(XDGFShape.java:107)
>         at
> org.apache.poi.xdgf.usermodel.XDGFBaseContents.onDocumentRead(
> XDGFBaseContents.java:82)
>         at
> org.apache.poi.xdgf.usermodel.XDGFMasterContents.onDocumentRead(
> XDGFMasterContents.java:66)
>         at
> org.apache.poi.xdgf.usermodel.XDGFMasters.onDocumentRead(
> XDGFMasters.java:101)
>         at
> org.apache.poi.xdgf.usermodel.XmlVisioDocument.onDocumentRead(
> XmlVisioDocument.java:106)
>         at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:190)
>         at
> org.apache.poi.xdgf.usermodel.XmlVisioDocument.<init>(
> XmlVisioDocument.java:79)
>         at
> org.apache.poi.xdgf.extractor.XDGFVisioExtractor.<init&
> gt;(XDGFVisioExtractor.java:41)
>         at
> org.apache.poi.extractor.ExtractorFactory.createExtractor(
> ExtractorFactory.java:207)
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(
> OOXMLExtractorFactory.java:86)
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParser.
> parse(OOXMLParser.java:87)
>         at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>         at
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(
> ExtractingDocumentLoader.java:228)
>         at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(
> ContentStreamHandlerBase.java:68)
>         at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(
> RequestHandlerBase.java:166)
>         at org.apache.solr.core.SolrCore.execute(SolrCore.java:2306)
>         at
> org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:658)
>         at org.apache.solr.servlet.HttpSolrCall.call(
> HttpSolrCall.java:464)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(
> SolrDispatchFilter.java:345)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(
> SolrDispatchFilter.java:296)
>         at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.
> doFilter(ServletHandler.java:1691)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(
> ScopedHandler.java:143)
>         at
> org.eclipse.jetty.security.SecurityHandler.handle(
> SecurityHandler.java:524)
>         at
> org.eclipse.jetty.server.session.SessionHandler.
> doHandle(SessionHandler.java:226)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.
> doHandle(ContextHandler.java:1180)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)
>         at
> org.eclipse.jetty.server.session.SessionHandler.
> doScope(SessionHandler.java:185)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.
> doScope(ContextHandler.java:1112)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(
> ScopedHandler.java:141)
>         at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(
> ContextHandlerCollection.java:213)
>         at
> org.eclipse.jetty.server.handler.HandlerCollection.
> handle(HandlerCollection.java:119)
>         at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(
> HandlerWrapper.java:134)
>         at org.eclipse.jetty.server.Server.handle(Server.java:534)
>         at org.eclipse.jetty.server.HttpChannel.handle(
> HttpChannel.java:320)
>         at
> org.eclipse.jetty.server.HttpConnection.onFillable(
> HttpConnection.java:251)
>         at
> org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(
> AbstractConnection.java:273)
>         at org.eclipse.jetty.io.FillInterest.fillable(
> FillInterest.java:95)
>         at
> org.eclipse.jetty.io.SelectChannelEndPoint$2.run(
> SelectChannelEndPoint.java:93)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.
> executeProduceConsume(ExecuteProduceConsume.java:303)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.
> produceConsume(ExecuteProduceConsume.java:148)
>         at
> org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(
> ExecuteProduceConsume.java:136)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(
> QueuedThreadPool.java:671)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(
> QueuedThreadPool.java:589)
>         at java.lang.Thread.run(Unknown Source) 
> > > > >