manifoldcf-user mailing list archives

From Kamil Żyta <kamil.z...@pwr.edu.pl>
Subject Re: ElastiSearch missing doc
Date Tue, 16 Dec 2014 10:34:26 GMT
Thanks, Karl, but now I have a new issue:

FATAL 2014-12-16 11:12:58,496 (Worker thread '47') - Error tossed: Could not initialize class org.apache.commons.compress.archivers.sevenz.Coders
java.lang.NoClassDefFoundError: Could not initialize class org.apache.commons.compress.archivers.sevenz.Coders
        at org.apache.commons.compress.archivers.sevenz.SevenZFile.readEncodedHeader(SevenZFile.java:279)
        at org.apache.commons.compress.archivers.sevenz.SevenZFile.readHeaders(SevenZFile.java:191)
        at org.apache.commons.compress.archivers.sevenz.SevenZFile.<init>(SevenZFile.java:95)
        at org.apache.commons.compress.archivers.sevenz.SevenZFile.<init>(SevenZFile.java:117)
        at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:130)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:121)
        at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:230)
        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3257)
        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3108)
        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2739)
        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:792)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1610)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1558)
        at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:911)
        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:383)
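
A "Could not initialize class" NoClassDefFoundError generally means the class's static initializer already failed once, often because something it depends on is not on the classpath. Below is a minimal sketch for checking which classes the JVM can actually see. The class name ClasspathCheck and the helper isLoadable are hypothetical, and the guess that the XZ for Java library (org.tukaani.xz) is the missing piece is an assumption, not something confirmed in this thread.

```java
// Hypothetical diagnostic helper, not part of ManifoldCF or Tika.
final class ClasspathCheck {

    // Returns true if the named class can be located by this class loader.
    // initialize=false: we only test visibility and do not run static initializers.
    static boolean isLoadable(String className) {
        try {
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] names = {
            // Taken from the stack trace above.
            "org.apache.commons.compress.archivers.sevenz.SevenZFile",
            // Assumption: the 7z support relies on XZ for Java for LZMA decoding.
            "org.tukaani.xz.LZMAInputStream"
        };
        for (String name : names) {
            System.out.println(name + " -> " + (isLoadable(name) ? "found" : "MISSING"));
        }
    }
}
```

To be meaningful, this would have to run with the same classpath as the MCF agents process (the jars in its lib directories); a class reported as MISSING points at a jar that needs to be added.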

Another question: I use Solr 4.10 with Tika 1.5, but MCF 1.8 has Tika 1.6. How does this
affect document parsing?

K

On Mon, Dec 15, 2014 at 08:45:31AM -0500, Karl Wright wrote:
> If you changed this file, you would need to rerun initialize.sh in order to
> register the connector.
> 
> Karl
> 
> 
> On Mon, Dec 15, 2014 at 8:42 AM, Kamil Żyta <kamil.zyta@pwr.edu.pl> wrote:
> >
> > the same as connectors.xml:
> > (...)
> > <repositoryconnector name="Windows shares"
> > class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/>
> > (...)
> >
> > K
> >
> > On Mon, Dec 15, 2014 at 08:39:07AM -0500, Karl Wright wrote:
> > > Hi Kamil,
> > >
> > > What does connectors-proprietary.xml say about the jcifs connector?
> > >
> > > Karl
> > >
> > >
> > > > On Mon, Dec 15, 2014 at 8:35 AM, Kamil Żyta <kamil.zyta@pwr.edu.pl> wrote:
> > > >
> > > > Right, thx. Another problem:
> > > > >
> > > > org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector(uninstalled)
> > > >
> > > > properties.xml:
> > > > <libdir path="../connector-lib-proprietary"/>
> > > >
> > > > > cat ../connectors.xml
> > > > <repositoryconnector name="Windows shares"
> > > > class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/>
> > > >
> > > > > ls ../connector-lib-proprietary
> > > > jcifs.jar
> > > >
> > > > I think I checked/restarted everything.
> > > >
> > > > K
> > > >
> > > > On Mon, Dec 15, 2014 at 08:00:12AM -0500, Karl Wright wrote:
> > > > > You have to run ./initialize.sh on the MCF 1.8 codebase for the upgrade to
> > > > > take place.
> > > > >
> > > > > Karl
> > > > >
> > > > >
> > > > > On Mon, Dec 15, 2014 at 7:43 AM, Kamil Żyta <kamil.zyta@pwr.edu.pl> wrote:
> > > > > >
> > > > > > With release-1.8-branch the problem is the same.
> > > > > >
> > > > > > K
> > > > > >
> > > > > > On Mon, Dec 15, 2014 at 06:47:12AM -0500, Karl Wright wrote:
> > > > > > > Hi Kamil,
> > > > > > >
> > > > > > > You cannot upgrade to trunk from 1.x.
> > > > > > >
> > > > > > > Try upgrading to branches/release-1.8-branch.
> > > > > > >
> > > > > > > Karl
> > > > > > >
> > > > > > >
> > > > > > > On Mon, Dec 15, 2014 at 3:39 AM, Kamil Żyta <kamil.zyta@pwr.edu.pl> wrote:
> > > > > > > >
> > > > > > > > Hi,
> > > > > > > > > after upgrading to trunk I get 'Database exception: SQLException doing
> > > > > > > > > query (42703): ERROR: column "needpriority" does not exist'.
> > > > > > > > > How can I upgrade db schema? I tried ./initialize.sh without success.
> > > > > > > > K
> > > > > > > >
> > > > > > > > > On Fri, Dec 12, 2014 at 10:40:39AM -0500, Karl Wright wrote:
> > > > > > > > > Ok, committed a fix. CONNECTORS-1121.
> > > > > > > > >
> > > > > > > > > Karl
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > > On Fri, Dec 12, 2014 at 10:32 AM, Karl Wright <daddywri@gmail.com> wrote:
> > > > > > > > > >
> > > > > > > > > > > Ah, thanks, this is due to changes I made yesterday.
> > > > > > > > > >
> > > > > > > > > > Hold on.
> > > > > > > > > > Karl
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > On Fri, Dec 12, 2014 at 10:12 AM, Kamil Żyta <kamil.zyta@pwr.edu.pl> wrote:
> > > > > > > > > >>
> > > > > > > > > > >> On Fri, Dec 12, 2014 at 09:55:41AM -0500, Karl Wright wrote:
> > > > > > > > > > >> > I've created CONNECTORS-1120 for this fix.  I should have something to try
> > > > > > > > > > >> > shortly.
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > > >> I can't build mcf from source:
> > > > > > > > > >> BUILD FAILED
> > > > > > > > > > >> /opt/mcf-trunk/build.xml:1438: Can't get
> > > > > > > > > > >> https://www.apache.org/dist/manifoldcf/apache-manifoldcf-elasticsearch-plugin-2.0-bin.zip
> > > > > > > > > > >> to
> > > > > > > > > > >> /opt/mcf-trunk/build/download/apache-manifoldcf-elasticsearch-plugin-bin.zip
> > > > > > > > > >>
> > > > > > > > > >> K
> > > > > > > > > >>
> > > > > > > > > >
> > > > > > > >
> > > > > >
> > > >
> >
