manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From msaunier <msaun...@citya.com>
Subject RE: Out of memory, one file bug i think
Date Tue, 24 Jul 2018 13:12:37 GMT
With debug:

 

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28034ms for sessionid 0x100000050ae0049

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28034ms for sessionid 0x100000050ae0049,
closing socket connection and attempting reconnect

[Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27708ms for sessionid 0xff00000201970044

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27737ms for sessionid 0xff00000201970043

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27737ms for sessionid 0xff00000201970043,
closing socket connection and attempting reconnect

[Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28316ms for sessionid 0x100000050ae004b

[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28394ms for sessionid 0x2000000b80d0047

[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28394ms for sessionid 0x2000000b80d0047,
closing socket connection and attempting reconnect

[Thread-31532-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27708ms for sessionid 0xff00000201970044,
closing socket connection and attempting reconnect

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

agents process ran out of memory - shutting down

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 36805ms for sessionid 0x2000000b80d0046

[Thread-7538-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 36805ms for sessionid 0x2000000b80d0046,
closing socket connection and attempting reconnect

java.lang.OutOfMemoryError: GC overhead limit exceeded

        at java.lang.StringBuilder.toString(StringBuilder.java:407)

        at org.apache.manifoldcf.core.cachemanager.CacheManager.readSharedData(CacheManager.java:849)

        at org.apache.manifoldcf.core.cachemanager.CacheManager.hasExpired(CacheManager.java:483)

        at org.apache.manifoldcf.core.cachemanager.CacheManager.lookupObject(CacheManager.java:454)

        at org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:131)

        at org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:204)

        at org.apache.manifoldcf.core.database.DBInterfacePostgreSQL.performQuery(DBInterfacePostgreSQL.java:862)

        at org.apache.manifoldcf.core.database.BaseTable.performQuery(BaseTable.java:236)

        at org.apache.manifoldcf.crawler.jobs.Jobs.deletingJobsPresent(Jobs.java:3133)

        at org.apache.manifoldcf.crawler.jobs.JobManager.getNextDeletableDocuments(JobManager.java:1862)

        at org.apache.manifoldcf.crawler.system.DocumentDeleteStufferThread.run(DocumentDeleteStufferThread.java:108)

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

agents process ran out of memory - shutting down

[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27763ms for sessionid 0x100000050ae004a

[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 27763ms for sessionid 0x100000050ae004a,
closing socket connection and attempting reconnect

[zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-3-thread-7] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

[Thread-31551-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Client session timed out, have not heard from server in 28316ms for sessionid 0x100000050ae004b,
closing socket connection and attempting reconnect

java.lang.OutOfMemoryError: GC overhead limit exceeded

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-11-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Unable to reconnect to ZooKeeper service, session 0xff00000201970043 has expired

[Thread-7573-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Unable to reconnect to ZooKeeper service, session 0xff00000201970043 has expired, closing
socket connection

[Thread-7573-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0xff00000201970043

[zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@53181a58
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Expired
type:None path:null path: null type: None

[zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Our previous
ZooKeeper session was expired. Attempting to reconnect to recover relationship with ZooKeeper...

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] WARN org.apache.zookeeper.ClientCnxn
- Unable to reconnect to ZooKeeper service, session 0x100000050ae0049 has expired

[Thread-5234-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Unable to reconnect to ZooKeeper service, session 0x100000050ae0049 has expired, closing
socket connection

[zkCallback-11-thread-2] WARN org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection
expired - starting a new one...

[zkCallback-11-thread-2] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection,
connectString=kemp-formation-solr:2181 sessionTimeout=60000 watcher=org.apache.solr.common.cloud.ConnectionManager@53181a58

[Thread-5234-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0x100000050ae0049

[zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@7a5c701e
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Expired
type:None path:null path: null type: None

[zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.ConnectionManager - Our previous
ZooKeeper session was expired. Attempting to reconnect to recover relationship with ZooKeeper...

[zkCallback-3-thread-4] WARN org.apache.solr.common.cloud.DefaultConnectionStrategy - Connection
expired - starting a new one...

[zkCallback-3-thread-4] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection,
connectString=kemp-formation-solr:2181 sessionTimeout=60000 watcher=org.apache.solr.common.cloud.ConnectionManager@7a5c701e

[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[Thread-490] INFO org.eclipse.jetty.server.ServerConnector - Stopped ServerConnector@2a640157{HTTP/1.1}{0.0.0.0:8345}

[zkCallback-3-thread-4-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181,
sessionid = 0x2000000b80d0049, negotiated timeout = 40000

[zkCallback-11-thread-2-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181,
sessionid = 0xff00000201970045, negotiated timeout = 40000

agents process ran out of memory - shutting down

java.lang.OutOfMemoryError: GC overhead limit exceeded

agents process ran out of memory - shutting down

java.lang.OutOfMemoryError: GC overhead limit exceeded

        at java.util.HashMap.newNode(HashMap.java:1747)

        at java.util.HashMap.putVal(HashMap.java:631)

        at java.util.HashMap.put(HashMap.java:612)

        at jcifs.util.transport.Transport.sendrecv(Transport.java:66)

        at jcifs.smb.SmbTransport.send(SmbTransport.java:661)

        at jcifs.smb.SmbSession.send(SmbSession.java:238)

        at jcifs.smb.SmbTree.send(SmbTree.java:119)

        at jcifs.smb.SmbFile.send(SmbFile.java:776)

        at jcifs.smb.SmbFileInputStream.readDirect(SmbFileInputStream.java:181)

        at jcifs.smb.SmbFileInputStream.read(SmbFileInputStream.java:142)

        at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:903)

        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)

[zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager - Connection
with ZooKeeper reestablished.

[zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager - Connection with
ZooKeeper reestablished.

agents process ran out of memory - shutting down

java.lang.OutOfMemoryError: GC overhead limit exceeded

[zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected
to ZooKeeper

[zkCallback-11-thread-2] INFO org.apache.solr.common.cloud.ConnectionManager - Connected:true

[zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.DefaultConnectionStrategy - Reconnected
to ZooKeeper

[zkCallback-3-thread-4] INFO org.apache.solr.common.cloud.ConnectionManager - Connected:true

[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0046 closed

[zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@381a7557
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-21-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

[Thread-7538-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0x2000000b80d0046

agents process ran out of memory - shutting down

java.lang.OutOfMemoryError: GC overhead limit exceeded

        at java.util.regex.Matcher.<init>(Matcher.java:225)

        at java.util.regex.Pattern.matcher(Pattern.java:1093)

        at de.l3s.boilerpipe.util.UnicodeTokenizer.tokenize(UnicodeTokenizer.java:40)

        at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.flushBlock(BoilerpipeHTMLContentHandler.java:296)

        at de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.characters(BoilerpipeHTMLContentHandler.java:198)

        at org.apache.tika.parser.html.BoilerpipeContentHandler.characters(BoilerpipeContentHandler.java:155)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)

        at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)

        at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)

        at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)

        at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.xpath.MatchingContentHandler.characters(MatchingContentHandler.java:85)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)

        at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)

        at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)

        at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)

        at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)

        at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:279)

        at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:306)

        at org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator$SheetTextAsHTML.cell(XSSFExcelExtractorDecorator.java:431)

[zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@43f7378f
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-19-thread-5] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

[zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@6432608f
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-15-thread-2] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

[zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager - Watcher org.apache.solr.common.cloud.ConnectionManager@68bb3d74
name: ZooKeeperConnection Watcher:kemp-formation-solr:2181 got event WatchedEvent state:Disconnected
type:None path:null path: null type: None

[zkCallback-13-thread-3] WARN org.apache.solr.common.cloud.ConnectionManager - zkClient has
disconnected

agents process ran out of memory - shutting down

java.lang.OutOfMemoryError: GC overhead limit exceeded

        at sun.nio.cs.UTF_8.newEncoder(UTF_8.java:72)

        at java.lang.StringCoding.encode(StringCoding.java:348)

        at java.lang.String.getBytes(String.java:941)

        at org.postgresql.core.Utils.encodeUTF8(Utils.java:53)

        at org.postgresql.core.v3.QueryExecutorImpl.sendParse(QueryExecutorImpl.java:1448)

        at org.postgresql.core.v3.QueryExecutorImpl.sendOneQuery(QueryExecutorImpl.java:1777)

        at org.postgresql.core.v3.QueryExecutorImpl.sendQuery(QueryExecutorImpl.java:1354)

        at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:292)

        at org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:428)

        at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:354)

        at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:301)

        at org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:287)

        at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:264)

        at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:260)

        at org.apache.manifoldcf.core.database.Database.execute(Database.java:876)

        at org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:696)

[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0xff00000201970044 closed

[Thread-31532-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0xff00000201970044

[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[Thread-7574-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181,
sessionid = 0x100000050ae004a, negotiated timeout = 40000

[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000050ae004a closed

[Thread-7574-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0x100000050ae004a

[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Opening socket connection to server kemp-formation-solr.citya.local/192.168.37.107:2181.
Will not attempt to authenticate using SASL (unknown error)

[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Socket connection established to kemp-formation-solr.citya.local/192.168.37.107:2181, initiating
session

[Thread-7602-SendThread(kemp-formation-solr.citya.local:2181)] INFO org.apache.zookeeper.ClientCnxn
- Session establishment complete on server kemp-formation-solr.citya.local/192.168.37.107:2181,
sessionid = 0x2000000b80d0047, negotiated timeout = 40000

[Thread-490] INFO org.apache.zookeeper.ZooKeeper - Session: 0x2000000b80d0047 closed

[Thread-7602-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for
session: 0x2000000b80d0047

[Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@44d52de2{/mcf-api-service,file:/tmp/jetty-0.0.0.0-8345-mcf-api-service.war-_mcf-api-service-any-5748290590258150821.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-api-service.war}

[Thread-490] INFO org.eclipse.jetty.server.handler.ContextHandler - Stopped o.e.j.w.WebAppContext@60410cd{/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE}{/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war}
<mailto:o.e.j.w.WebAppContext@60410cd%7b/mcf-authority-service,file:/tmp/jetty-0.0.0.0-8345-mcf-authority-service.war-_mcf-authority-service-any-1380683823589504600.dir/webapp/,UNAVAILABLE%7d%7b/opt/manifoldcf-trunk/bin/./../web-proprietary/war/mcf-authority-service.war%7d>


 

 

Any idea?

Thanks.

 

 

 

De : Karl Wright [mailto:daddywri@gmail.com] 
Envoyé : mardi 24 juillet 2018 13:15
À : user@manifoldcf.apache.org
Objet : Re: Out of memory, one file bug i think

 

I've opened CONNECTORS-1516 to track the Class Not Found issue, and also created an Apache
POI bugzilla ticket, which is referenced.

 

Karl

 

 

On Tue, Jul 24, 2018 at 6:15 AM Karl Wright <daddywri@gmail.com <mailto:daddywri@gmail.com>
> wrote:

The "class not found" error looks probably like a classloader issue with Tika -- the class
is present in poi-ooxml-3.17.jar, although to be fair it might possibly be caused by an out-of-memory
condition.

You should be able to find the exception in the Simple History and figure out what document
it came from from that.  If not, then look at the log prior to the exception, and look at
what Worker Thread 1 was doing.

 

Karl

 

 

On Tue, Jul 24, 2018 at 5:58 AM msaunier <msaunier@citya.com <mailto:msaunier@citya.com>
> wrote:

Re Karl,

 

I have an Out of Memory Error today. I think I have an error with a document. I have this
WARNING before crash:

 

------------------------------------------------------------------------

 

WARN 2018-07-24T11:46:22,098 (Worker thread '1') - Tika: Tika exception extracting: TIKA-198:
Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@62980adb <mailto:org.apache.tika.parser.microsoft.OfficeParser@62980adb>


org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@62980adb
<mailto:org.apache.tika.parser.microsoft.OfficeParser@62980adb> 

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286) ~[tika-core-1.17.jar:1.17]

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[tika-core-1.17.jar:1.17]

        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) ~[tika-core-1.17.jar:1.17]

        at org.apache.manifoldcf.agents.transformation.tika.TikaParser.parse(TikaParser.java:74)
~[mcf-tika-connector.jar:?]

        at org.apache.manifoldcf.agents.transformation.tika.TikaExtractor.addOrReplaceDocumentWithException(TikaExtractor.java:235)
[mcf-tika-connector.jar:?]

        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddEntryPoint.addOrReplaceDocumentWithException(IncrementalIngester.java:3226)
[mcf-agents.jar:?]

        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineAddFanout.sendDocument(IncrementalIngester.java:3077)
[mcf-agents.jar:?]

        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester$PipelineObjectWithVersions.addOrReplaceDocumentWithException(IncrementalIngester.java:2708)
[mcf-agents.jar:?]

        at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:756)
[mcf-agents.jar:?]

        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1583)
[mcf-pull-agent.jar:?]

        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocumentWithException(WorkerThread.java:1548)
[mcf-pull-agent.jar:?]

        at org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:939)
[mcf-jcifs-connector.jar:?]

        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) [mcf-pull-agent.jar:?]

Caused by: java.io.IOException: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder

        at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:150)
~[?:?]

        at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102)
~[?:?]

       at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) ~[?:?]

        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) ~[?:?]

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?]

        ... 12 more

Caused by: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder

        at java.net.URLClassLoader.findClass(URLClassLoader.java:381) ~[?:1.8.0_171]

        at java.lang.ClassLoader.loadClass(ClassLoader.java:424) ~[?:1.8.0_171]

        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349) ~[?:1.8.0_171]

        at java.lang.ClassLoader.loadClass(ClassLoader.java:357) ~[?:1.8.0_171]

        at org.apache.poi.poifs.crypt.EncryptionInfo.getBuilder(EncryptionInfo.java:222) ~[?:?]

        at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:148)
~[?:?]

        at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:102)
~[?:?]

        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:203) ~[?:?]

        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:132) ~[?:?]

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ~[?:?]

        ... 12 more

 

I think it’s a file, because RAM allocation have a weird behavior. In one second, ManifoldCF
(or Tika) allocate +6Go RAM.

 



 

How Can I find the file?

 

Thanks,

Maxence,


Mime
View raw message