tika-dev mailing list archives

From "GURFAN (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-1187) java.lang.OutOfMemoryError: Java heap space
Date Wed, 23 Oct 2013 06:37:42 GMT
GURFAN created TIKA-1187:
----------------------------

             Summary: java.lang.OutOfMemoryError: Java heap space
                 Key: TIKA-1187
                 URL: https://issues.apache.org/jira/browse/TIKA-1187
             Project: Tika
          Issue Type: Bug
          Components: general
    Affects Versions: 1.3
         Environment: Ubuntu 
            Reporter: GURFAN
            Priority: Critical


Hi,

While parsing content, we get the exception below from the parse method.
The file being parsed is about 1 MB.

TIKA JAR:  tika-core-1.3.jar
File size: 1 MB.

Parser parser = new AutoDetectParser();
parser.parse(is, handler, metaData, new ParseContext());


java.lang.OutOfMemoryError: Java heap space
	at java.util.Arrays.copyOf(Arrays.java:2734)
	at java.util.ArrayList.ensureCapacity(ArrayList.java:167)
	at java.util.ArrayList.add(ArrayList.java:351)
	at org.apache.fontbox.ttf.GlyfCompositeDescript.<init>(GlyfCompositeDescript.java:60)
	at org.apache.fontbox.ttf.GlyphData.initData(GlyphData.java:63)
	at org.apache.fontbox.ttf.GlyphTable.initData(GlyphTable.java:71)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTables(AbstractTTFParser.java:163)
	at org.apache.fontbox.ttf.TTFParser.parseTables(TTFParser.java:61)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:90)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:65)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at com.impetus.vajra.parser.tika.TikaParser.processContent(TikaParser.java:96)
	at com.impetus.vajra.storm.helper.TextAnalyserBoltHelper.execute(TextAnalyserBoltHelper.java:283)
	at com.impetus.vajra.storm.TextAnalyserBolt.execute(TextAnalyserBolt.java:182)
	at backtype.storm.daemon.executor$fn__4050$tuple_action_fn__4052.invoke(executor.clj:566)
	at backtype.storm.daemon.executor$mk_task_receiver$fn__3976.invoke(executor.clj:345)
	at backtype.storm.disruptor$clojure_handler$reify__1606.onEvent(disruptor.clj:43)
	at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:84)
	at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:58)
	at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62)
	at backtype.storm.daemon.executor$fn__4050$fn__4059$fn__4106.invoke(executor.clj:658)
	at backtype.storm.util$async_loop$fn__465.invoke(util.clj:377)
	at clojure.lang.AFn.run(AFn.java:24)
	at java.lang.Thread.run(Thread.java:662)
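
The trace shows the heap being exhausted while an ArrayList grows inside the composite glyph constructor. The report does not confirm the root cause. One plausible failure mode, shown here only as a hedged sketch, is a "read components until the flag says stop" loop that never sees the stop flag on truncated or malformed font data. The class `CompositeLoopSketch`, the method `readComponents`, the one-byte flag layout, and the `maxComponents` limit are all hypothetical. This is not the fontbox implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

// Illustrative only: a simplified stand-in for a composite glyph reader.
final class CompositeLoopSketch {
    // Assumed flag bit meaning "another component follows".
    static final int MORE_COMPONENTS = 0x20;

    /**
     * Reads one flag byte per component until the MORE_COMPONENTS bit is clear.
     * Fails fast on end of stream or when maxComponents is reached, instead of
     * growing the list until the heap is exhausted.
     */
    static List<Integer> readComponents(InputStream in, int maxComponents) throws IOException {
        List<Integer> components = new ArrayList<>();
        int flags;
        do {
            flags = in.read();
            if (flags < 0) {
                // InputStream.read() returns -1 at end of stream. Used unchecked,
                // -1 has every bit set, so the loop condition would stay true forever.
                throw new IOException("Unexpected end of stream in composite glyph");
            }
            if (components.size() >= maxComponents) {
                throw new IOException("More than " + maxComponents + " components");
            }
            components.add(flags);
        } while ((flags & MORE_COMPONENTS) != 0);
        return components;
    }

    public static void main(String[] args) {
        byte[] wellFormed = {0x20, 0x20, 0x00}; // two "more" flags, then a final component
        byte[] truncated = {0x20, 0x20};        // stream ends before the final component
        for (byte[] data : new byte[][] {wellFormed, truncated}) {
            try {
                List<Integer> parsed = readComponents(new ByteArrayInputStream(data), 100);
                System.out.println("parsed " + parsed.size() + " components");
            } catch (IOException e) {
                System.out.println("rejected: " + e.getMessage());
            }
        }
    }
}
```

With both guards in place, a truncated input is rejected with an IOException rather than consuming the heap. Whether this matches the actual defect in the fontbox version bundled with Tika 1.3 would need to be checked against that source and the offending file.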



--
This message was sent by Atlassian JIRA
(v6.1#6144)
