tika-dev mailing list archives

From "Erik Hetzner (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1182) Out of memory exception when parsing TTF file
Date Sat, 12 Oct 2013 19:02:43 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13793443#comment-13793443 ]

Erik Hetzner commented on TIKA-1182:
------------------------------------

For what it's worth, increasing the max heap to 16 GB does not help.

{code}
java -Xmx16000m -cp .:tika-app-1.4.jar TIKA_1182 
error: array index out of bounds
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
	at org.apache.fontbox.ttf.GlyfCompositeDescript.<init>(GlyfCompositeDescript.java:59)
	at org.apache.fontbox.ttf.GlyphData.initData(GlyphData.java:63)
	at org.apache.fontbox.ttf.GlyphTable.initData(GlyphTable.java:71)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTables(AbstractTTFParser.java:163)
	at org.apache.fontbox.ttf.TTFParser.parseTables(TTFParser.java:61)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:90)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:65)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
	at TIKA_1182.main(TIKA_1182.java:19)
{code}
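
Since a larger heap does not help and the parse never seems to finish on its own, a caller may want to bound the parse externally until the parser is fixed. The following is only a minimal sketch under stated assumptions, not something from the attached TIKA_1182.java: the class name BoundedParse, the helper runWithTimeout, and the stand-in task are all hypothetical, and the real parse call would be wrapped in the Callable instead of the sleep. It uses only java.util.concurrent.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class BoundedParse {

    /**
     * Runs the task on a worker thread and waits at most the given time.
     * Returns the task's result, or throws TimeoutException if it is too slow.
     * A Throwable raised inside the task (including an OutOfMemoryError)
     * reaches the caller wrapped in an ExecutionException.
     */
    static <T> T runWithTimeout(Callable<T> task, long timeout, TimeUnit unit)
            throws InterruptedException, ExecutionException, TimeoutException {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        try {
            Future<T> future = executor.submit(task);
            try {
                return future.get(timeout, unit);
            } catch (TimeoutException e) {
                // Requests interruption of the worker. A tight CPU-bound loop
                // that never checks the interrupt flag will keep running.
                future.cancel(true);
                throw e;
            }
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical stand-in for the real parse call; it just sleeps.
        Callable<String> slowParse = new Callable<String>() {
            @Override
            public String call() throws Exception {
                Thread.sleep(10000);
                return "done";
            }
        };
        try {
            runWithTimeout(slowParse, 200, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            System.out.println("parse timed out");
        }
    }
}
```

Note that this only stops the caller from waiting. It does not cap memory use, and if the parser ignores interruption, reliably stopping it would need a separate process.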

> Out of memory exception when parsing TTF file
> ---------------------------------------------
>
>                 Key: TIKA-1182
>                 URL: https://issues.apache.org/jira/browse/TIKA-1182
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.4
>         Environment: Ubuntu
> java version "1.7.0_40"
> Java(TM) SE Runtime Environment (build 1.7.0_40-b43)
> Java HotSpot(TM) 64-Bit Server VM (build 24.0-b56, mixed mode)
>            Reporter: Erik Hetzner
>         Attachments: 16A4FF_8.ttf, TIKA_1182.java
>
>
> When parsing the attached file using tika-app-1.4.jar, CPU usage is high and it never seems to finish.
> When parsing using the attached Java code, I get an out of memory exception.
> Let me know what other information I can provide.
> Thank you!



--
This message was sent by Atlassian JIRA
(v6.1#6144)
