tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Martijn van Groningen (JIRA)" <j...@apache.org>
Subject [jira] Created: (TIKA-586) Parsing a ms access file (*.mdb) throws an error
Date Mon, 17 Jan 2011 21:12:00 GMT
Parsing a ms access file (*.mdb) throws an error

                 Key: TIKA-586
                 URL: https://issues.apache.org/jira/browse/TIKA-586
             Project: Tika
          Issue Type: Bug
          Components: mime
    Affects Versions: 0.8
            Reporter: Martijn van Groningen
            Priority: Minor
             Fix For: 0.9

I know that parsing a ms access file (*.mdb) is not supported since there is not parser for
it, but I think it should not throw an exception. 
Currently when parsing a mdb file it is being recognized as a true font file. The TrueTypeParser
throws an parser specific error when encountering a mdb file.

Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException
from org.apache.tika.parser.font.TrueTypeParser@6906daba
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:203)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
	at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:94)
	at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:273)
	at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:80)
Caused by: java.io.IOException: Unexpected end of TTF stream reached
	at org.apache.fontbox.ttf.TTFDataStream.read(TTFDataStream.java:217)
	at org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:69)
	at org.apache.fontbox.ttf.TTFDataStream.readString(TTFDataStream.java:57)
	at org.apache.fontbox.ttf.AbstractTTFParser.readTableDirectory(AbstractTTFParser.java:214)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:85)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
	at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
	at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:63)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
	... 5 more

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message