tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Boris Petrov (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-2730) parseToString fails for a simple mp3
Date Wed, 19 Sep 2018 10:13:00 GMT
Boris Petrov created TIKA-2730:
----------------------------------

             Summary: parseToString fails for a simple mp3
                 Key: TIKA-2730
                 URL: https://issues.apache.org/jira/browse/TIKA-2730
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.19
            Reporter: Boris Petrov
         Attachments: demo.mp3

This is a regression from 1.18. I've attached the mp3 that fails. The exception I get is:
{noformat}
org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.mp3.Mp3Parser@cefe6c6
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.tika.Tika.parseToString(Tika.java:527)
    at com.company.TextExtractor.getText(TextExtractor.java:39)

    Caused by:
    java.io.EOFException: EOF: tried to skip 361 but could only skip 247
        at org.apache.tika.parser.mp3.MpegStream.skipFrame(MpegStream.java:166)
        at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:204)
        at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        ... 5 more{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message