tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2899) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@375a26af
Date Thu, 18 Jul 2019 20:07:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888290#comment-16888290
] 

Hudson commented on TIKA-2899:
------------------------------

SUCCESS: Integrated in Jenkins build Tika-trunk #1682 (See [https://builds.apache.org/job/Tika-trunk/1682/])
TIKA-2899 -- improve robustness of list handling in the RTFParser (tallison: [https://github.com/apache/tika/commit/620134b90b72632fee486ba3aa5b25ff6b271d22])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/rtf/TextExtractor.java
* (edit) tika-core/src/test/java/org/apache/tika/TikaTest.java


> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@375a26af
> -----------------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-2899
>                 URL: https://issues.apache.org/jira/browse/TIKA-2899
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.19
>            Reporter: Pandurang
>            Assignee: Tim Allison
>            Priority: Critical
>             Fix For: 1.22
>
>         Attachments: 169910.rtf, 658656.rtf, ABC_PL_WI.rtf
>
>
> I am using Solr 8.0 by using solrnet liabrary we extracting some binary data to text.
In that case we are getting below error.
> Its working fine for 99 % documents but its failing for only 1 % docs
> Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.rtf.RTFParser@375a26af
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282)
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>  at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
>  at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:228)
>  ... 41 more



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Mime
View raw message