tika-dev mailing list archives

From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-292) PDFBox is too verbose
Date Tue, 06 Jul 2010 12:49:49 GMT

    [ https://issues.apache.org/jira/browse/TIKA-292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885524#action_12885524 ]

Jukka Zitting commented on TIKA-292:
------------------------------------

I removed this workaround in revision 960889, as it's no longer needed after PDFBOX-581.

> PDFBox is too verbose
> ---------------------
>
>                 Key: TIKA-292
>                 URL: https://issues.apache.org/jira/browse/TIKA-292
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 0.5
>
>
> PDFBox 0.8 logs INFO messages for all PDF primitives that are not enabled in the respective
> PDFBox configuration. Many of these primitives are explicitly not needed for text extraction,
> so there's no point in logging so much about them.
> Until this is fixed in PDFBox, we should work around it in Tika.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

