tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2167) Image processing causes OCR to fail
Date Tue, 08 Nov 2016 14:56:58 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15647770#comment-15647770
] 

Tim Allison commented on TIKA-2167:
-----------------------------------

Ah, ok, got it.  I suspect once we fix TIKA-2169, you should be good to go.  Will work on
that this afternoon.  Thank you for opening this ticket.

> Image processing causes OCR to fail
> -----------------------------------
>
>                 Key: TIKA-2167
>                 URL: https://issues.apache.org/jira/browse/TIKA-2167
>             Project: Tika
>          Issue Type: Bug
>          Components: ocr
>    Affects Versions: 1.14
>         Environment: Mac OS X 10.11.6; Java 1.8.0_45; tesseract 3.04.01; ImageMagick
6.9.6-2
>            Reporter: Matthew Caruana Galizia
>            Priority: Critical
>              Labels: convert, image, ocr, tiff
>         Attachments: simple.tiff
>
>
> Image processing before OCR is enabled by default in the OCR configuration properties
file. Unless this is disabled, running Tika on a simple TIFF image (attached) with two clear
words fails. When image processing is disabled, it succeeds.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message