tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matthew Caruana Galizia (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-2167) Image processing causes OCR to fail
Date Sun, 06 Nov 2016 11:51:58 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Matthew Caruana Galizia updated TIKA-2167:
------------------------------------------
    Attachment: simple.tiff

> Image processing causes OCR to fail
> -----------------------------------
>
>                 Key: TIKA-2167
>                 URL: https://issues.apache.org/jira/browse/TIKA-2167
>             Project: Tika
>          Issue Type: Bug
>          Components: ocr
>    Affects Versions: 1.14
>         Environment: Mac OS X 10.11.6; Java 1.8.0_45; tesseract 3.04.01; ImageMagick
6.9.6-2
>            Reporter: Matthew Caruana Galizia
>            Priority: Critical
>              Labels: convert, image, ocr, tiff
>         Attachments: simple.tiff
>
>
> Image processing before OCR is enabled by default in the OCR configuration properties
file. Unless this is disabled, running Tika on a simple TIFF image (attached) with two clear
words fails. When image processing is disabled, it succeeds.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message