tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zachary Lee Jones (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-2366) Add image cropping functionality to TesseractOCRParser
Date Tue, 16 May 2017 14:20:04 GMT
Zachary Lee Jones created TIKA-2366:

             Summary: Add image cropping functionality to TesseractOCRParser
                 Key: TIKA-2366
                 URL: https://issues.apache.org/jira/browse/TIKA-2366
             Project: Tika
          Issue Type: Improvement
          Components: ocr
    Affects Versions: 1.14
         Environment: ImageMagick-7.0.5, Tesseract 3.0.5
            Reporter: Zachary Lee Jones
            Priority: Trivial

I am using Tika's TesseractOCRParser to read scanned pdf files. It would be nice if I could
utilize ImageMagick's crop command through the TesseractOCRParser so that document headers/footers
can be ignored.

This message was sent by Atlassian JIRA

View raw message