tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thejan Wijesinghe (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser
Date Fri, 05 May 2017 13:03:04 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15998298#comment-15998298
] 

Thejan Wijesinghe commented on TIKA-2293:
-----------------------------------------

Thank you [~tallison@mitre.org] . I'll post in the dev once I finalize the documentation for
this.

>  Tess4jOCRParser - A simpler Java version of TesseractOCRParser
> ---------------------------------------------------------------
>
>                 Key: TIKA-2293
>                 URL: https://issues.apache.org/jira/browse/TIKA-2293
>             Project: Tika
>          Issue Type: Improvement
>          Components: ocr
>            Reporter: Thejan Wijesinghe
>
> Right now, TesseractOCRParser calls tesseract and imagemagick from command line. Intention
of this new parser "Tess4jOCRParser" is to use the Tess4J API instead of the runtime.exec
way to executing tesseract out of process.  



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message