ctakes-dev mailing list archives

From "Hari, Sekhar" <sekhar.h...@cgi.com>
Subject RE: Image to text conversion
Date Thu, 30 Apr 2015 05:21:22 GMT
Thanks. Let me try this; I will let you know if I need any help.

Sekhar H.

-----Original Message-----
From: Mattmann, Chris A (3980) [mailto:chris.a.mattmann@jpl.nasa.gov] 
Sent: Thursday, April 30, 2015 10:44 AM
To: dev@ctakes.apache.org; user@ctakes.apache.org
Subject: Re: Image to text conversion

What about using Apache Tika within cTAKES for this? Tika supports OCR through Tesseract:
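
For illustration, here is a minimal sketch of the image-to-text step. It makes several assumptions: Tika's OCR support works by calling the Tesseract executable, and this sketch calls that same executable directly with only the Java standard library instead of going through Tika's parser classes. It assumes a `tesseract` binary is installed and on the PATH. The class name `OcrSketch` and the helpers `buildCommand` and `imageToText` are hypothetical names, not part of cTAKES or Tika.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper class; not part of cTAKES or Tika.
final class OcrSketch {

    // Builds the command line "tesseract <image> stdout -l <language>".
    // Passing "stdout" as the output base makes Tesseract print the
    // recognized text to standard output instead of writing a file.
    static List<String> buildCommand(String imagePath, String language) {
        return List.of("tesseract", imagePath, "stdout", "-l", language);
    }

    // Runs Tesseract on one image and returns the recognized text.
    // Assumes the tesseract executable is installed and on the PATH.
    static String imageToText(String imagePath, String language)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(imagePath, language))
                // Discard diagnostics so an unread stderr pipe cannot block the process.
                .redirectError(ProcessBuilder.Redirect.DISCARD)
                .start();
        String text = new String(
                process.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("tesseract exited with code " + exitCode);
        }
        return text;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 1) {
            System.err.println("usage: java OcrSketch.java <image-file>");
            return;
        }
        // The resulting plain text could then be passed to a cTAKES pipeline.
        System.out.println(imageToText(args[0], "eng"));
    }
}
```

The returned string is ordinary plain text, so it could be fed to the usual cTAKES pipeline like any other note. Recognition quality on handwritten material may be considerably lower than on printed scans.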



Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory, Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA

-----Original Message-----
From: <Hari>, Sekhar <sekhar.hari@cgi.com>
Reply-To: "dev@ctakes.apache.org" <dev@ctakes.apache.org>
Date: Wednesday, April 29, 2015 at 10:11 PM
To: "dev@ctakes.apache.org" <dev@ctakes.apache.org>, "user@ctakes.apache.org" <user@ctakes.apache.org>
Subject: Image to text conversion

>Hello All,
>I am looking for an OCR capability in cTAKES. The requirement is to
>convert scanned image documents (e.g., scanned handwritten
>prescriptions) into text, and then apply the usual NLP pipeline to
>convert the unstructured text into structured data.
>Can cTAKES convert scanned image documents into text? If so, please
>help me understand how by sharing any documentation or videos.
>Many thanks,
>Sekhar H.
