tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Created: (TIKA-94) Speech recognition
Date Mon, 12 Nov 2007 02:14:50 GMT
Speech recognition

                 Key: TIKA-94
                 URL: https://issues.apache.org/jira/browse/TIKA-94
             Project: Tika
          Issue Type: New Feature
            Reporter: Jukka Zitting
            Priority: Minor

Like OCR for image files (TIKA-93), we could try using speech recognition to extract text
content (where available) from audio (and video!) files.

The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and comes with a
friendly license.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message