tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-94) Speech recognition
Date Sun, 01 Mar 2015 16:04:04 GMT

    [ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342304#comment-14342304

Chris A. Mattmann commented on TIKA-94:

I am pretty interested in coming up with a Parser that does this. It may take the same approach
as translation - relying on an external ASR service and/or if needed, allowing to "bring a
model to the table". However, feel free to close this one as you said Tyler and if I come
up with one I'll open a new issue.

> Speech recognition
> ------------------
>                 Key: TIKA-94
>                 URL: https://issues.apache.org/jira/browse/TIKA-94
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Priority: Minor
> Like OCR for image files (TIKA-93), we could try using speech recognition to extract
text content (where available) from audio (and video!) files.
> The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and comes with
a friendly license.

This message was sent by Atlassian JIRA

View raw message