tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Updated: (TIKA-99) Support external parser programs
Date Sat, 12 Apr 2008 00:43:07 GMT

     [ https://issues.apache.org/jira/browse/TIKA-99?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris A. Mattmann updated TIKA-99:
----------------------------------

    Component/s: parser

> Support external parser programs
> --------------------------------
>
>                 Key: TIKA-99
>                 URL: https://issues.apache.org/jira/browse/TIKA-99
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Priority: Minor
>
> There should be a parser component (like ExternalParser) that invokes an external command
line application, feeds the given document as input to the application, and returns the output
from the application as the extracted text (or xhtml) content. This would allow integration
with tools like catdoc or pdf2txt.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message