tika-dev mailing list archives

From "Chris A. Mattmann (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-593) Tika network server
Date Tue, 27 Mar 2012 15:36:28 GMT

     [ https://issues.apache.org/jira/browse/TIKA-593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated TIKA-593:

    Attachment: TIKA-593.Mattmann.032712.patch.txt

- This patch is a lot closer, but the Unpacker tests are failing. Max, how did Jersey handle
the Map<String,byte[]> that you return from UnpackerResource? I don't see any built-in Jersey
@Provider that natively knows how to serialize this data structure, nor any @Provider classes
that you have written to take care of it.
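
For context on the question above: JAX-RS lets an application register its own
MessageBodyWriter<Map<String, byte[]>> annotated with @Provider. The sketch below shows only
the serialization step that such a writer's writeTo method might perform, assuming a zip
archive is the desired wire format (the message does not say which format UnpackerResource is
meant to produce). MapZipSketch, writeZip and readZip are hypothetical names, and only JDK
classes are used, so the sketch runs without Jersey on the classpath.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of the TIKA-593 patch.
final class MapZipSketch {

    // Writes each map entry as one zip entry: key = entry name, value = entry bytes.
    // Assumption: zip is the wire format; the original message does not specify one.
    static void writeZip(Map<String, byte[]> parts, OutputStream out) throws IOException {
        ZipOutputStream zip = new ZipOutputStream(out);
        for (Map.Entry<String, byte[]> part : parts.entrySet()) {
            zip.putNextEntry(new ZipEntry(part.getKey()));
            zip.write(part.getValue());
            zip.closeEntry();
        }
        // Finish the archive but leave the caller's stream open, since a
        // JAX-RS runtime normally owns and closes the response stream.
        zip.finish();
    }

    // Reads a zip archive back into a map, used here only to check the round trip.
    static Map<String, byte[]> readZip(byte[] data) throws IOException {
        Map<String, byte[]> parts = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(data))) {
            ZipEntry entry;
            byte[] buffer = new byte[4096];
            while ((entry = zip.getNextEntry()) != null) {
                ByteArrayOutputStream content = new ByteArrayOutputStream();
                int n;
                while ((n = zip.read(buffer)) != -1) {
                    content.write(buffer, 0, n);
                }
                parts.put(entry.getName(), content.toByteArray());
            }
        }
        return parts;
    }

    public static void main(String[] args) throws IOException {
        Map<String, byte[]> parts = new LinkedHashMap<>();
        parts.put("embedded-1.txt", "hello".getBytes(StandardCharsets.UTF_8));
        parts.put("embedded-2.bin", new byte[] {1, 2, 3});

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeZip(parts, out);

        for (Map.Entry<String, byte[]> e : readZip(out.toByteArray()).entrySet()) {
            System.out.println(e.getKey() + ": " + e.getValue().length + " bytes");
        }
    }
}
```

In a real resource, the body of writeZip would sit inside the writer's writeTo method, and
isWriteable would need to check that the generic type really is Map<String, byte[]>.
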
> Tika network server
> -------------------
>                 Key: TIKA-593
>                 URL: https://issues.apache.org/jira/browse/TIKA-593
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.10
>            Reporter: Jukka Zitting
>            Assignee: Chris A. Mattmann
>             Fix For: 1.2
>         Attachments: TIKA-593.Mattmann.032612.patch.2.txt, TIKA-593.Mattmann.032612.patch.txt,
>                      TIKA-593.Mattmann.032712.patch.txt, TIKA-593_pom.diff
> It would be cool to be able to run Tika as a network service that accepts a binary document
> as input and produces the extracted content (as XHTML, text, or just metadata) as output.
> A bit like TIKA-169, but without the dependency on a servlet container.
> I'd like to be able to set up and run such a server like this:
>     $ java -jar tika-app.jar --port 1234
> We should also add a NetworkParser class that acts as a local client for such a service.
> This way a lightweight client could use the full set of Tika parsing functionality even with
> just the tika-core jar on its classpath.
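
As a rough illustration of the lightweight client idea in the quoted description, here is a
minimal sketch using only JDK classes. The issue does not specify a wire protocol, so
everything about the exchange is an assumption: the document is sent as the body of an HTTP
PUT, plain text is requested via the Accept header, and the endpoint URL is supplied by the
caller. NetworkClientSketch and extractText are hypothetical names, not the proposed
NetworkParser API.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical client sketch; protocol and endpoint are assumptions.
final class NetworkClientSketch {

    // Sends the raw document bytes to the endpoint and returns the response body as text.
    static String extractText(URL endpoint, byte[] document) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) endpoint.openConnection();
        try {
            conn.setRequestMethod("PUT");                       // assumed HTTP method
            conn.setDoOutput(true);
            conn.setFixedLengthStreamingMode(document.length);
            conn.setRequestProperty("Accept", "text/plain");    // assumed output format

            try (OutputStream out = conn.getOutputStream()) {
                out.write(document);
            }

            int status = conn.getResponseCode();
            if (status != HttpURLConnection.HTTP_OK) {
                throw new IOException("Unexpected HTTP status " + status);
            }

            try (InputStream in = conn.getInputStream()) {
                ByteArrayOutputStream body = new ByteArrayOutputStream();
                byte[] buffer = new byte[4096];
                int n;
                while ((n = in.read(buffer)) != -1) {
                    body.write(buffer, 0, n);
                }
                // Assumes the server replies in UTF-8.
                return new String(body.toByteArray(), StandardCharsets.UTF_8);
            }
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: NetworkClientSketch <endpoint-url> <file>");
            return;
        }
        byte[] document = Files.readAllBytes(Paths.get(args[1]));
        System.out.println(extractText(URI.create(args[0]).toURL(), document));
    }
}
```

The point of the design is that the client needs no parser libraries of its own; all
format-specific work stays on the server side.
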
