tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Grant Ingersoll (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-169) Tika Web Service Servlet
Date Mon, 24 Nov 2008 17:33:44 GMT

    [ https://issues.apache.org/jira/browse/TIKA-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650252#action_12650252

Grant Ingersoll commented on TIKA-169:

Also, the file system traversal feature seems a bit outside the scope of Tika, though having
something like this in a contrib area might be nice.

I believe Droids (crawling) has integrated Tika already as well.  But, yeah, as optional contribs,
those make sense.  We will have lots of dependencies on extraction libraries as it is, so
I really think it makes sense to stay as lean as possible elsewhere.  Before you know it,
Tika will be a 50-100 MB download, and that will slow adoption...

> Tika Web Service Servlet
> ------------------------
>                 Key: TIKA-169
>                 URL: https://issues.apache.org/jira/browse/TIKA-169
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.2
>            Reporter: Rida Benjelloun
>            Priority: Minor
>         Attachments: tikaServlet.war
> Tika servlet, use file or directory path to build a list of XML documents. The next version
will allow file upload.
> Usage :
> //Extract document content and metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10
> //Extract metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=metadata
> //Extract document content
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=content

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message