tika-dev mailing list archives

From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1047) Provide a JAX-RS to detect only the mediatype
Date Sun, 27 Jan 2013 19:17:12 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13563888#comment-13563888 ]

Chris A. Mattmann commented on TIKA-1047:
-----------------------------------------

All tests pass now and the build works:

{noformat}
[INFO] 
[INFO] 
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO] ------------------------------------------------------------------------
[INFO] Apache Tika parent .................................... SUCCESS [2.218s]
[INFO] Apache Tika core ...................................... SUCCESS [13.617s]
[INFO] Apache Tika parsers ................................... SUCCESS [48.548s]
[INFO] Apache Tika XMP ....................................... SUCCESS [1.497s]
[INFO] Apache Tika application ............................... SUCCESS [12.264s]
[INFO] Apache Tika OSGi bundle ............................... SUCCESS [18.735s]
[INFO] Apache Tika server .................................... SUCCESS [16.963s]
[INFO] Apache Tika ........................................... SUCCESS [0.153s]
[INFO] ------------------------------------------------------------------------
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESSFUL
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 1 minute 55 seconds
[INFO] Finished at: Sun Jan 27 11:15:01 PST 2013
[INFO] Final Memory: 99M/252M
[INFO] ------------------------------------------------------------------------
[chipotle:~/tmp/tika1.4] mattmann% 
{noformat}

                
> Provide a JAX-RS to detect only the mediatype
> ---------------------------------------------
>
>                 Key: TIKA-1047
>                 URL: https://issues.apache.org/jira/browse/TIKA-1047
>             Project: Tika
>          Issue Type: Wish
>          Components: server
>            Reporter: Raimund Merkert
>            Assignee: Chris A. Mattmann
>            Priority: Critical
>             Fix For: 1.4
>
>         Attachments: patch, TIKA-1047.rmerkert.012713.patch.txt
>
>
> Currently I can use the JAX-RS server to detect the mediatype using the /meta endpoint.
> The problem I have with this is that I need to send the entire document to get all the metadata.
>
> To detect the mediatype, only a few bytes are often necessary, so I'd like to send
> only, say, 8K or so to the server and let it tell me the mediatype.
> To accomplish this, it would be good to modify the /meta endpoint to address
> the individual fields that might be returned:
> /meta/mediatype
> /meta/author
> /meta/lastModified
> The parts currently following the path could be turned into a query parameter, which
> I think is more appropriate anyway (and easier to manipulate with tools like jQuery).
> If sufficient data is not available, I'd just return a BAD_REQUEST.
> If this would be of interest to Tika, I think I could implement this.
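
For illustration only, the client side of the "send 8K or so" idea in the description above might look roughly like the sketch below. It is not taken from the attached patch: the class name MediaTypeProbe, the method head, and the constant PREFIX_BYTES are hypothetical, and the 8 KB limit is just the figure mentioned in the description. The sketch only truncates the document; how the prefix is then sent to the server is left out.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Arrays;

// Hypothetical client-side helper; not part of Tika or the attached patch.
class MediaTypeProbe {
    // Assumed prefix size, taken from the "8K or so" figure in the description.
    static final int PREFIX_BYTES = 8 * 1024;

    // Reads at most `limit` bytes from the start of the stream.
    // Returns fewer bytes when the document is shorter than `limit`.
    static byte[] head(InputStream in, int limit) throws IOException {
        byte[] buf = new byte[limit];
        int total = 0;
        while (total < limit) {
            int n = in.read(buf, total, limit - total);
            if (n < 0) {
                break; // end of stream reached before the limit
            }
            total += n;
        }
        return Arrays.copyOf(buf, total);
    }

    public static void main(String[] args) throws IOException {
        byte[] document = new byte[20_000]; // stand-in for a large file
        byte[] prefix = head(new ByteArrayInputStream(document), PREFIX_BYTES);
        System.out.println(prefix.length); // prints 8192
    }
}
```

Only the returned prefix would then be sent as the request body, so the cost of a detection request no longer grows with the size of the document.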

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
