tika-dev mailing list archives

From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-442) Image extractors use inconsistent metadata keys and formats for common features
Date Wed, 16 Jun 2010 21:58:26 GMT

    [ https://issues.apache.org/jira/browse/TIKA-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879532#action_12879532 ]

Nick Burch commented on TIKA-442:
---------------------------------

OK, I'll work up a patch that uses these keys, hopefully some time next week.

If you get a chance in the meantime, do please add those entries to the XMPDM class so they're ready :)

> Image extractors use inconsistent metadata keys and formats for common features
> -------------------------------------------------------------------------------
>
>                 Key: TIKA-442
>                 URL: https://issues.apache.org/jira/browse/TIKA-442
>             Project: Tika
>          Issue Type: Improvement
>          Components: metadata, parser
>    Affects Versions: 0.7
>            Reporter: Nick Burch
>            Priority: Minor
>
> Currently Tika has a number of parsers for image formats, but the way they return their data is inconsistent. For example:
> Jpeg: "Image Width" = "420 pixels", "Data Precision" = "8 bits"
> Gif: "width" = "420"
> Png: "width" = "420", "IHDR" = ".... bitDepth = 8 ....."
> Bmp: "width" = "420", "BitsPerSample" = "8 8 8"
> I think that the common keys, such as width and height, should be returned in a consistent format of key and value. If someone would like to suggest the namespace for this (maybe under XMPDM), and the short or long form (e.g. 420 vs 420 pixels), then I'm happy to work up a patch for this.
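
To make the inconsistency above concrete, here is a rough sketch of what normalising the four example outputs onto one key set might look like. It is not Tika code: the class name, the output key names ("width", "bitsPerSample") and the choice of the short numeric form are all placeholders, since the issue has not yet settled the namespace or the value format. Only the input keys and sample values are taken from the description.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Tika: maps the per-format keys quoted in
// the issue onto one shared key set with plain numeric values.
final class ImageMetadataNormalizer {
    // Placeholder key names; the real names and namespace are undecided.
    static final String WIDTH = "width";
    static final String BITS_PER_SAMPLE = "bitsPerSample";

    // First run of digits, e.g. "420" from "420 pixels" or "8" from "8 8 8".
    private static final Pattern FIRST_INT = Pattern.compile("(\\d+)");
    // Bit depth embedded in the Png "IHDR" string, e.g. "bitDepth = 8".
    private static final Pattern PNG_BIT_DEPTH =
            Pattern.compile("bitDepth\\s*=\\s*(\\d+)");

    static Map<String, String> normalize(Map<String, String> raw) {
        Map<String, String> out = new LinkedHashMap<>();
        // Width: "Image Width" (Jpeg) or "width" (Gif, Png, Bmp).
        copyFirstInt(out, WIDTH, raw.get("Image Width"), FIRST_INT);
        copyFirstInt(out, WIDTH, raw.get("width"), FIRST_INT);
        // Bit depth: "Data Precision" (Jpeg), "BitsPerSample" (Bmp),
        // or buried inside "IHDR" (Png). Taking only the first number of
        // "8 8 8" is a simplification that drops per-channel detail.
        copyFirstInt(out, BITS_PER_SAMPLE, raw.get("Data Precision"), FIRST_INT);
        copyFirstInt(out, BITS_PER_SAMPLE, raw.get("BitsPerSample"), FIRST_INT);
        copyFirstInt(out, BITS_PER_SAMPLE, raw.get("IHDR"), PNG_BIT_DEPTH);
        return out;
    }

    // Stores the first captured integer under key, unless the value is
    // missing, has no match, or the key was already filled by an earlier source.
    private static void copyFirstInt(Map<String, String> out, String key,
                                     String value, Pattern pattern) {
        if (value == null || out.containsKey(key)) {
            return;
        }
        Matcher m = pattern.matcher(value);
        if (m.find()) {
            out.put(key, m.group(1));
        }
    }
}
```

With this sketch, all four example inputs would come back with the same key and the same short-form value for width, which is the kind of consistency the issue asks for. Whether the short form or the long form ("420 pixels") is the right choice is still open.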

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

