tika-dev mailing list archives

From "Nick Burch (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-507) Parser for font files
Date Fri, 20 Jan 2012 15:59:40 GMT

    [ https://issues.apache.org/jira/browse/TIKA-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13189861#comment-13189861 ]

Nick Burch commented on TIKA-507:

Thanks for this patch, and sorry it has taken so long to get to it!

Looking at the supplied .afm files, it looks like they're copyrighted. As such, I've tweaked
the tests to use the sample .afm we already have, which was specially generated for Tika
(so no copyright problems).

Parser added (with a few tweaks) in r1233973.

Now I guess the next thing is to get some suitably licensed .pfm/.pfa/.pfb files, then look
at a parser for those!

> Parser for font files
> ---------------------
>                 Key: TIKA-507
>                 URL: https://issues.apache.org/jira/browse/TIKA-507
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>         Attachments: AdobeFontMetricParser.zip, TIKA-507.Arreola.110724.patch.txt
>
> The FontBox library used by PDFBox supports various kinds of font information files.
> These files don't typically contain much useful textual data, but they do have interesting
> metadata that should be made available also through Tika.
