tika-dev mailing list archives

From doykle <...@git.apache.org>
Subject [GitHub] tika pull request: Psd parser
Date Sun, 10 Jan 2016 21:40:15 GMT
GitHub user doykle opened a pull request:

    https://github.com/apache/tika/pull/69

    Psd parser

    Stepping through the byte stream to find text layer content ran into two obstacles:
    + An undocumented 4-byte `Txt ` sequence precedes the expected `TEXT` key.
    + If the first key consumed is not `Txt TEXT`, the parser must support whichever key
      it did consume in order to reach the next one, and the size of the data each key
      identifies is not easily determined.
  
    
    This implementation instead reads the entire block of keyed data into a byte array
    and scans it for the `TEXT` key.
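    A minimal sketch of that scan, not the code in this pull request. The class and
    method names (`PsdTextScan`, `indexOf`, `extractText`) are hypothetical. It also
    assumes that `TEXT` is followed by a 4-byte big-endian count of UTF-16 code units
    and then the UTF-16BE text itself, which should be checked against the PSD
    specification before relying on it.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: brute-force scan of a keyed-data block for `TEXT` entries. */
public class PsdTextScan {
    private static final byte[] TEXT_KEY = "TEXT".getBytes(StandardCharsets.US_ASCII);

    /** Returns the index of the first occurrence of key at or after from, or -1. */
    public static int indexOf(byte[] data, byte[] key, int from) {
        for (int i = Math.max(from, 0); i <= data.length - key.length; i++) {
            int j = 0;
            while (j < key.length && data[i + j] == key[j]) {
                j++;
            }
            if (j == key.length) {
                return i;
            }
        }
        return -1;
    }

    /**
     * Collects every string that follows a `TEXT` marker.
     * ASSUMPTION: the marker is followed by a 4-byte big-endian count of
     * UTF-16 code units, then that many UTF-16BE code units.
     */
    public static List<String> extractText(byte[] block) {
        List<String> found = new ArrayList<>();
        int pos = indexOf(block, TEXT_KEY, 0);
        while (pos >= 0) {
            int lenPos = pos + TEXT_KEY.length;
            if (lenPos + 4 > block.length) {
                break; // no room left for a length field
            }
            int units = ByteBuffer.wrap(block, lenPos, 4).getInt(); // big-endian by default
            long byteLen = 2L * units;
            int start = lenPos + 4;
            if (units > 0 && start + byteLen <= block.length) {
                int end = start + (int) byteLen;
                // Drop trailing NUL code units, if any, before decoding.
                while (end - 2 >= start && block[end - 2] == 0 && block[end - 1] == 0) {
                    end -= 2;
                }
                found.add(new String(block, start, end - start, StandardCharsets.UTF_16BE));
                pos = indexOf(block, TEXT_KEY, start + (int) byteLen);
            } else {
                // Implausible length: treat this hit as a false positive and keep scanning.
                pos = indexOf(block, TEXT_KEY, pos + 1);
            }
        }
        return found;
    }
}
```

    Because the scan ignores the surrounding key structure, the four bytes `TEXT` can
    also turn up inside unrelated data. The length check rejects only some of those
    false positives, which is the trade-off of scanning rather than parsing every key.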
    
    The dead-end code is commented out, saved for an especially rainy day when we can
    add support for all the keys and do this properly. (Should I do the right thing and
    put it on a separate branch?)
    
    References:
    PSD File Specification from 2013:
    http://www.adobe.com/devnet-apps/photoshop/fileformatashtml/#50577409_pgfId-1031423
    Unofficial PSD anatomy lesson from 2014:
    https://github.com/layervault/psd.rb/wiki/Anatomy-of-a-PSD-File

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/rastech/tika psd-parser

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tika/pull/69.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #69
    
----
commit b3656b0c0f2ce2b7508086d64462ad82c9505556
Author: Kevin <kevin@meta.sc>
Date:   2016-01-10T02:02:28Z

    Extract layer names and some text fields

commit 6fee54dd41fb8cc1c5ffa51590dcf7bb45695fed
Author: Kevin <kevin@meta.sc>
Date:   2016-01-10T09:15:42Z

    Variables declared outside of loop

commit 27c65309be72de030d61ebb9991d214cddf7eba8
Author: Kevin <kevin@meta.sc>
Date:   2016-01-10T21:04:10Z

    Add layer names to existing photoshop metadata tags

----


