tika-dev mailing list archives

From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1865) Save sender email address in Outlook MSG metadata
Date Wed, 01 Mar 2017 18:49:45 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15890786#comment-15890786 ]

Tim Allison commented on TIKA-1865:

[~mcaruanagalizia], I've added quite a few more Metadata keys under Office and Message for
the sender, and I've updated the MSG parser.  I still need to update the other message parsers.

I'm not thrilled with putting the MAPI-specific metadata items in the Office object; perhaps
a separate class should handle them? I also don't like the divide between MAPI and Message, but
there really are some things that are specific to MAPI and don't apply to RFC.

I added individual keys for the components of exchange addresses {{"/o=blah/ou=blah/cn=recipients/cn=actual
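
For illustration only: a minimal sketch of how an address in the "/key=value/key=value" form quoted above could be split into its components. This is not the Tika implementation. The function name split_legacy_dn and the sample address are hypothetical, and the sketch assumes that values never contain "/".

```python
def split_legacy_dn(dn):
    """Split a "/key=value/..." address into ordered (key, value) pairs.

    Hypothetical helper for illustration; assumes no "/" inside values.
    """
    parts = []
    for segment in dn.strip().split("/"):
        if not segment:
            continue  # skip the empty piece before the leading "/"
        key, sep, value = segment.partition("=")
        if not sep:
            raise ValueError(f"segment without '=': {segment!r}")
        # Keys are lowercased so "CN" and "cn" compare equal; values are kept as-is.
        parts.append((key.strip().lower(), value.strip()))
    return parts


# Made-up sample address, not taken from the issue.
example = "/o=ExampleOrg/ou=ExampleGroup/cn=recipients/cn=jdoe"
components = split_legacy_dn(example)
print(components)
```

A list of pairs is used rather than a dict because a key such as "cn" can appear more than once, and the order of the components matters.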

Let me know what you think.  As a side note, we just switched from Apache's git to GitHub.
We haven't reconfigured Jenkins yet, so there isn't a nightly build.  For now, you'll have to
grab the source from GitHub and build it yourself.

> Save sender email address in Outlook MSG metadata
> -------------------------------------------------
>                 Key: TIKA-1865
>                 URL: https://issues.apache.org/jira/browse/TIKA-1865
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.11
>         Environment: Windows 7 x64, jre 1.8.0_60 x64
>            Reporter: Luis Filipe Nassif
>         Attachments: report.xlsx
> Sender email address is lost when extracting metadata from Outlook MSG files. Currently
> only the sender name is extracted. That is important information to extract for search
