tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1865) Save sender email address in Outlook MSG metadata
Date Wed, 01 Mar 2017 18:49:45 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15890786#comment-15890786
] 

Tim Allison commented on TIKA-1865:
-----------------------------------

[~mcaruanagalizia], I've added quite a few more Metadata keys under Office and Message for
the sender, and I've updated the MSG parser.  I still need to update the other message parsers.

I'm not thrilled with putting the MAPI specific metadata items in the Office object...perhaps
a separate class to handle them?, and I don't like the divide between MAPI and Message, but
there really are some things that are specific to MAPI but don't apply to RFC.

I added individual keys for the components of exchange addresses {{"/o=blah/ou=blah/cn=recipients/cn=actual
name"}}.

Let me know what you think.  As a side note, we just switched from Apache's git to GitHub.
 We haven't re-calibrated Jenkins so there isn't a nightly build yet.  You'll have to grab
from GitHub and build yourself for now.

> Save sender email address in Outlook MSG metadata
> -------------------------------------------------
>
>                 Key: TIKA-1865
>                 URL: https://issues.apache.org/jira/browse/TIKA-1865
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.11
>         Environment: Windows 7 x64, jre 1.8.0_60 x64
>            Reporter: Luis Filipe Nassif
>         Attachments: report.xlsx
>
>
> Sender email address is lost when extracting metadata from Outlook msg files. Currently
only sender name is extracted. That is an important information to be extracted for search
engines.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message