tika-dev mailing list archives

From "Sara Miller (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-2288) Remove metadata within body-element in OutlookExtractor
Date Fri, 03 Mar 2017 12:28:45 GMT
Sara Miller created TIKA-2288:

             Summary: Remove metadata within body-element in OutlookExtractor
                 Key: TIKA-2288
                 URL: https://issues.apache.org/jira/browse/TIKA-2288
             Project: Tika
          Issue Type: Wish
          Components: parser, server
    Affects Versions: 1.14
            Reporter: Sara Miller

Tika's OutlookExtractor.java is not consistent with the other mail parsers.

It would be nice if the body element contained only the content of the mail, as it does for
the other mail parsers.

Currently, additional metadata such as the sender, recipients, and attachments is also written into the body element.

Source code: https://github.com/Silobreaker/tika/blob/master/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java#L190

