tika-dev mailing list archives

From "Sara Miller (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-2288) Remove metadata within body-element in OutlookExtractor
Date Fri, 03 Mar 2017 12:29:45 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sara Miller updated TIKA-2288:
------------------------------
    Description: 
Tika's OutlookExtractor.java is not consistent with the other mail parsers.

It would be nice to get the content of the mail in the body element in the same way as the
other mail parsers.

Today, additional metadata such as sender, recipient, and attachments is also added to the body element.


Source code: https://github.com/Silobreaker/tika/blob/master/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java#L190

  was:
Tika's OutlookExtractor.java is not consistent with the other mail parsers.

It would be nice to get the content of the mail in the body element in the same way as the
other mail parsers.

Now additional metadata such as sender, recipient, and attachments is also added to the body element.


Source code: https://github.com/Silobreaker/tika/blob/master/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java#L190


> Remove metadata within body-element in OutlookExtractor
> -------------------------------------------------------
>
>                 Key: TIKA-2288
>                 URL: https://issues.apache.org/jira/browse/TIKA-2288
>             Project: Tika
>          Issue Type: Wish
>          Components: parser, server
>    Affects Versions: 1.14
>            Reporter: Sara Miller
>
> Tika's OutlookExtractor.java is not consistent with the other mail parsers.
> It would be nice to get the content of the mail in the body element in the same way as
> the other mail parsers.
> Today, additional metadata such as sender, recipient, and attachments is also added to
> the body element.
> Source code: https://github.com/Silobreaker/tika/blob/master/tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OutlookExtractor.java#L190



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
