tika-dev mailing list archives

From "Tran Nam Quang (JIRA)" <j...@apache.org>
Subject [jira] [Issue Comment Edited] (TIKA-623) Add support for Outlook PST
Date Sat, 02 Apr 2011 16:32:05 GMT

    [ https://issues.apache.org/jira/browse/TIKA-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13015046#comment-13015046 ]

Tran Nam Quang edited comment on TIKA-623 at 4/2/11 4:30 PM:
-------------------------------------------------------------

Cool! I'll start writing the Tika parser as soon as I can. Could take a couple of days though.

Richard, I have one question regarding the API: PSTMessage has two methods, getDescriptorNodeId()
and getInternetMessageId(). Both apparently return identifiers. My question is: which one
is a unique identifier that will never change? I wouldn't want the Tika parser
to extract identifiers that are "internal-only" and not unique.

Btw, maybe it's a good idea to also clarify this in the Javadoc.

> Add support for Outlook PST
> ---------------------------
>
>                 Key: TIKA-623
>                 URL: https://issues.apache.org/jira/browse/TIKA-623
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Tran Nam Quang
>
> Hello everyone,
> As you might know, Outlook stores its mails and other stuff in a single PST file. There's
> a relatively new Java library called java-libpst for reading Outlook PST files. It is licensed
> under the LGPL and available over here: http://code.google.com/p/java-libpst/
> I have tested the library on Outlook 2000 and Outlook 2003, with good results. It would
> be great if the library could be integrated into Tika.
> Best regards
> Tran Nam Quang

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
