tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jukka Zitting <jukka.zitt...@gmail.com>
Subject Re: Microsoft Outlook (msg) files get parsed 50 times in TikaGUI
Date Wed, 04 Feb 2009 23:51:50 GMT

On Wed, Feb 4, 2009 at 12:00 PM, Jana, Kumar Raja <kjana@ptc.com> wrote:
> I was feeding various document formats to the TikaGUI tool and found
> that Microsoft Outlook (msg) files get parsed around 50 times!!!

Hmm, that's quite a lot... How does this "50 times" appear, do you get
50 copies of the message content in the extracted text output? Do you
have an example file that you could share with us?


Jukka Zitting

View raw message