tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Burch <apa...@gagravarr.org>
Subject Re: MagicDetector don't work for all RFC882 message Types.
Date Thu, 11 Jul 2013 10:48:02 GMT
On Thu, 11 Jul 2013, Kai-Uwe Schmidt wrote:
> I am trying to use Tika to extract metadata from eml's created via 
> Novell Groupwise. By this I ran into a problem with the dedection of 
> "message/rfc822". The MagicDetector (working with the default 
> tika-mimetypes.xml) compares the "match" values binary. RFC822 describes 
> the header attributes are case independent (see 
> http://www.ietf.org/rfc/rfc0822.txt 3.4.7). So MIME-Version is the same 
> than Mime-Version

Best bet is to open a bug in jira, and upload a (small!) sample file that 
shows the problem. We'll need to tweak the mime rules to include that case 
combination too. (IIRC, the mime magic rules don't support case 
insensitive matching)

Nick

Mime
View raw message