tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kai-Uwe Schmidt <...@bel-it.de>
Subject AW: MagicDetector don't work for all RFC882 message Types.
Date Thu, 11 Jul 2013 14:42:29 GMT
Where can I read how to provide a path? 

-----Urspr√ľngliche Nachricht-----
Von: Nick Burch [mailto:apache@gagravarr.org] 
Gesendet: Donnerstag, 11. Juli 2013 12:48
An: dev@tika.apache.org
Betreff: Re: MagicDetector don't work for all RFC882 message Types.

On Thu, 11 Jul 2013, Kai-Uwe Schmidt wrote:
> I am trying to use Tika to extract metadata from eml's created via 
> Novell Groupwise. By this I ran into a problem with the dedection of 
> "message/rfc822". The MagicDetector (working with the default
> tika-mimetypes.xml) compares the "match" values binary. RFC822 
> describes the header attributes are case independent (see 
> http://www.ietf.org/rfc/rfc0822.txt 3.4.7). So MIME-Version is the 
> same than Mime-Version

Best bet is to open a bug in jira, and upload a (small!) sample file that shows the problem.
We'll need to tweak the mime rules to include that case combination too. (IIRC, the mime magic
rules don't support case insensitive matching)


View raw message