tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kai-Uwe Schmidt <...@bel-it.de>
Subject MagicDetector don't work for all RFC882 message Types.
Date Thu, 11 Jul 2013 10:04:48 GMT
Hello folks,

I am trying to use Tika to extract metadata from eml's created via Novell Groupwise. By this
I ran into  a problem with the dedection of "message/rfc822". The MagicDetector (working with
the default tika-mimetypes.xml) compares the "match" values binary. RFC822 describes the header
attributes are case independent (see http://www.ietf.org/rfc/rfc0822.txt 3.4.7). So MIME-Version
is the same than Mime-Version.

Is there a different way to get those EML's detected correctly?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message