tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bob Paulin <...@bobpaulin.com>
Subject Tika 2.0 - Replace POI IOUtils with commons-io IOUtils
Date Sun, 27 Mar 2016 14:39:45 GMT

Currently the Apache POI dependency is in several modules and it's sort 
of a beast (> 2 MB in size).   It appears many of the modules are only 
using the IOUtils library.  The big exception is the office module which 
is responsible for parsing documents. These methods appear to also exist 
in commons io which is only ~ 180 KB. Any concerns with replacing this 
POI stuff with commons-io?  Does POI offer anything above the commons-io 
functionality in IOUtils? If not I think it would be great to isolate 
the poi dependency to the office module only.

- Bob

View raw message