lucene-java-user mailing list archives

From saisantoshi <>
Subject Re: Readers for extracting textual info from pdf/doc/excel for indexing the actual content
Date Tue, 05 Feb 2013 21:17:47 GMT
I am looking at the formats supported by the newer version of Tika (1.3) and
am not sure which versions of Microsoft Office (97/2000/2010/2013) it
supports for each of the following:

Microsoft Word (also, does it support both docx and doc formats?)
Microsoft Excel (xlsx and xls)
Microsoft PowerPoint (pptx and ppt)

I would appreciate it if you could point me to a link that lists all the
supported versions for the above.
