tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Milos Kovacevic" <for.mi...@gmail.com>
Subject Parsing incomplete PDF and Office files
Date Thu, 13 Nov 2008 20:04:33 GMT

I would like to download just a few kilobytes of a PDF(doc) file and to
extract the text from it. I do not want to download the whole file and then
to parse it, just truncated first N Kbs. Is it possible with Tika or not? If
not how should I do that?

Regards, Milos

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message