tika-dev mailing list archives

From "Milos Kovacevic" <for.mi...@gmail.com>
Subject Re: Parsing incomplete PDF and Office files
Date Fri, 14 Nov 2008 07:32:59 GMT

> That's currently not possible, but AFAIK there is support for
> page-by-page streaming in PDFBox (for PDF documents that support that,
> not all of them do). It would be nice if Tika could leverage that
> functionality in PDFBox.

Could you please give an example of how to parse a PDF page by page?
Thanks, Milos
