tika-dev mailing list archives

From Simone Tripodi <simone.trip...@gmail.com>
Subject Re: HTML to PDF conversion
Date Mon, 14 Oct 2019 15:33:42 GMT
Hi Sergey,

even if it is a little outdated, I would like to point you to an old
article [1] that I co-wrote with another long-time ASF member, Christian
Grobmeier, about an efficient pipeline for PDF generation using Apache
Cocoon 3 and Apache FOP.

In your case, the pipeline would be HTML -> HTML Tidy -> FOP -> PDF
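
To make the middle of that pipeline concrete: FOP consumes XSL-FO, so in
practice an XSLT step sits between Tidy and FOP. Below is a minimal sketch
of that step only, using nothing but the JDK's javax.xml.transform. The
class name HtmlToFo, the method toXslFo and the tiny stylesheet are
hypothetical and just for illustration; it assumes the input is already
well-formed XHTML without a DOCTYPE (i.e. what a Tidy pass would give you),
and it maps only <p> elements. The Tidy and FOP calls are left out on purpose.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

final class HtmlToFo {

    // Hypothetical stylesheet: one A4 page master, every <p> becomes an fo:block.
    // local-name() is used so that <p> matches with or without the XHTML namespace.
    static final String XSLT =
          "<xsl:stylesheet version='1.0'"
        + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
        + " xmlns:fo='http://www.w3.org/1999/XSL/Format'>"
        + "<xsl:template match='/'>"
        + "<fo:root>"
        + "<fo:layout-master-set>"
        + "<fo:simple-page-master master-name='page'"
        + " page-height='29.7cm' page-width='21cm' margin='2cm'>"
        + "<fo:region-body/>"
        + "</fo:simple-page-master>"
        + "</fo:layout-master-set>"
        + "<fo:page-sequence master-reference='page'>"
        + "<fo:flow flow-name='xsl-region-body'>"
        + "<xsl:apply-templates select='//*[local-name()=\"p\"]'/>"
        + "</fo:flow>"
        + "</fo:page-sequence>"
        + "</fo:root>"
        + "</xsl:template>"
        + "<xsl:template match='*[local-name()=\"p\"]'>"
        + "<fo:block space-after='6pt'><xsl:value-of select='.'/></fo:block>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Transforms well-formed XHTML into an XSL-FO document, returned as a string.
    static String toXslFo(String xhtml) {
        try {
            Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(XSLT)));
            StringWriter out = new StringWriter();
            transformer.transform(
                new StreamSource(new StringReader(xhtml)),
                new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            // Input that is not well-formed XML ends up here.
            throw new IllegalStateException("XHTML to XSL-FO transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(toXslFo("<html><body><p>Hello PDF</p></body></html>"));
    }
}
```

The resulting XSL-FO string is what you would then hand to FOP to render
the PDF; a real stylesheet would of course need to cover headings, lists,
tables and so on.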


[1] https://grobmeier.solutions/create-pdf-cocoon-3-struts-2-15112011.html


On Mon, Oct 14, 2019 at 1:39 PM Sergey Beryozkin <sberyozkin@gmail.com> wrote:
> Hi All
> I've seen a Quarkus user asking how to convert to PDF, and one of my
> colleagues pointed to
> http://www.allcolor.org/YaHPConverter/doc/org/allcolor/yahp/converter/IHtmlToPdfTransformer.html
> Does it make sense for Tika to offer something related to text-to-PDF
> conversion (for a start, something on top of that transformer), and then
> maybe even for other formats?
> Sergey
