tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Simone Tripodi <simone.trip...@gmail.com>
Subject Re: HTML to PDF conversion
Date Mon, 14 Oct 2019 15:33:42 GMT
Hi Sergey,

even if a little outdated, I would like to point to an old article I
co-operated with another long time ASF member Christian Grobmeier,
about an efficient pipeline for PDF generation using APache Cocoon3
and Apache FOP.

In your case your pipeline would be HTML -> HTML Tidy -> FOP -> PDF

HTH!
Best,
~Simo

[1] https://grobmeier.solutions/create-pdf-cocoon-3-struts-2-15112011.html

http://people.apache.org/~simonetripodi/
http://www.99soft.org/

On Mon, Oct 14, 2019 at 1:39 PM Sergey Beryozkin <sberyozkin@gmail.com> wrote:
>
> Hi All
>
> I've seen a Quarkus user asking how to convert to PDF, and one of my
> colleagues pointed to
> http://www.allcolor.org/YaHPConverter/doc/org/allcolor/yahp/converter/IHtmlToPdfTransformer.html
>
> Does it make sense for Tika to offer something related to the text to PDF
> (for a start, something on top of that transformer), and then may be even
> for other formats ?
>
> Sergey

Mime
View raw message