[poppler] pdftohtml, separate CSS file

Josh Richardson jric at chegg.com
Wed Jun 22 11:14:54 PDT 2011

I was planning to make it HTML5 and XHTML5 compliant (polyglot<http://en.wikipedia.org/wiki/Polyglot_markup>).  We need some of the HTML5 features, e.g. for text rotation across platforms.  I have implemented text-rotation support btw., just trying to sort some of this formatting stuff out before submitting the patch.

Thanks for the pointers.


From: "Marc J. Driftmeyer" <mjd at reanimality.com<mailto:mjd at reanimality.com>>
Reply-To: "Marc J. Driftmeyer" <mjd at reanimality.com<mailto:mjd at reanimality.com>>
Date: Tue, 21 Jun 2011 21:14:23 -0700
To: "poppler at lists.freedesktop.org<mailto:poppler at lists.freedesktop.org>" <poppler at lists.freedesktop.org<mailto:poppler at lists.freedesktop.org>>
Subject: Re: [poppler] pdftohtml, separate CSS file

As one with 10+ years with CSS, XHTML1.x and now HTML 5 I have to ask which versions of the XHTML specification you plan on supporting.

I would assume you would target XHTML1.1 Strict and leave the notion of the XHTML 1.1 Modular alone as we've all departed on to HTML 5.

Which brings me to the question, pdftohtml should include output to HTML 5, and since it's on all platforms perhaps one should utilize the WebKit HTML 5 Parser, especially since GTK+ and Qt are all in. GTK+ is even modularizing out their work so to separate the JavaScript engine to be reusable within other GTK+ projects.

More information about the poppler mailing list