[poppler] poppler util pdftohtml

Leonard Rosenthol lrosenth at adobe.com
Fri Sep 23 04:12:28 PDT 2011


On 9/23/11 6:38 AM, "Jonathan Kew" <jfkthame at googlemail.com> wrote:
>Once you start dealing with whole paragraphs, multiple columns, table
>cells, etc, etc, things only get worse.... you may get good results for a
>limited class of documents (e.g. unidirectional LTR text, fairly simple
>block layouts), but the general problem for arbitrary PDF documents is
>MUCH harder.

Agreed 100%!

Which is why I WISH I convince more PDF production tools to generated
tagged/structured PDF!

Leonard



More information about the poppler mailing list