[poppler] poppler util pdftohtml
Leonard Rosenthol
lrosenth at adobe.com
Fri Sep 23 04:12:28 PDT 2011
On 9/23/11 6:38 AM, "Jonathan Kew" <jfkthame at googlemail.com> wrote:
>Once you start dealing with whole paragraphs, multiple columns, table
>cells, etc, etc, things only get worse.... you may get good results for a
>limited class of documents (e.g. unidirectional LTR text, fairly simple
>block layouts), but the general problem for arbitrary PDF documents is
>MUCH harder.
Agreed 100%!
Which is why I WISH I convince more PDF production tools to generated
tagged/structured PDF!
Leonard
More information about the poppler
mailing list