[Poppler-bugs] [Bug 67710] Tagged-PDF: LBody tag is not supported

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Sep 26 07:37:07 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=67710

--- Comment #7 from Adrian Perez de Castro <aperez at igalia.com> ---
(In reply to comment #3)
> Created attachment 83580 [details] [review]
> Add support to LBody to poppler
> 
> Fixes a typo on the previous patch.
> 
> Note: looking at the html from pdfstructtohtml I realized that listitems are
> exposed with an extra bullet point. Using pdfinfo -struct-text we have
> things like this:
> 
>       LI (block)
>         LBody
>           P (block):
>              /Placement /Block
>              /StartIndent 36
>             "•list item 1"
> 
> Not sure if the bullet point should be part of the text. Again, I hope that
> someone with more experience with the current code could reply that question.

Yes, the bullet glyph is part of the contents of the PDF. Note that the
structure tree is purely informative, and does not affect how things are
rendered. Therefore if a bullet is to be shown, it must be part of the
page command stream.

The “pdfstructtohtml” could be indeed be a bit smarter and do one of
(or a combination of both):

 - Checking the beginning of the text string and, if one of the usual
   bullet symbols is used (circle bullet, square bullet, etc), do not
   output the glyph inside the <li> elements and let the browser add
   the bullet.
 - Removing the bullet from the <li> elements using CSS, so the bullet
   glyph from the text is shown.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130926/094e8b85/attachment.html>


More information about the Poppler-bugs mailing list