[Poppler-bugs] [Bug 67710] New: Tagged-PDF: LBody tag is not supported

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Aug 3 05:46:29 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=67710

          Priority: medium
            Bug ID: 67710
          Assignee: poppler-bugs at lists.freedesktop.org
           Summary: Tagged-PDF: LBody tag is not supported
          Severity: normal
    Classification: Unclassified
                OS: All
          Reporter: apinheiro at igalia.com
          Hardware: Other
            Status: NEW
           Version: unspecified
         Component: general
           Product: poppler

Created attachment 83578
  --> https://bugs.freedesktop.org/attachment.cgi?id=83578&action=edit
Dumb test that can be used to reproduce the bug

STEPS TO REPRODUCE IT:

1. Use patches on bug 64816 in order to have a tool to scan tagged pdf (note:
support to get that scanned is already on master)
2. Use one of those tools (ie: pdfinfo -struct-text) and scan the document
attached with this bug report.


EXPECTED OUTCOME:
Document properly parsed without warnings, structure and content properly
printed 

ACTUAL OUTCOME:
Executing pdfinfo -struc-text (and fwiw. pdfstructhtml) prints the following
warnings:
Syntax Error: StructElem object is wrong type (LBody)
Syntax Error: StructElem object is wrong type (LBody)
Syntax Error: StructElem object is wrong type (LBody)

The text of the list items are not properly extracted/printed

EXTRA NOTES:
I already checked that the problem is not at the tools, but at the core
tagged-pdf. Specifically, with bug 64815, StructElement was added, with a
typeMap structure with all the tags supported. LBody was missing. LBody is a
valid tag, defined at page 586 of the reference (PDF32000_2008.pdf).

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130803/aba3e738/attachment-0001.html>


More information about the Poppler-bugs mailing list