[Poppler-bugs] [Bug 67710] New: Tagged-PDF: LBody tag is not supported
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Sat Aug 3 05:46:29 PDT 2013
https://bugs.freedesktop.org/show_bug.cgi?id=67710
Priority: medium
Bug ID: 67710
Assignee: poppler-bugs at lists.freedesktop.org
Summary: Tagged-PDF: LBody tag is not supported
Severity: normal
Classification: Unclassified
OS: All
Reporter: apinheiro at igalia.com
Hardware: Other
Status: NEW
Version: unspecified
Component: general
Product: poppler
Created attachment 83578
--> https://bugs.freedesktop.org/attachment.cgi?id=83578&action=edit
Dumb test that can be used to reproduce the bug
STEPS TO REPRODUCE IT:
1. Use patches on bug 64816 in order to have a tool to scan tagged pdf (note:
support to get that scanned is already on master)
2. Use one of those tools (ie: pdfinfo -struct-text) and scan the document
attached with this bug report.
EXPECTED OUTCOME:
Document properly parsed without warnings, structure and content properly
printed
ACTUAL OUTCOME:
Executing pdfinfo -struc-text (and fwiw. pdfstructhtml) prints the following
warnings:
Syntax Error: StructElem object is wrong type (LBody)
Syntax Error: StructElem object is wrong type (LBody)
Syntax Error: StructElem object is wrong type (LBody)
The text of the list items are not properly extracted/printed
EXTRA NOTES:
I already checked that the problem is not at the tools, but at the core
tagged-pdf. Specifically, with bug 64815, StructElement was added, with a
typeMap structure with all the tags supported. LBody was missing. LBody is a
valid tag, defined at page 586 of the reference (PDF32000_2008.pdf).
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130803/aba3e738/attachment-0001.html>
More information about the Poppler-bugs
mailing list