[Poppler-bugs] [Bug 32522] Some letters are in wrong order in the output of pdftotext

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Dec 23 00:32:22 PST 2010


https://bugs.freedesktop.org/show_bug.cgi?id=32522

--- Comment #12 from Adrian Johnson <ajohnson at redneon.com> 2010-12-23 00:32:22 PST ---
(In reply to comment #9)
> sorry for the typo:
> With copy and paste it's ok. and with text extraction it gives an extra U+FFFD
> characters with some words.

The sample PDF in comment 0 has ActualText around some of the glyphs that maps
the glyphs to U+FFFD. ie

  /Span<</ActualText<FEFFFFFD>>> BDC 
  0 0.02 TD
  <0067>Tj
  EMC 

If the U+FFFD character should not appeared in extracted text, it is a problem
with the application that created the PDF.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list